BIRTH ORDER, AUTHORITARIANISM, AND THE
PROJECTION OF SEX AND AGGRESSION


Temple University 


JAN CARL GROSSMAN AND RUSSELL EISENMAN


A. INTRODUCTION


The theoretical formulation of the authoritarian personality (1) is based 
on the defense mechanisms of repression and projection. Unacceptable sexual 
and aggressive impulses are repressed by the authoritarian. Never achieving 
integration within the personality and being threatening to the ego, these 
impulses are projected outward in such forms as exaggerated moral concern 
over sex, and hostility and prejudice toward outgroups. 

Attempts to validate empirically the hypothesis that high authoritarians 
project more than low authoritarians have met with moderate success in at 
least two reported studies. Rothstein (8) found that high authoritarians were 
more likely to rate a female in a motion picture as “sexual” when compared 
to low authoritarians. Eliasberg and Stuart (3) found a .46 correlation 
between authoritarianism and ratings of certain works of art as “porno- 
graphic.” Unpublished data, compiled by the authors, indicated that as levels 
of authoritarianism increased, amount of aggression seen in ambiguous stimuli 
also increased. It is important to note that in our work and in Rothstein’s 
study, “rating” of a certain stimulus was equated with projection. 

Another notable aspect of these studies was that only moderate success was 
obtained in correlating authoritarianism and projection. There were many 
subjects who did not fit the high authoritarian-high projection, low
authoritarian-low projection theoretical model. In our pilot work, especially, we
found that although most high authoritarians were high projectors, there 
were many low authoritarians who projected as much, if not more. This led 
us in search of a moderating variable that might account for the differences 
in projection among low authoritarian subjects, especially since it is assumed 
in the psychoanalytic theory behind the authoritarian personality that low 
authoritarians should be better able to handle and integrate sexual and
aggressive impulses into the personality structure, and have little use for the
repression-projection mechanisms. 

A possible lead in the uncovering of this moderating variable is suggested 
in a volume by Harris (5), in which he compares famous first born sons to 
famous later born sons. One of the conclusions he draws is that first born 
sons seem to be controlled by a strong internalized moral code, while later 
borns seem to subscribe to a morality relying more on the external social 
situation. Harris and Howard (6) have found that parents require more
compliance with moral injunctions and adult expectations from first borns, 
and that first borns more effectively internalize parental injunction, devel- 
oping a stronger sense of responsibility. In the psychoanalytic view (4, 9), 
it is postulated that the reason for the first borns’ stronger internalized 
morality or harsher, more demanding superego, is the stronger identification 
that comes with the resolution of the Oedipal conflict. The Oedipal situation 
is not usually as traumatic for later born children, resulting in less identifica- 
tion at its resolution. Palmer (7) gives two other possible reasons for the 
stronger identification of the first born. First, if an early born child identified 
with a younger sibling, his behavior would be viewed by the parents as 
regressive. Parents would be much more likely to prohibit or suppress this 
type of behavior than behavior resulting from parental identification. Fewer
sanctions are likely to be applied if a younger sibling identifies with an older
one, resulting in more mature behavior. A younger sibling has a greater
number of identification models to choose from, some not as strict and de- 
manding as the parents. Second, it is likely that the first born might view 
the second born as a threat to his newly won controls, thus motivating him 
to even stronger identification with parental moral values. Palmer then 
empirically demonstrated the existence of stronger first born identification, 
particularly in the area of “inhibitory-disciplinarian” behavior, behavior with 
a “distinctly authoritarian and punitive flavor.” 

In applying the above research to our original problem concerning low 
authoritarian-high projectors, it becomes possible to conceive that these high 
projectors might well be the first borns of low authoritarian families; children 
who have internalized the liberal family ideology, but who, in comparison to 
their later born liberal siblings, have identified more total [...] liberal answers to
authoritarian attitude questionnaires. In a projective test, however, the
differences between first born liberals and later born liberals might become
obvious, the first borns using projection as a mechanism because of their
moralistic orientation.

In the high authoritarian family, where every child is usually exposed to 
a punitive and repressive upbringing, the projective difference between first 
borns and later borns should be somewhat blurred, all children being molded 
into a repression-projection model, the first borns possibly being more in- 
fluenced than later borns. 

It is therefore predicted that first borns will generally employ the mecha- 
nism of projection to a greater extent than later borns, and that differences 
in projection for first borns at different levels of authoritarianism will be 
attenuated, as all first borns employ projection to some extent. Differences 
in projection for later borns at different levels of authoritarianism will be 
accentuated, as only the high authoritarians should be high projectors.


B. METHOD

1. Subjects

Ss were 239 elementary psychology students divided into the following
four groups: 41 first born males, 77 later born males, 62 first born females,
59 later born females.

2. Procedure


Ss were tested during their psychology class with a test battery consisting 
of the California F Scale as the measure of authoritarian attitudes; a
12-question “acquiescent response set” measure (2); Rorschach cards III and
VI; and two quantitative projective tests developed for this experiment, the
first being composed of 10 out-of-focus blurred photographs (BLURS) of
male and female figures in various ambiguous action poses, the second being 
11 full color reproductions of famous abstract paintings (ABSTRACTS). 

This differs from the previous projective work of Rothstein (8) and Eliasberg
and Stuart (3) in that the projective stimuli used in their experiments
had a specific “pull” for sex (e.g., nude paintings, a sexual film). The stimuli
used here were of a more ambiguous nature and included very little in the
way of specific sexual or aggressive connotations. If one were going to project
any sex or aggression in these stimuli, it would have to come mainly from
within.

In the testing situation, Ss remained anonymous. They first gave biographical
data including family size, birth order, etc., and then took the California
F Scale and the acquiescent response set measure as paper and pencil tests.
Then, one at a time, Rorschach cards III and VI were projected onto a large
screen in front of the subjects, and each was asked to record three “things” 
he saw in each card. Then each BLUR was projected onto the screen and 
each subject was asked to “rate” the amount of aggression he “saw in each
picture” on a zero to 10 scale—zero meaning no aggression, 10 meaning total 
aggression. He did this for each of the 10 BLURS and the 11 ABSTRACTS. 
The BLURS and ABSTRACTS were then presented again, and the subjects
were asked to rate for sex. Sex projection in the Rorschach was considered
the number of sexual objects seen out of the six “things” observed, and
aggression projection the number of aggressive objects. Projection in the
BLURS and ABSTRACTS for sex and aggression was defined as the total 
number of points rated for a specific set of pictures. Therefore we obtained 
a “sex BLUR score,” “aggression ABSTRACT score,” etc. To insure that 
in the rating of the BLURS and ABSTRACTS we were not dealing only 
with a “high rater set,” rather than projection, we also asked each subject 
to rate the BLURS and ABSTRACTS for “honesty,” a vague concept not 
closely related to sex and aggression. If a “high rater set” was in effect, one
would expect high correlations between “honesty” and sex and aggression 
ratings. If the correlations were low, we would discount a “high rater set” 
as a possible artifact. Ss scoring more than one standard deviation above the mean
in the acquiescent response set measure were eliminated from the study. 
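
The scoring rules described above can be illustrated with a minimal Python sketch. It is not the authors' procedure as run: the function names are hypothetical, the correlation is assumed to be a Pearson r, and the standard deviation is assumed to be the sample (n - 1) form, none of which the text specifies.

```python
from math import sqrt
from statistics import mean, stdev


def total_score(ratings):
    """Projection score for one picture set: the sum of the 0-10 ratings."""
    if any(r < 0 or r > 10 for r in ratings):
        raise ValueError("ratings must lie on the 0 to 10 scale")
    return sum(ratings)


def pearson_r(xs, ys):
    """Pearson correlation (assumed form) between two lists of scores."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def keep_subjects(acquiescence_scores):
    """Indices of Ss retained: those not above mean + 1 SD on acquiescence.

    Assumption: sample standard deviation; the text does not say which form was used.
    """
    cutoff = mean(acquiescence_scores) + stdev(acquiescence_scores)
    return [i for i, s in enumerate(acquiescence_scores) if s <= cutoff]
```

A low `pearson_r` between the “honesty” totals and the sex or aggression totals would be the pattern the authors treat as evidence against a “high rater set.”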


C. RESULTS


Correlations between sex, aggression, and “honesty” ratings showed mod- 
erately positive correlations between sex and aggression scores (+.40 to 
+.50, p’s < .001), as might be expected, but little correlation between
“honesty” and sex, or “honesty” and aggression. We therefore discount the
possibility of a “high rater set” as an artifact in this experiment.

For the male subjects in this study, the predictions were generally borne 
out. Among first born males, there were no significant differences in pro- 
jected sex or aggression for ABSTRACTS or BLURS at different levels of 
authoritarianism. However, in later borns, as authoritarianism rose, we found 
an increase in aggression BLUR projection (F = 9.37; df = 3, 73; p <
.001); an increase in sex ABSTRACT projection (F = [illegible]; df = 3, [illegible];
p < .05); an increase in Rorschach sex projection (F = 2.44; df = 3, 73;
p < .07); and a nonsignificant trend toward increases in sex BLUR and
aggression ABSTRACT projection. 
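
For readers unfamiliar with the F ratios reported here, the following Python sketch shows how a one-way analysis of variance yields F and its degrees of freedom. It rests on assumptions: the text reports only F and df, so the grouping of later born males into four authoritarianism levels is inferred from df = 3, 73 with 77 subjects, and the function name and any data passed to it are hypothetical.

```python
from statistics import mean


def one_way_f(groups):
    """Return (F, df_between, df_within) for a one-way analysis of variance.

    `groups` is a list of lists of scores, one list per authoritarianism level.
    """
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = mean(x for g in groups for x in g)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between, df_within = k - 1, n - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within
```

With four groups totalling 77 scores, the function returns df = 3, 73, matching the degrees of freedom reported for later born males.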

Among first born females, as expected, there were also no significant dif- 
ferences in projected sex or aggression for ABSTRACTS or BLURS. 


Among later born females, aggression BLUR projection increased with increases
in authoritarianism (F = 6.52; df = 3, 53; p < .001), this being
the only significant finding for later born females. 

As predicted, across all groups, first borns were greater projectors than 
later borns. They projected more aggression in BLURS (F = 3.26; df = 
1, 224; p < .07) and ABSTRACTS (F = 2.73; df = 1, 224; p < .10). 
They also projected more sex in BLURS (F = 1.72; df = 1, 224; p < .19) 
and ABSTRACTS (F = 6.67; df = 1, 224; p < .01). On BLURS, males 
projected more than females: sex BLURS (F = 3.53; df = 1, 224; p <
.05) and aggression BLURS (F = 3.25; df = 1, 224; p < .06).


D. DISCUSSION


The birth order studies utilized American subjects. Studies in other cultures 
might not replicate the findings, especially when extended families are together 
in one household. The results are consistent with the identification theory 
previously discussed. Projection was positively correlated with authoritari- 
anism only for later born males and females. These results help to explain 
why previous studies have not achieved better results in trying to link author- 
itarianism and projection. First borns may have a stronger internalized moral 
code than later borns, irrespective of their degree of authoritarianism. The
first born moral code may lead to problems in handling sexual and aggressive
impulses, with a resultant tendency to engage in projection. A strong
internalized moral code cannot necessarily be equated with authoritarianism.
As proof of this, Eisenman and Cherry (2) found that first born males
were significantly less authoritarian than later born males. 

Our findings suggest that birth order should be taken into account when- 
ever authoritarianism is being used as a research variable. As projection is 
said to be one of the main mechanisms of the authoritarian personality, there 
is a possibility that our results may be generalized to other aspects of the 
authoritarian’s personality structure. 


E. SUMMARY


It was hypothesized that first borns, because of a stronger internalized 
moral code, will project more sex and aggression into ambiguous stimuli. Ss
were 239 undergraduates. For first borns, no relationship was expected
between authoritarianism and projection, as first borns are typically high
projectors, regardless of authoritarianism. A positive relationship was predicted
between authoritarianism and projection for later borns. Later born
authoritarians should project more than nonauthoritarian later borns. The
predictions were generally confirmed, and the results were related to the theory of
identification.
SOME PERSONALITY CORRELATES OF CONTRACEPTIVE
USE AMONG UNMARRIED FEMALE
COLLEGE STUDENTS


Grand Valley State College 


JAMES R. LUNDY


A. INTRODUCTION 


In view of the increasing rate of unwanted pregnancies and illegitimate 
births (14), there is a rather surprising lack of empirical data concerning 
the dynamics of the use versus nonuse of contraceptives by unmarried females. 
This is not to imply that there is not a wealth of theorizing about the topic 
of birth control and the related subject of illegitimacy (1, 2, 3, 5, 6, 7, 12, 
15); however, what is needed is empirical testing of these hypothesized
relationships. Pohlman charges that “psychologists have done almost no research
on birth control and related topics” (11, p. 968); he adds that there are 
particular needs for investigations of the independent variables influencing 
the use of contraceptive techniques. This present research is an attempt to 
make a start in that direction; that is, this investigation is concerned with 
identifying some of the personality variables associated with the choosing to 
use or not to use contraceptives by unmarried, female college students. 

There are precedents for the approach taken in this study. Guttmacher (7) 
offered evidence to support his contention that the use or nonuse of contracep- 
tive devices in premarital sexual relations was significantly correlated with 
economic and ethnic factors. Investigating the implications of Guttmacher’s 
findings, Grinder and Schmitt (6) reported data suggesting that the extent 
of knowledge about contraceptives among teen-age females varied with the 
age at which dating started, the age of men that were dated, the frequency
of dating, and whether or not the couple had access to an apartment. However,
Angrist (1) obtained data that indicated a rather ambivalent attitude
toward knowledge about contraceptives on the part of freshmen college females;
most of his subjects had some contraceptive information, and nearly all
expressed a desire for more authoritative information, but the freshmen girls
also indicated that they tended to view contraceptive techniques as an area
that was peripheral to their central interest in sexual behavior and standards.
Reiss (12) theorized that premarital sexual permissiveness is influenced
by the freedom allowed by the courtship processes and by the degree of pre- 
marital sexual permissiveness that is acceptable in a particular social or 
cultural grouping. Christensen (2) found evidence that appears to suggest 
that culturally bound sexual permissiveness is associated with a higher inci- 
dence of premarital pregnancies. Coblinger’s (3) data support the cultural 
bound thesis, but Heltsley and Broderick (8), in their investigation of the 
relationship between religiosity and permissiveness, suggest that it is the 
variation in the quality of the stimulus (religion) that influences premarital 
sexual standards; that is, “only when religious groups specifically teach non- 
permissiveness can non-permissiveness be expected to result” (8, p. 442). 

In reporting their research that dealt explicitly with the sexual behavior 
of college students, Kaats and Davis (9) present evidence that indicates that 
the average premarital coital rate for female college students falls in the 41 
to 45 percent range. Their data also indicate that these women, who are more 
liberal than their parents with regard to sexual behavior, are nevertheless still 
adhering to a double standard in their judgment of acceptable sexual activity. 
Their subjects reported feeling that virtually all groups—peers, family, close 
friends—would disapprove of their engaging in premarital sexual intercourse; 
furthermore, they reported perceiving all groups more strongly disapproving 
of their sexual behavior than they would if the same behavior were exhibited 
by males. 

As this brief review suggests, there is little data available concerning the 
personality correlates associated with the use or nonuse of contraceptives by 
single female college students; therefore, the choice of variables used in this 
study was made on the basis of the relevance each trait could have with regard 
to choosing versus not choosing to use contraceptives. The instruments used 
to measure these variables were Rotter’s Internalizer-Externalizer Scale, Ro- 
keach’s Dogmatism Scale, and Rosenberg’s Self-Esteem Scale. 

Liverant and Scodel (10) report that internalizers show a greater tendency 
toward self-regulation with regard to objective probabilities than do ex- 
ternalizers; therefore, it can reasonably be predicted that internalizers would 
take self-protective measures when involved in premarital sexual intercourse. 
Individuals high in dogmatism are described by Rokeach (13) as being resis- 
tive to cognitions differing from their own; hence, it is unlikely that the 
highly dogmatic person would be involved in activities requiring the presence
of new cognitions, as in choosing to use contraceptive devices. Kaats and
Davis report that single women who are sexually active perceive “virtually 
all groups” (9, p. 398) as disapproving of their behavior; this suggests that 
unmarried, female college students who are actively involved in preparing
for premarital sexual intercourse (by making use of contraceptive devices) 
would reflect this negative evaluation in measures taken of their self-esteem. 
Explicitly, the hypotheses tested in this field study were that sexually active, 
unmarried, female college students who report using contraceptives will (a) 
manifest a greater degree of internalization, (b) display a smaller degree of 
dogmatism, and (c) exhibit a lower degree of self-esteem than will those 
sexually active females who do not report the use of contraceptives. 


B. METHOD


Six hundred female undergraduates enrolled in psychology classes in five 
small liberal arts colleges located in the Midwest served as subjects. Two 
colleges are located in Michigan, and one each in Illinois, Ohio, and Indiana.
The sample included freshmen, sophomores, juniors, and seniors. Each sub- 
ject anonymously completed a booklet containing a questionnaire and The 
Internalizer-Externalizer (I-E) Scale, The Dogmatism Scale, and The Self-Esteem
Scale. These booklets were administered during class time by a male
experimenter using a standardized script to give an opening statement together
with the instructions and the procedures to be followed by the subjects. The
questionnaire contained items structured so as to obtain information about 
the respondent’s sexual behavior, use of contraceptives, and certain other 
personal and demographic data. The completed booklets were divided into 
three groups: those reporting being sexually inactive (these data were not 
dealt with in this study), those reporting being sexually active and using 
contraceptives, and those reporting being sexually active but not using con- 
traceptives. 

The responses of the two sexually active groups were evaluated, and the 
proportion of the sample falling into each category was noted. In addition 
to each subject’s I-E score, dogmatism score, and self-esteem score, attention 
was also paid to demographic data, frequency of sexual intercourse, number 
of partners currently involved with, self-reported status of the respondent's 
mental health, and the reason given by the respondent for being involved in 
premarital sexual intercourse. The differences between the mean scores of the 
two groups on The I-E Scale, The Dogmatism Scale, and The Self-Esteem 
Scale were tested for significance by the t test for unequal n’s and heterogeneous
variance (4).
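
A t test for unequal n’s and heterogeneous variance can be sketched as follows. This is one common form, Welch’s approximation, offered as an assumption: the procedure in reference (4) may apply a different correction, and the function name is hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group_a, group_b):
    """Return (t, df) for two independent groups without assuming equal variances.

    Uses Welch's t statistic and the Welch-Satterthwaite degrees of freedom.
    """
    n_a, n_b = len(group_a), len(group_b)
    # Squared standard error of each group mean (sample variance divided by n).
    se2_a = variance(group_a) / n_a
    se2_b = variance(group_b) / n_b
    t = (mean(group_a) - mean(group_b)) / sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df
```

Here `group_a` and `group_b` would hold, for example, the I-E scores of the contraceptive users and the noncontraceptive users; the sign of t follows the order of the arguments.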

C. RESULTS


The analysis of the differences between the group means on The I-E Scale
and The Dogmatism Scale supported the first two hypotheses. The sexually
active, contraceptive users were significantly more internalizing than the
sexually active, noncontraceptive users (F = 2.49, t = 1.33, p < .01). The
sexually active, contraceptive users were also significantly less dogmatic than
the sexually active, noncontraceptive users (F = 3.32, t = 34.6, p < .01).
The difference between the group means on The Self-Esteem Scale failed to
reach significance at the .01 level (F = 1.12, t = 1.69, p < .10). The mean
scores of each group on the three scales were as follows: for the sexually 
active, contraceptive users—I-E Scale = 10.2, Dogmatism Scale = 142, and 
Self-Esteem Scale = 3.7; for the sexually active, noncontraceptive users— 
I-E Scale = 15.4, Dogmatism Scale = 180, and Self-Esteem Scale = 3.3. 

Of the 600 respondents, 282 or 47 percent stated that they were sexually 
active (defined as having intercourse at least once a month). This average 
premarital coital rate for college females closely approximates that reported 
by Kaats and Davis (9). The contraceptive users numbered 126 or 45 per- 
cent of the sexually active group. The average age of the sexually active, 
contraceptive user was 20.2 years, while the average age for the
noncontraceptive user was 19.6 years. This age difference is not statistically significant.
Both groups were judged to be from middle-class, socioeconomic backgrounds 
with no significant difference in reported religious orientation. 
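
The percentages in this paragraph can be checked with a short Python calculation. The counts 600, 282, and 126 come from the text; the count of noncontraceptive users is derived by subtraction and is not stated in the source, and the helper name is hypothetical.

```python
def percent(part, whole):
    """Percentage of `whole` represented by `part`, rounded to a whole number."""
    return round(100 * part / whole)


respondents = 600          # all female undergraduates sampled
sexually_active = 282      # reported intercourse at least once a month
contraceptive_users = 126  # sexually active Ss reporting contraceptive use

# Derived, not reported in the text: sexually active Ss not using contraceptives.
nonusers = sexually_active - contraceptive_users

pct_active = percent(sexually_active, respondents)         # share of the sample
pct_users = percent(contraceptive_users, sexually_active)  # share of active group
```

Both rounded values agree with the 47 percent and 45 percent given in the text.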


D. DISCUSSION


The data from this study clearly supported the first two hypotheses; that 
is, sexually active, contraceptive-using, unmarried, female college students 
were significantly more internalizing and less dogmatic than the sexually ac- 
tive, noncontraceptive users. Although not significant at the .01 level, there 
was a decided tendency for noncontraceptive users to report having a higher 
degree of self-esteem than the contraceptive users. The failure to find clear-cut
support for the third hypothesis (that noncontraceptive users will report a
higher degree of self-esteem than contraceptive users) is surprising, particularly
if it is true that today’s college female is still adhering to the double
standard as reported by Kaats and Davis (9). One would expect, given the
double standard, that the female who commits herself to the “forbidden”
activity of premarital sex by making use of contraceptive devices would feel
less positively about herself than those sexually active females who are more
“passive participants” and do not make contraceptive preparations for
premarital sex. Perhaps one explanation for the failure of the empirical data to
support this argument is the rather ubiquitous hesitancy to rate oneself too
severely on any evaluative dimension; however, the trend that was evident in
the data lends more than a little credibility to the hypothesis.

1 Certain of the data not directly related to the hypotheses being tested were
nevertheless quite interesting. The noncontraceptive group’s typical response to the
question, “How would you describe your present state of mental health,” was “very good,”
while the contraceptive users’ typical response was “average” or “in need of help.”
The nonusers typically replied “strict” or “very strict” to the question, “How did your
parents treat you when you were young?”; the contraceptive users’ characteristic
response was “permissive” or “very permissive.” The frequency of sexual intercourse
for the contraceptive users was approximately four times as great as for the
nonusers; the average number of different sexual partners during the last year was 3.1
for contraceptive users and 1.1 for nonusers. The reasons given for participation in
premarital sexual intercourse were distinctively different for the two groups: 99
percent of the noncontraceptive users said they were involved because of “being in
love”; on the other hand, only 49 percent of the contraceptive users gave this as
their reason for participation in premarital sex, while 27 percent gave “being sexually
aroused and feeling secure” and 27 percent gave “being strongly attracted to him” as
their reason.

Much of the research concerning contraceptive use by unmarried females 
has focused on the importance of cultural and social stimulus variables (2, 6, 
7, 8, 9). However, the implications of the results of this study appear to be 
in the area of personality structure and its relation to manifest behavior. 
Although necessarily tentative in nature, one conclusion suggested by this 
study is that personality variables are significantly related to the dynamics 
of contraceptive use; that is, it would appear that the presence of certain
personality traits is associated with choosing to use or not to use contraceptive
devices. The identification of other relevant personality variables, the
process by which their presence exerts its influence, and the prospects for 
making positive use of such information warrant further study. 


E. SUMMARY


A questionnaire measuring degree of sexual activity and use versus nonuse
of contraceptive devices was administered to 600 nonmarried, female college
students who were attending five small liberal arts colleges located in the
Midwest. With each questionnaire the subject responded to Rotter’s I-E
Scale, Rokeach’s Dogmatism Scale, and Rosenberg’s Self-Esteem Scale. Of 
the females (47 percent of the sample) who reported being sexually active, 
45 percent indicated that they were using contraceptive devices. As hypothe- 
sized, the sexually active, contraceptive users were significantly more in- 
ternalizing and less dogmatic than the sexually active, noncontraceptive users. 
Although not significant at the .01 level, there was the hypothesized tendency 
for the noncontraceptive users to report a higher degree of self-esteem than
the contraceptive users.
LILLY DIMITROVSKY AND MILLIE ALMY


A. INTRODUCTION 


The relationship between thought and language has long intrigued students 
of human behavior. Theories have varied widely. Only too often, however, 
they have veered in the direction of an explicit or implicit identification of 
thought with language. For example, early in the history of scientific psychol- 
ogy Wundt maintained that the key to the study of thought lay in the investi- 
gation of human linguistic accomplishments (17). Somewhat later, Watson 
and his fellow behaviorists went so far as to suggest that thought may be 
nothing but subaudible speech (16). 

Investigators who have focussed on the development of thinking and language
and on their mutual relatedness have rejected this simplistic position.
Vygotsky has pointed out that those who identify thought with speech close
the door on the problem, for if thought and speech are one and the same thing,
no relationship between them can arise (15). He asserts that a preintellectual
phase in speech development and a prelinguistic stage in thought development
can be clearly discerned in the growing child. At about the end of the second
year of life the two curves of development of thought and speech, which had
until then been separate, meet; and speech begins to serve intellect, and
thoughts begin to be spoken. The child’s intellectual growth is then contingent
upon his mastering language, the social means of thought. At first this
is done externally, and the child uses spoken language both to communicate
and to think. Finally the speech structures mastered by the child through contact
with the social environment are internalized, and with further development
become the basic structures of his thinking. The relation of word to
thought does not, however, remain static. Rather, it is a process that involves
continuous changes that may themselves be seen as development in the functional
sense. Thought is not merely expressed in words; it comes into existence
through them. Thus it will be seen that though Vygotsky begins with the 
assertion that speech and thought have separate roots, he ends with a reaffirma- 
tion of the enormous influence of the socially learned structures of speech upon 
thought. 

Bruner (3), concerned with the development of intelligent behavior in 
man, differentiates three modes of internal representation that give the human 
organism the capacity for intellectual growth. These are the enactive, or 
motoric response patterns; the iconic, or perceptual images; and the symbolic, 
or internalized language. He asserts that language not only represents experi- 
ence, but transforms it according to its own transformational rules. Once the 
child has succeeded in internalizing language as a cognitive instrument, it 
becomes possible for him to represent and systematically transform the regu- 
larities of experience with far greater flexibility and power than before. For 
example, according to Bruner, to succeed in Piaget’s conservation problems 
the child must employ some internalized verbal formula that shields him from 
the pull of perceptive appearance. Thus, again, although Bruner does not 
actually identify mature thinking with language, he does in fact move in this 
direction by making internalized speech a prerequisite for logical thinking. 

Furth (6), on the other hand, begins with a description of the surprisingly 
intelligent behavior of the profoundly deaf adult. He points out that since 
deaf children must grow up and develop intellectually without the benefit 
of conventional language, they present a unique opportunity for the study of 
important aspects of the language-thinking relationship. His extensive obser- 
vations lead him to conclude that intellectual functioning cannot depend 
basically upon language. The results of his investigations reveal that the pro- 
foundly deaf do show certain limitations in that they do poorly or fail on all 
tasks that are specifically verbal and on a few nonverbal tasks in which
linguistic habits afford a direct advantage. The deaf tend also to be somewhat
more rigid, less intellectually curious, and less well informed. Apart from 
these limitations, which can well be explained on the basis of their more re- 
stricted environments, the basic development and structure of their intelligence 
are remarkably unaffected by the absence of verbal language. Furth admits 
that some symbolization is required to set the stage for operational thinking, 
and notes that language is a ready-made conventional symbol system assimi- 
lated by the intelligence according to its structure. However, he points out 
that it does not follow from this that a relation between language per se and 
operational thinking is essential.

Piaget (9, 11) sees the sources of intellectual operations as lying in the
sensorimotor period, for it is in this period that a system of schemes is elabo- 
rated that prefigures certain aspects of the structure of classes and relations, 
and elementary forms of conservation and operative reversibility. According 
to him, language belongs to the network of complexes that constitute the sym- 
bolic function in general. It has the same roots and, at first, the same function 
as symbolic play, deferred imitation, and mental images. For the developing 
child it represents a ready-made system that is elaborated by society and that 
contains, for those who learn it, a wealth of cognitive instruments at the 
service of thought. However, language for Piaget is not a sufficient condition 
for the constitution of concrete intellectual operations; nor, according to 
Sinclair-de-Zwart’s interpretation (14), is it a necessary condition. Thus 
Piaget has quoted experiments with deaf-mute children that clearly point to 
the fact that while symbolic function is an obvious necessity for concrete 
operations, the normal acquisition of language is not. The results of studies 
with the blind, on the other hand, indicate that their performance on operational
tasks lags behind that of normal children by about four years. Their
sensorial deficit retards the constitution of sensorimotor schemes, and verbal 
acquisition cannot compensate for this retardation. Action learning is necessary 
before blind children reach an operational level comparable to that of the 
normal and the deaf (8). According to Sinclair-de-Zwart, Piaget has, how- 
ever, left open the question of the necessity of language for formal operations. 

Sinclair-de-Zwart (14) has conducted a number of experiments that have 
direct bearing on the relationship between language and intellectual operations. 
The purpose of these investigations was to determine whether the profound 
modification that occurs in the child’s thinking with the constitution of the 
first operation is paralleled by linguistic development. Children’s verbal
capacities in the domain of quantitative and dimensional terms and comparatives
were explored by asking them to describe simple situations (e.g., children
were asked to tell the difference between two pencils, a short thick one and a
long thin one). The children’s comprehension was then studied by asking them
to execute certain orders (e.g., find a pencil that is shorter but thicker than
this one). The subjects were then divided into three groups according to their
response to a Piagetian conservation task (conservers, nonconservers, and
transitional). The results indicated no differences among the three groups in
the comprehension tasks, for almost all of the children executed all of the
orders correctly. However, striking differences between the conservers and
the nonconservers were found on the description tasks. For example, the
overwhelming majority of conservers used comparatives, while 90 percent of the
nonconservers used absolute terms. These and related results led Sinclair-de-Zwart
to conclude that a distinction must be made between lexical acquisition
and the acquisition of syntactical structures. Lexical items are used or easily
learned at a preoperational level. The coordinated structures and operator-like
words (e.g., more, less) are correctly understood in simple situations at
the preoperational level, but are precisely and regularly used only with the 
advent of the first operational structure. Sinclair-de-Zwart suggests that these 
results constitute further support for Piaget’s view that language is not the 
source of logic, but is, on the contrary, structured by logic. 

The present research attempts to throw some further light on another aspect 
of the connection between speech and thought. It focusses upon the relationship 
between knowing a correct answer on a conservation task and being able to 
verbalize the reasoning upon which this answer is based, a subject that may have 
implications also for the question of the development of conscious awareness of 
mental processes in the growing child. More specifically, it is concerned with 
the time of onset of the ability to make consistent judgment indicative of con- 
servation as a factor in the interrelationship between knowing that the quantity 
of a particular material has remained unchanged even though it is transformed 
in appearance, and being able to explain why this is so. It deals also with the 
relationship between verbal IQ and the ability to make consistently correct
judgments on conservation tasks. 

The following are the questions raised: 

1. When children who begin to give consistently correct judgments on
conservation tasks in kindergarten are asked to give explanations of their
judgments, what percentage of these explanations reveals clear evidence of
conservation?

2. What are the corresponding percentages for children who begin to give 
consistently correct judgments in first grade and in second grade? 

3. Do children who begin to give consistently correct judgments on con- 
servation tasks in kindergarten differ in verbal IQ from those who begin to 
do so in first grade and in second grade? 


B. Method


The data of the present study were abstracted from a larger body of infor- 
mation gathered in an extensive longitudinal investigation of the impact of 
special instruction in science and mathematics during kindergarten and first 
grade on children’s logical thinking in second grade, and were subjected to a
different form of analysis. Only those aspects of the overall study relevant in
the present context will be discussed here. For further information on other
details of the larger research the reader is referred to the overall report (2). 


1. Subjects 


The subjects were 697 children enrolled in public school kindergarten 
classes in four suburban communities at the time that the study began. 

Only those children who remained within the public school system of each 
particular community were tested over the three-year period. No attempt was 
made to follow those who moved. As a result, by the third and last year of 
the study the number of subjects had been reduced to 432. 

In two of the four communities the curriculum included special programs 
of instruction designed to introduce the children to the basic concepts of 
science (4, 12) and mathematics (7). In a third, such instruction was given 
only in mathematics (7). In the fourth the teachers followed no prescribed 
program of instruction in these subjects. 

Kindergarten teachers in these communities teach a morning class and an 
afternoon class. While a random selection of children from both classes of each 
teacher would perhaps have been more desirable, practical considerations made 
this unfeasible. Instead, in most instances a single class taught by each teacher 
in each of the programs was selected. All of the children in these classes were 
tested except those whose parents indicated that they would move on to 
parochial school in first grade. 


2. Measures 


a. Measures of logical thinking. Tasks derived from the work of Piaget 
and used in a previous study (1) were utilized as the measures of logical 
thinking. The first two tasks involved the conservation of number, patterned 
after Piaget (10); the third involved conservation of liquid. 

Task A: During orientation in Task A the child compares eight yellow 
cubes with 11 red ones; then adds yellow cubes to make the rows the same; 
then compares the rows with one red removed, one red added, two yellows
removed, two added, and so on. For the actual testing the red blocks are 
bunched, and the child is asked, “What about now? Are there more red 
blocks or more yellow blocks, or are they the same?” The yellow blocks are
then spread apart, and the above questions are repeated. Finally the child is
asked, “Why do you think so?”

Task B: In Task B only the yellow cubes are used. The child is asked to
count them. Then they are spread, and he is asked to tell how many blocks
there are (without counting). Again they are bunched, and the question “How
many?” is repeated.

Task C: During orientation in Task C the child compares the water in 
two identical glass tumblers, indicating when one has more water than the 
other and filling them to indicate when the water is “the same.” For the testing, 
the experimenter then empties the water from one tumbler into a shallow 
glass bowl, asking “What about now? Is there more here or more here (point- 
ing), or are they the same?” The final question is “Why do you think so?” 

In summary, Task A called for two judgments following transformations 
of appearance (spreading and bunching) and one explanation; Task B in- 
volved two judgments following transformations and no explanations; and 
Task C called for one judgment and one explanation. 

b. Measure of verbal ability. The Peabody Picture Vocabulary Test— 
PPVT (5) was chosen for its ease of administration and interest for kinder- 
garten children. It appraises the child’s auditory comprehension vocabulary, 
rather than his expressive vocabulary. 


The reported alternate form reliability coefficient for age five is .73; age
six, .67; and age seven, .74. 


3. Procedure 


The children were taken out of the classroom and tested individually in 
kindergarten, again in first grade, and finally in second grade. An attempt was 
made to complete testing very early in the school year. In the majority of 
instances this was accomplished during the first two months of the fall semester. 

At the kindergarten testing the Peabody Picture Vocabulary Test was 
administered first. This was followed by the three conservation tasks. 

The conservation tests were repeated in the first and second grade inter- 
views. The PPVT was not. 

The interviewers were graduate students with some, but not extensive, 
experience in testing or working with children. They underwent fairly rigorous
training in interviewing. 


4. Scoring and Categorization of Data for Measures of Logical Thinking 

a. Questions. For each of the three conservation tasks the children’s judgments
in response to the questions “What about now? . . . more, less, or the
same?” were scored. If they answered “the same,” the response was scored
correct. If they answered “more” or “less,” the response was scored incorrect.
As noted above, five such judgments were included in the three tasks: two for
the first, two for the second, and one for the third. A child was considered
to give evidence of the ability to make consistent judgments indicative of
conservation if his responses to all five of these questions were correct.

b. Explanations. Children’s responses to the “Why do you think so?”
questions were scored separately. As noted above, the three conservation tasks 
included two such questions, one in the first conservation of number test, and 
one in the conservation of liquid test. These explanations were categorized as 
follows: 

Type a, clear evidence of conservation: The child indicates that the amount 
was the same to begin with and that what the examiner did makes no differ- 
ence, or that they merely look different, etc. 

Type b, uncertain evidence of conservation: The child asserts again that 
they are the same but offers no further explanation. 

Type c, clear evidence of nonconservation: The child’s explanation relates 
to the appearance of things, or expresses strong conviction that he knows that 
they are not the same. 

Type d, evidence regarding conservation lacking: The child makes no re- 
sponse, or his reply is unintelligible, or he talks about something else, etc. 
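
The scoring rules above can be restated as a short Python sketch. It is offered only as an illustration: the names `Explanation`, `is_consistent_conserver`, and `percent_clear` are hypothetical, and the string "the same" is assumed to be how a correct judgment is recorded.

```python
from enum import Enum


class Explanation(Enum):
    """The four explanation categories described in the text."""
    CLEAR_CONSERVATION = "a"
    UNCERTAIN = "b"
    CLEAR_NONCONSERVATION = "c"
    LACKING = "d"


def is_consistent_conserver(judgments):
    """Return True only if all five judgments are "the same".

    The five judgments are two from Task A, two from Task B, and one from
    Task C. Recording answers as strings is an assumption of this sketch.
    """
    if len(judgments) != 5:
        raise ValueError("expected five judgments across the three tasks")
    return all(j == "the same" for j in judgments)


def percent_clear(explanations):
    """Percentage of explanations categorized as type a."""
    clear = sum(1 for e in explanations if e is Explanation.CLEAR_CONSERVATION)
    return 100.0 * clear / len(explanations)
```

A child with four correct judgments and one "more" would not count as a consistent conserver under this rule, whatever explanation he gave.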

5. Scoring and Categorization of Data for Measure of Verbal Ability 

Verbal IQ scores were calculated for each child at the kindergarten level
in accordance with the instructions included in the PPVT manual. 


C. Results


Table 1 shows the number of children in each of the four programs and in 
the total group in the first year of the study, and the number remaining in 
the third, and final, year. It also presents the mean chronological age and 
Peabody Picture Vocabulary Test IQ scores at time of the first testing in
kindergarten. Of the 697 Ss included at the outset, only 432 remained to the 
completion of the research. The mean PPVT IQ for children who dropped 
from the study did not differ significantly from that of the children who 
remained. 

Table 2 shows the number and percentage of children who consistently gave 
correct judgments on the conservation items at each grade level. Figures are 
given both for those who began to conserve at a particular grade level and 
for the total group conserving at that grade level, including also those who 
started to conserve at an earlier grade level and did not reverse themselves at 
subsequent testing. As indicated, in kindergarten only 70, or 10.04 percent 
of the 697 Ss, showed consistently correct judgments. By second grade this 
figure had risen to 271, or 62.73 percent of the 432 Ss that remained to the
completion of the study. 
TABLE 1
NUMBER, CHRONOLOGICAL AGE, AND IQᵃ AT OUTSET OF STUDY OF CHILDREN
IN FOUR INSTRUCTIONAL PROGRAMS PRESENT IN FIRST YEAR
AND CHILDREN REMAINING TO COMPLETION

                                  CA (in months)       PPVT IQ
Program                    N      Mean      SD      Mean       SD
First year
  AAAS (GCMP)ᵇ            189     63.81    3.47    103.25    18.31
  SCIS (GCMP)ᶜ            159     64.97    3.56    101.09    16.97
  GCMP onlyᵈ              168     64.82    3.34    106.55    13.84
  No prescribed lessons   181     65.13    3.42    109.28    14.77
  Total                   697     64.66    3.48    105.12    16.37
Completion
  AAAS (GCMP)ᵇ             94     64.28    3.61    105.41    17.62
  SCIS (GCMP)ᶜ             79     64.42    3.67    100.66    18.24
  GCMP onlyᵈ              122     64.80    3.37    106.76    12.55
  No prescribed lessons   137     64.92    3.31    109.20    14.73
  Total                   432     64.66    3.46    106.13    15.76

ᵃ IQ is based on the Peabody Picture Vocabulary Test, Form A.
ᵇ Science program of the American Association for the Advancement of Science
and the Greater Cleveland Mathematics Program.
ᶜ Science program of the Science Curriculum Improvement Study and the Greater
Cleveland Mathematics Program.
ᵈ Greater Cleveland Mathematics Program.


It was thought possible that those children who conserved in kindergarten
might have been older than their classmates. However, the mean age at
kindergarten testing for those who conserved in kindergarten and for those who
began to do so in first grade was found to be virtually the same (65.64 months
for the former and 65.24 for the latter).

TABLE 2

                            Beginning to conserve      Total conserving
Grade           Total N        N          %               N          %
Kindergarten      697          70       10.04             70       10.04
First             518         164       31.66            205       39.58
Second            432         117       27.08            271       62.73

Table 3 gives the number and percent of explanations of various types given
by children who began to give consistently correct judgments on the conservation
tasks in kindergarten, first grade, and second grade. Only 57.9 percent
of the explanations (50 percent for the conservation of blocks and 65.7 percent
for the conservation of liquids) of those who began to make such judgments
in kindergarten can be categorized as giving clear evidence of conservation.
In all but a few instances the remainder of the explanations are unclear (types
b and d), rather than clearly incorrect. By way of contrast, 82.3 percent of
the explanations (82.3 for both the blocks and the liquids items) of those 
beginning to make such judgments in first grade gave clear evidence of con- 
servation (type a). Again, as with those beginning in kindergarten, almost all 
of the remainder of the explanations were of type b or d. The corresponding 
percentages for those beginning to give correct answers in second grade are very 
close to those for the group beginning to do so in first grade (86.3 percent 
for both the blocks and the liquids items). As indicated, the differences between
the figures for the kindergarten group and the others are highly significant.
For the conservation of blocks item, p < .001; for the conservation of
liquids, p < .005; and for both tasks together, p < .001.
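
The reported chi-square values can be approximately recovered with a short calculation. The sketch below is an illustration under an assumption the text does not state outright: each test is treated as a 3 x 2 table (grade of onset by type a versus all other explanation types), using the type a counts from Table 3 and group sizes of 70, 164, and 117 children. The function name `chi_square` is hypothetical.

```python
def chi_square(rows):
    """Pearson chi-square statistic for a table given as rows of counts."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    n = sum(row_totals)
    stat = 0.0
    for row, row_total in zip(rows, row_totals):
        for observed, col_total in zip(row, col_totals):
            expected = row_total * col_total / n
            stat += (observed - expected) ** 2 / expected
    return stat


# Each row is [type a, all other types] for kindergarten, first, second grade.
blocks = [[35, 35], [135, 29], [101, 16]]
liquids = [[46, 24], [135, 29], [101, 16]]
both = [[81, 59], [270, 58], [202, 32]]

for name, table in (("blocks", blocks), ("liquids", liquids), ("both", both)):
    print(name, round(chi_square(table), 2))
```

Under this layout the statistics come out close to the reported 37.40, 12.53, and 47.08, which suggests, without confirming, that the comparison was between type a and all other explanations across the three groups.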

Table 4 presents the percentage of acceptable (type a) explanations given 
at each grade level by children who started to make consistently correct judgments
on conservation items in kindergarten, first grade, and second grade.
The proportion of acceptable first grade explanations for those beginning to 
give correct answers in kindergarten is almost identical with that of those who 
began to do so in first grade. 

When the percentages of acceptable second grade explanations of the three
groups, those starting to answer conservation items correctly in kindergarten,
first grade, and second grade are compared, the figures for the first two are
almost identical, while those for the third group are slightly lower. However,
chi square analysis shows these differences to be insignificant.

TABLE 3
NUMBER AND PERCENT OF EXPLANATIONS OF VARIOUS TYPES FOR CONSERVATION OF
BLOCKS AND LIQUIDS GIVEN BY CHILDREN BEGINNING TO MAKE CONSISTENTLY
CORRECT JUDGMENTS ON CONSERVATION ITEMS IN KINDERGARTEN,
FIRST GRADE, AND SECOND GRADE

                                    Explanationᵃ
                     Type a          Types b and d        Type c
Grade              N       %          N       %          N       %
Conservation of blocks
  Kindergarten     35    50.00*       32    45.71        3     4.29
  First           135    82.32        28    17.07        1     0.61
  Second          101    86.32        14    11.97        2     1.71
Conservation of liquids
  Kindergarten     46    65.71**      24    34.29        0     0.00
  First           135    82.32        29    17.68        0     0.00
  Second          101    86.32        16    13.68        0     0.00
Conservation of blocks and liquids
  Kindergarten     81    57.86***     56    40.00        3     2.14
  First           270    82.32        57    17.38        1     0.30
  Second          202    86.32        30    12.82        2     0.85

ᵃ Explanation types are as follows: type a, clear evidence of conservation; type b,
uncertain evidence of conservation; type c, clear evidence of nonconservation; type d,
evidence regarding conservation lacking. For fuller details, see text.
* Chi square = 37.40; p < .001.
** Chi square = 12.53; p < .005.
*** Chi square = 47.08; p < .001.

It seemed possible that the relatively high proportion of unacceptable ex- 
planations given by the group who began to conserve in kindergarten might 
well have been due to the contributions of those children who later reversed 
themselves and did not continue to be consistently correct in their answers. 
A reversal was here defined as a single incorrect judgment at the subsequent 
testing. Analysis did indeed seem to reveal a somewhat higher reversal rate 
for those who began to conserve in kindergarten. Thus, of the 70 in the
kindergarten group, 15, or 21.4 percent, gave one incorrect answer when tested in
first grade. Of the 135 in the first grade group, only 22, or 12.7 percent, made 
an error in judgment at the second grade testing. Accordingly, it was decided 
to reanalyze the data to take this into account. Table 5 presents the number 
and percentage of acceptable explanations given by children who began to give 
consistently correct answers on conservation items in kindergarten and in 
first grade, but excludes from the count those children who made a reversal 
in judgment when tested the following year. The resulting figures are ex- 
tremely close to those reported in Table 3. 

Mean Peabody Picture Vocabulary IQ scores were calculated for each of
the three groups. The mean IQs of the groups beginning to conserve in kindergarten
and first grade are very close, 111.44 and 110.49, respectively. The
corresponding figure for those who began to conserve in second grade is just 
a few points lower, 106.13, and coincides exactly with the mean for the 
total group that remained in the study for the three years. 


D. Discussion 


It seems clear that children are sometimes able to give consistently correct 
judgments on tests of conservation without being able to verbalize the reasoning 
underlying these judgments. The fact that the data of the present study, by 
contrast with those of the majority of conservation studies, consist of a
sampling of the responses of the same group of children over a three-year
period has allowed for an analysis of the changing relationship with age between 
the ability to judge and the ability to explain. The results indicate that children 
who respond correctly to conservation questions early in kindergarten are 
significantly less likely to be able to accompany these responses with acceptable 
explanations than are children who begin to respond correctly to such questions 
in first or second grade. The consistently correct judgments of these early
“conservers” would appear to attest to their clear thinking and reasoning,
yet in a sizable proportion of instances they either gave unclear explanations
or no explanations at all. Thus, for example, one five-year-old sailed through
a variety of conservation tasks with ease and sureness but, when asked “How
can you tell?” repeatedly said only, “It just is.”

TABLE 5
NUMBER AND PERCENT OF ACCEPTABLE EXPLANATIONS GIVEN BY CHILDREN WHO BEGAN
TO MAKE CONSISTENTLY CORRECT JUDGMENTS ON CONSERVATION ITEMS IN
KINDERGARTEN AND FIRST GRADE AND MAINTAINED THESE JUDGMENTS
INTO THE FOLLOWING GRADE

                                     Acceptable responses
               Number of    Conservation     Conservation     Blocks and
               children     of blocks        of liquids       liquids
Grade                        N       %        N       %        N       %
Kindergarten       55        28    50.91      34    61.82      62    56.36
First             142       118    83.09     116    81.69     234    82.39

It has been pointed out that the intertwining between thought and language 
varies with age (15, 18). One might add, parenthetically, that this interrela- 
tionship may also vary with the individual and the subject matter. Thus 
children may at times have pseudo concepts, in which they have the language 
without a real understanding of the concept. On the other hand, it is likely 
that they may at times also have an understanding of a concept without being 
able actively to muster and structure the language to explain this concept. 

Vygotsky has suggested that the development of the child’s spontaneous 
concepts proceeds upward, and the development of his scientific concepts down- 
ward. Thus, according to him, the child’s ability to define his spontaneous 
concepts in words and to operate with them at conscious will appears long after 
he has these concepts. The development of scientific, or what might best be 
designated school taught concepts, on the other hand, usually begins with a 
verbal definition and its use in nonspontaneous operation. However, Vygotsky 
emphasizes that the two processes are intertwined, for a spontaneous concept 
must have reached a certain level in order for the child to be able to absorb 
a related scientific concept. 

It has also been proposed that when children are in a particular transitional
Piagetian stage, increased exposure to language may well be the stimulus that
propels them forward. This would be in accordance with the view that cognitive
structures, while not determined by language, are consistently reinforced
by the syntactical structure of language (9). It may well be that those children
in the present study who achieved conservation (as here defined) by the
beginning of kindergarten did so spontaneously, and hence their greater
difficulty in explaining their thinking. For those who began to conserve in
first grade, the increased exposure to language, coupled with formal or informal
instruction in science and mathematics during the kindergarten year, may
have helped them achieve conservation and, at the same time, provided the
language for correct explanations. Thus, for one group the conscious use of
language in this context often lagged behind the spontaneously achieved concepts,
while for the others language may have been a tool that helped them
achieve these concepts once they were ready for them.

It should be added, too, that the differences found may in part reflect, more
generally, the overall development of language and of the ability to explain
oneself that, one hopes, results from the year of socialization in kindergarten.


E. Summary 


A large number of children were tested on three conservation tasks in
kindergarten, first grade, and second grade, successively. These three tasks
called for five judgments and two explanations. All children were also
administered the Peabody Picture Vocabulary Test in kindergarten. The
explanations given by children whose judgments were consistently correct on
all of the conservation tasks in kindergarten were compared with those given
by children who began to be consistently correct in their answers in first grade
and in second grade. The results indicated that children who began to conserve
in kindergarten accompanied their answers with clearly correct explanations
significantly less often than did children who began to conserve in first
grade and in second grade. The percentage of clearly correct explanations
given at first grade testing was almost the same for those who began to conserve
in kindergarten and those who began to do so in first grade. Mean verbal IQ
in kindergarten was also almost identical for kindergarten and first grade
conservers. These results were discussed in terms of their implications for the
relationship between language and thought.
THE EFFECT OF VARYING REWARD SYSTEMS
ON COOPERATIVE GAME BEHAVIOR* 


Brigham Young University 
Kay H. Smith


A. Introduction

Perhaps the classic study in the area of competitive or cooperative group 
behavior is that of Deutsch (2), who divided a class of American under- 
graduates into several small groups, promising the members of half the groups 
the same grade depending on the success of the group as a whole, and inform- 
ing the others that their individual behavior would be judged and graded even 
though they were acting as members of a group. The results of this study 
seemed definitely to favor the cooperative situation in terms of productivity 
and human relations. 

As Cartwright and Zander (1) point out, these reward systems are only 
two of many possible ones. They list the following five, although several others 
could be conceived as well: 

Rule 1. Each member’s pay is independent of group output and his score. 

Rule 2. Each member’s pay is independent of group output but contingent
on his score. (a) Total payoff to the group is fixed. (b) Total payoff to the
group is not fixed.

Rule 3. Each member’s pay is contingent upon group output but indepen- 
dent of his score. 

Rule 4. Each member’s pay is contingent upon group output and his score.

Deutsch’s study made use of reward conditions that were most nearly like 
those of Rule 2b and Rule 3. In life, however, it would seem that for many 
groups, conditions more similar to Rule 4 obtain. The present study was con- 
ducted because of an interest in the question of how the disruptive effects of 
individual competition, as found by Deutsch, interfere with or are modified by 
the desirable effects of intergroup competition. 


B. Method
1. Experimental Treatments 
In the present study, 24 groups of five men each played a commodity buy- 
ing game. Eight groups played under each of three reward systems, which most 
closely paralleled Cartwright and Zander’s Rules 2a, 3, and 4. The reward
instructions were as follows: 

a. Group cooperative (A). Your group is competing with 16 other groups. 
If your total winnings are higher than the total winnings of the other groups, 
you will receive $10, which will be divided equally among you. 

b. Group competitive (B). Your group is competing with 16 other groups. 
If your total winnings are higher than the total winnings of the other groups, 
you will receive $10, which will be divided and distributed to each of you in 
proportion to your contribution to your team’s score.

c. Individual competitive (C). You are each competing with each other 
and with players from several other groups. The individual among you who
wins the most money in the game will be given a prize of $5. 
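
As an illustration only, the three reward instructions can be sketched as payout rules in Python. The function names are hypothetical, and the sketch covers only how a prize is divided once it has been won; it assumes non-negative individual winnings in the proportional case, which the instructions do not guarantee, and it does not address ties.

```python
def split_group_prize(winnings, prize=10.0, proportional=False):
    """Divide a group prize among members.

    proportional=False gives equal shares (treatment A).
    proportional=True gives shares in proportion to each member's
    contribution to the team score (treatment B).
    """
    if not proportional:
        return [prize / len(winnings)] * len(winnings)
    total = sum(winnings)
    if total <= 0 or min(winnings) < 0:
        raise ValueError("proportional split assumes non-negative winnings")
    return [prize * w / total for w in winnings]


def individual_prize(winnings, prize=5.0):
    """Treatment C: the member with the highest winnings takes the prize.

    On a tie the first such member is chosen, an arbitrary choice made
    for this sketch.
    """
    best = max(range(len(winnings)), key=lambda i: winnings[i])
    return [prize if i == best else 0.0 for i in range(len(winnings))]
```

For a five-man group with winnings of 100, 100, 200, 300, and 300, treatment A pays each member $2, while treatment B pays $1, $1, $2, $3, and $3.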


2. Subjects 


Subjects were male volunteers from a variety of undergraduate classes at 
Brigham Young University, including Introductory Psychology, Sociology, 
Child Development and Family Relations, Religion (which is required of all 
students), and others. The assignment of subjects to groups was not random, 
but was accomplished on the fortuitous basis of time available to the subject. 
The groups were assigned randomly to experimental treatments. 


3. Procedure 


a. The Commodity Game. The game that was used in this study was 
specially constructed for this purpose. It involved the purchase of five com- 
modities with various prices, rates of return, and risks of losses. The com- 
modities were to be purchased and then sold on any one of 50 selling days. 
The days were predetermined to be winning or losing days; that is, days on 
which the commodity was selling for more than its cost or for less than its 
cost by 40%, respectively. The commodities, their costs, rate of return, and 
risk of loss were as follows:

Rye, $100, 20%, five days were losing days.

Oats, $200, 40%, 10 days were losing days.

Corn, $300, 60%, 20 days were losing days.

Wheat, $400, 80%, 30 days were losing days.

Soybeans, $500, 100%, 40 days were losing days.
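
To make the trade-off between return and risk concrete, the following Python sketch computes the average gain from buying each commodity on a day chosen at random. It rests on a reading of the text that is an assumption: a winning day pays the listed rate of return on the full cost, and a losing day costs 40% of the full cost. The names `COMMODITIES` and `expected_gain_random_day` are hypothetical.

```python
SELLING_DAYS = 50
LOSS_PCT = 40  # assumed loss on a losing day, as a percentage of cost

# name: (cost in dollars, rate of return in percent, number of losing days)
COMMODITIES = {
    "rye": (100, 20, 5),
    "oats": (200, 40, 10),
    "corn": (300, 60, 20),
    "wheat": (400, 80, 30),
    "soybeans": (500, 100, 40),
}


def expected_gain_random_day(cost, return_pct, losing_days):
    """Average dollar gain when the selling day is picked at random."""
    winning_days = SELLING_DAYS - losing_days
    gain = cost * return_pct / 100
    loss = cost * LOSS_PCT / 100
    return (winning_days * gain - losing_days * loss) / SELLING_DAYS


for name, spec in COMMODITIES.items():
    print(name, expected_gain_random_day(*spec))
```

Under these assumptions, hit-or-miss buying is most profitable for corn and loses money on soybeans, so the costlier commodities pay off only for players who have worked out the pattern of winning days.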


The rules of the game were rather complex and were designed to introduce 
a fairly high degree of interdependence, as well as the possibility of competi- 
tion. Important rules in understanding the procedure are as follows: 
1. Buying was done in rounds, with each round beginning with competitive
bidding to determine who would act as buyer and thus be able to
choose the commodity to be purchased and the day on which it would be sold. 
Bidding began with a different player each round, and bids were made in 
multiples of $25. 

2. The purchase had to involve a commodity costing more than the amount 
of the winning bid. Thus $475 was automatically a winning bid. Such a bid 
restricted the buyer to soybeans as the commodity for purchase. Lower bids
afforded greater freedom of choice; but the greater the difference between 
the winning bid and the price of the chosen commodity, the more help from 
other players was required in order to make the buy. 

3. After the buyer had announced the commodity that he would buy and
when he would sell, the other group members each were given the opportunity
to purchase any amount of that same commodity from none through the full
price of the commodity.

4. If the amount of the winning bid plus the investments of the other
group members was equal to or greater than the price of the commodity, the
buy was made, and the buyer was paid $50 as a buyer’s fee. If the amount
was not sufficient to make the purchase, the round was cancelled and the buyer
was fined $50.

5. The date was then revealed by the experimenter to be either a winner 
or a loser, and winnings were paid or losses were collected. Once a date was 
used for a given commodity, it could not be reused for that commodity, al- 
though the same day could be used for other commodities. 

6. The winning and losing days followed simple repeating patterns on 
each commodity. If the players figured out the pattern, they could win a great
deal more than if they just bought on a hit or miss basis. 

7. The following hint was given to aid in discovering the patterns: A 
winning day on any commodity is a winning day on all commodities costing 
less, and a losing day on any commodity is a losing day on all commodities cost- 
ing more. All subjects had work sheets in front of them giving days and com- 
modities to assist them in discovering the pattern. 
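
The purchase rules (2 through 4) and the hint in rule 7 can be sketched in a few lines of Python. This is an illustration only, not the experimenters' procedure: the price list is hypothetical (the text states only that a $475 bid restricted the buyer to soybeans), and the function names are invented for this sketch.

```python
BUYER_FEE = 50  # paid to the buyer on a completed buy, fined on a cancelled round

# Hypothetical prices; only "soybeans costs more than $475" comes from the text.
PRICES = {"oats": 200, "corn": 300, "wheat": 400, "soybeans": 500}


def bid_is_valid(bid, commodity, prices=PRICES):
    """Rules 1-2: bids are multiples of $25 and must be below the commodity price."""
    return bid > 0 and bid % 25 == 0 and prices[commodity] > bid


def resolve_round(bid, investments, commodity, prices=PRICES):
    """Rule 4: the buy is made if the bid plus the others' investments covers the price.

    Returns (buy_made, change in the buyer's cash from the fee or fine).
    """
    total = bid + sum(investments)
    if total >= prices[commodity]:
        return True, BUYER_FEE
    return False, -BUYER_FEE


def implied_by_hint(commodity, is_winner, prices=PRICES):
    """Rule 7: a win carries over to cheaper commodities, a loss to costlier ones."""
    price = prices[commodity]
    if is_winner:
        return {c: True for c, p in prices.items() if p < price}
    return {c: False for c, p in prices.items() if p > price}
```

Under these assumed prices, learning that a given day is a winner on wheat tells the group that the same day is also a winner on oats and corn, which is how a single purchase could yield clues about several commodities at once.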

There were two ways of figuring out the pattern of winning days: by going 
across commodities on a given day or by discovering the pattern of repetition 
across the 50 days for one commodity. These two ways were not, of course,
independent either in accumulating clues or in applying insights in making
purchases.

At the beginning of the game, each player was given $500 in game money.
He could continue to play as long as he had sufficient money to cover his 40%
loss on any purchase he might make. In practice, no player ever went broke,
although four five-man groups ended the game with less than the $2500 they
had had collectively at the beginning. 

As the game was designed, there were essentially two ways in which group 
members could cooperate in increasing winnings. One way was to cooperate
in bidding so that a greater number of rounds could be played within the
time limit. This could be done by taking turns being buyer, assigning one 
person always to act as buyer, or some similar strategy. The game instructions 
neither suggested nor precluded such cooperation. A second way in which
cooperation could take place was in sharing insights and clues as to the nature
of the pattern of winning days, thereby maximizing winnings and minimizing
losses on each transaction.

b. Procedural Details. When the groups were assembled, the instructions 
were read to the subjects, and questions were answered. The nature of the 
reward for which they were working was then explained to them. They then 
played one practice round to clarify the rules of the game further. After the 
practice round, any remaining questions were answered before the actual game 
began. 

At this point, a 30-minute timer, which was visible to the group members, 
was started, and the game began. At the buzzer, if the group had made a 
purchase, they were paid off. If any part of the purchase process had not been 
completed, the round was cancelled.

At this point, each subject counted his cash on hand and completed the
postgame questionnaire. This was a multiple-choice form administered to
assess the degree of understanding about game rules, the subjects’ feelings 
about the game, their group, the reward, and their own success. 

At the completion of data collection, the results were tabulated, and checks 
were sent to the winning individual and members of the winning group. 


C. RESULTS

Table 1 shows the winnings and numbers of rounds played by groups in
each condition. Since the winnings (both total and per round) formed quite
skewed distributions, chi square tests were run comparing the three conditions
against a median split. Results of these tests in total and per round winnings
were identical and are given in Table 2. The chi square computed from this
table was 9.00, which has a chance probability of less than .05. The results
show significant differences among groups in both areas, with the total
cooperation groups highest in success, the individual competition groups lowest
in success, and the group with both inter- and intragroup competition in between.

TABLE 1
WINNINGS, ROUNDS PLAYED, AND WINNINGS PER ROUND

                               Median           Median number   Median winnings
Reward system                  winnings         of rounds       per round
A. Group cooperative           3,7[illegible]   [illegible]     [illegible]
B. Group competitive           [illegible]      [illegible]     [illegible]
C. Individual competitive      3,637            17              233

TABLE 2
TOTAL WINNINGS AND WINNINGS PER ROUND CONTINGENCY TABLE

                               A. Group      B. Group      C. Individual
Winnings                       cooperative   competitive   competitive     Totals
Groups above overall median    7             4             1               12
Groups below overall median    1             4             7               12
Totals                         8             8             8               24
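
The reported chi square can be recomputed from the counts in Table 2. The short Python sketch below is offered only as a check on the arithmetic; the function name is illustrative, and the closed-form tail probability used here holds only for 2 degrees of freedom.

```python
import math

# Observed counts from Table 2 (rows: above / below the overall median;
# columns: group cooperative, group competitive, individual competitive).
observed = [
    [7, 4, 1],
    [1, 4, 7],
]


def chi_square(table):
    """Pearson chi square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


statistic, df = chi_square(observed)
# With df = 2 the upper-tail probability of chi square is exactly exp(-x / 2).
p_value = math.exp(-statistic / 2)
print(f"chi square = {statistic:.2f}, df = {df}, p = {p_value:.3f}")
# prints: chi square = 9.00, df = 2, p = 0.011
```

Every expected count is 4 (12 x 8 / 24), so the statistic reduces to 2 x (9/4 + 0 + 9/4) = 9.00, in agreement with the value given in the text.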

The chi square table for number of rounds played was nearly the same as 
those for winnings. The chi square produced was 6.42, which has a probability 
less than .05. However, if we look at a parametric test shown in Table 3, we 
see that the picture is somewhat different. The mean number of rounds per
group in the group competitive condition is actually slightly higher than that
for the group cooperative condition. The high variance of the group competitive groups results
from the fact that some of these groups hit upon cooperative bidding and got 
in many rounds (the three highest round totals were achieved by groups in 
this condition), while others bid competitively to the end (one of the two 
lowest groups was in this condition). The variance was significantly different 
from the variance of the individual competitive groups, but not from the vari- 
ance of the group cooperative groups at the .05 level. 
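
The comparisons of rounds played can be approximated from the means and standard deviations reported in Table 3. The sketch below rests on two assumptions that the text does not state: that each condition contained eight groups (as the column totals of Table 2 suggest) and that a pooled-variance t test was used. The function names are illustrative.

```python
import math

N_PER_CONDITION = 8  # assumption: 24 groups split evenly across conditions

# (mean rounds played, SD of rounds played) as reported in Table 3
conditions = {
    "A": (21.0, 3.21),  # group cooperative
    "B": (21.5, 5.88),  # group competitive
    "C": (16.6, 2.62),  # individual competitive
}


def pooled_t(first, second, n=N_PER_CONDITION):
    """Independent-samples t with pooled variance and equal group sizes."""
    mean1, sd1 = conditions[first]
    mean2, sd2 = conditions[second]
    pooled_variance = (sd1 ** 2 + sd2 ** 2) / 2
    standard_error = math.sqrt(pooled_variance * 2 / n)
    return abs(mean1 - mean2) / standard_error


def variance_ratio(first, second):
    """Ratio of the two sample variances, larger variance in the numerator."""
    var1 = conditions[first][1] ** 2
    var2 = conditions[second][1] ** 2
    return max(var1, var2) / min(var1, var2)


for pair in (("A", "B"), ("B", "C"), ("A", "C")):
    print(pair, round(pooled_t(*pair), 2), round(variance_ratio(*pair), 2))
```

Under these assumptions the t values come out at roughly 0.21, 2.15, and 3.00, close to the tabled .21, 2.13, and 2.96; small discrepancies are to be expected from rounding of the published means and standard deviations. The variance ratios are about 5.0 for B versus C and about 3.4 for B versus A, which is consistent with the statement that only the former difference reached the .05 level.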


TABLE 3
PARAMETRIC TESTS OF NUMBER OF ROUNDS PLAYED

                             Mean number   SD of
                             of rounds     rounds    t tests of
Reward system                played        played    various means
A. Group cooperative         21            3.21      A vs. B: t = .21; p > .25
B. Group competitive         21.5          5.88      B vs. C: t = 2.13; p < .10
C. Individual competitive    16.6          2.62      A vs. C: t = 2.96; p < .05

The objective responses to the questionnaire were analyzed with use of chi
square tests. No significant difference could be found among the three
experimental conditions with regard to (a) how well they understood the rules at
the beginning of the game; (b) how much they enjoyed playing the game; (c)
how well they felt they performed compared with other individuals; (d) how
much enthusiasm they had for the money prize; or (e) how difficult they felt
the game was to play. No differences could be found between the group
cooperative and the group competitive groups on the other two questions.
Negative responses, however, were significantly more frequent from members of
individual competitive groups than from members of either of the group reward
conditions with regard to how much they enjoyed working with the others in
their group and how well they felt their group did compared with other
groups. The .05 level of significance was used for these tests.


D. DISCUSSION


It appears that an equally divided group reward resulted in cooperation on
both dimensions of the game: cooperative bidding and sharing of insights and
solutions. When intragroup competition was added, the average number of
rounds played did not decrease, although the variability did appear to increase.
The amount won per round was much lower, however, suggesting that this
more individualized kind of cooperation was curtailed. It should be noted
that the number of rounds played and amount won per round are not
independent, since each new round produces additional clues to solving the pattern.
Further research needs to be carried out holding number of rounds constant to
assess more clearly the effect of a reward system on sharing of insight. Very
little cooperative behavior was observed in the individual competitive situation.
This is borne out by the showing of these groups in number of rounds played
and total winnings, as well as winnings per round. These subjects also seemed
to like the other members of their groups less and to be less satisfied with the
group performance. This does not speak well for raw competition among
people working in groups.

In conclusion, the results of this stud[...]tive situation was clearly superior,
[...] reward.
E. SUMMARY


Eight five-person groups participated in a commodity buying game under
three reward conditions: A. Intergroup competition, intragroup cooperation;
B. Intergroup and intragroup competition; C. Individual competition. Two
modes of cooperation were possible: (a) bidding so that a greater number of
rounds could be played within the time limit; and (b) sharing insights and
clues as to the nature of the pattern of winning responses. As predicted, group
scores were negatively related to the amount of competition involved.
Cooperation was evidenced in condition A in both modes, while it was mainly
limited to one mode (cooperative bidding) in condition B. Little cooperation
was shown in condition C. No differences were shown among the groups with
regard to liking or understanding of the game. There was a tendency for Ss
in condition C more often than in either of the other conditions to report
negative feelings about their groups.
THE BODY SCHEMA OF NORMAL AND MENTALLY
RETARDED CHILDREN* 


Children’s Medical Center, Tulsa, Oklahoma 


Robert K. Clapp


A. INTRODUCTION


Numerous investigators have demonstrated that body schema deficits are 
positively associated with retarded intellectual functioning. In the majority 
of studies, however, the sample has consisted of children with mental ages 
(MAs) of six years or more (2, 3, 4, 6, 7, 9, 11, 12). Moreover, experi- 
mental measures utilized in studies with preschool children are generally of 
questionable validity for one or more reasons. For example, the results are 
often confounded because performance of the experimental task requires 
skills, such as verbal abilities, which may be unrelated to the body schema. 

Witkin (13) and Witkin et al. (14, 15) have postulated that one sub- 
division of the body schema, surface schemata, develops earlier than the other 
body schema subdivision, the postural model schemata. Moreover, they state 
that body schema development is related to body area. That is, the body 
schema develops earlier on those areas more proximal to the body trunk than 
on those areas more distal to the body trunk.

The primary purpose of this study was to investigate systematically the 
body schema and its two subdivisions—the surface and postural model 
schemata—in nonorganically impaired retardates, organically impaired re- 
tardates, and normal children of preschool MAs. The surface and postural 
model schemata were separated into two subgroups according to distance 
from the body trunk. 


B. METHOD
1. Subjects 
Sixty subjects (Ss)—15 organically impaired retardates, 15 nonorganically 
impaired retardates, and 30 nonorganically impaired normal children, ranging 
in MA from three to six years—were used for the study. In all retarded 
cases, a neurological examination, conducted by a certified child neurologist, 
was used to determine presence or absence of organic impairment. 
The mean MA of each group was as follows: organically impaired
retardates, four years and 10 months; nonorganically impaired retardates,
four years and nine months; normals, four years and six months. A
single-factor, unequal but proportional N analysis of variance (ANOVA), in which
MA served as the dependent variable, was calculated. The F ratio was not
statistically significant. Thus, the three groups were not significantly
different with regard to average MA.

The chronological age (CA) range of each retarded group was from four
to 10 years. The CA range of the normal Ss was from three to six years.

All retarded Ss attended day schools for the educably retarded. The normal 
Ss attended day preschool nursery programs. 


2. Experimental Measures 


The finger localization (FL) and localization of tactile stimulation
(HAL) subtests of the Southern California Kinesthesia and Tactile Perception
Tests (SCKT), a nonverbal battery, were used to measure, respectively,
the surface schemata of the fingers and of the hands-arms. For the FL tasks,
successful identification of the touched finger or fingers was scored one. For
the HAL tasks, the distance between where an S had been touched by the
examiner (E) and where the S indicated being touched was measured in
centimeters (cm). If a distance was greater than one cm, the trial was scored
zero; if less than one cm, the trial was scored one. This scoring was derived
from Ayres’ (1) standardized data.
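
The two scoring rules just described can be stated as a brief Python sketch. It is illustrative only: the function names and finger labels are invented here, and the treatment of an error of exactly one cm, which the text leaves unspecified, is an assumption.

```python
def score_fl_trial(touched, identified):
    """FL: score 1 when the S identifies exactly the finger or fingers touched."""
    return 1 if set(touched) == set(identified) else 0


def score_hal_trial(distance_cm, criterion_cm=1.0):
    """HAL: score 1 when the localization error is under the criterion.

    Assumption: an error of exactly one cm is scored as a failure; the
    text specifies only "greater than" and "less than" one cm.
    """
    return 1 if distance_cm < criterion_cm else 0


# Example: one FL trial with two fingers touched, and three HAL errors in cm.
fl_score = score_fl_trial(["index", "ring"], ["ring", "index"])
hal_scores = [score_hal_trial(d) for d in (0.4, 1.6, 0.9)]
print(fl_score, hal_scores)  # prints: 1 [1, 0, 1]
```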

A nonverbal task, Imitation of Gestures (5), was used for measuring the 
postural model schemata. In the task, the child imitated a series of body move- 
ments or gestures made by the E. Each imitation was scored as either success 
(one) or failure (zero). Mirror image responses reflecting the egocentricity 
of the preschool child were scored as successes. The task was subdivided into
hands-arms imitations (IHA) and finger imitations (IF).

All four experimental tasks consisted of 20 trials—four practice and 16 
experimental. The four practice trials were administered without the S’s 
vision being occluded; during the 16 experimental trials, the S’s vision was
occluded. This was accomplished by placing goggles over the S’s eyes. The
bottom and lateral half of the goggles were covered with black adhesive tape
which enabled the S to see the E but not his own hands, arms, or fingers.
If an S failed two or more of the practice trials, the E assumed that the S
did not understand the nature of the task, and testing was discontinued.
3. Procedure


Each retarded S’s MA was assessed before the experimental measures were
administered. All were taken from evaluations conducted not more than six
months before experimental testing. The MA scores of retarded Ss were
measured with the Merrill-Palmer Intelligence Test (10).

Each retarded S was seen for two separate 20-minute testing sessions.
There was one session per day for two consecutive days. Two of the
experimental tasks were administered during each of the sessions. The order of
task presentation was randomized for each S. The nonretarded Ss were
treated exactly like the retarded Ss except they were seen for three consecutive
days. During the third session, the Peabody Picture Vocabulary Test, Form
A, was administered to determine MAs of the normal Ss.

To determine the reliability of the E’s scoring for the postural model task, 
a judge was present when five of the retarded and five of the normal Ss were 
tested. Each of the trials was scored independently by the judge for each of 
the imitation tasks. 


C. EXPERIMENTAL DESIGN AND RESULTS


A Pearson product-moment coefficient (r) was calculated to determine the 
reliability of the E’s scoring of the IF tasks. The dependent variable was 
the number of correct responses as scored by the E and by the judge, for all
10 Ss, for each separate trial of IF tasks. A similar r was calculated for the
IHA tasks. Both coefficients were significant (p < .01). This established
that the E’s scoring of these two tasks could be reliably repeated by other Es.
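
A reliability coefficient of this kind can be computed as follows. The counts in this Python sketch are hypothetical and serve only to show the calculation; the study's trial-by-trial scores are not reported. Each list gives, for each of 16 trials, the number of the 10 Ss scored as correct by one scorer.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return covariation / math.sqrt(ss_x * ss_y)


# Hypothetical counts of Ss (out of 10) scored correct on each of 16 trials.
e_counts = [9, 8, 7, 9, 6, 5, 8, 7, 4, 6, 9, 7, 5, 8, 6, 7]
judge_counts = [9, 8, 6, 9, 6, 5, 8, 7, 5, 6, 9, 7, 5, 7, 6, 7]

r = pearson_r(e_counts, judge_counts)
print(round(r, 2))
```

With these invented counts, which differ on three of the 16 trials, r is about .96.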

The total number of correct responses for each S, for each separate experi- 
mental task, was computed. The mean number of correct responses ranged 
from 5.8 (organically impaired retardates-IF) to 15.8 (normals-IHA). On 
a chance basis, an S would be expected to make four or fewer correct responses 
on each of the experimental tasks. Therefore, the mean number of correct 
responses, for each group, for each task, was significant beyond a chance level 
of expectation. 

The total number of correct responses, for each S, for each experimental
task was used as the dependent variable in a repeated measure, unequal but
proportional N, mixed design ANOVA, with one variable between Ss
(groups), and two variables within Ss (body schema and body area).

The results of this analysis showed that the following variables were
statistically significant: (a) the Group main effect (F = 23.7, df 2/57,
p < .001); (b) the Body Area main effect: fingers, X̄ = 8.5; hands-arms,
X̄ = 12.3 (F = 146.1, df 1/57, p < .001); (c) the Body Area X
Body Schema interaction (F = 19.6, df 1/57, p < .005); and (d) the
Group X Body Area X Body Schema interaction (F = 3.5, df 2/57,
p < .05).

The overall group effect was further broken down by Multiple Comparison
Contrast Tests (MCCTs) computed according to Scheffé’s formulations
(8). These MCCTs indicated that (a) the difference between the organically
impaired retardates (X̄ = 8.2) and the nonorganically impaired retardates
(X̄ = 10.6) was significant (p < .005); and (b) the difference between
the nonorganically impaired retardates and normals (X̄ = 12.9) was
statistically significant (p < .005). These findings demonstrate that nonorganically
impaired retardates have an overall body schema deficit in comparison to
normals and that organically impaired retardates have an overall schema deficit
in comparison to nonorganically impaired retardates.

A simple effects analysis was computed to determine if an overall group
difference existed, respectively, on the two surface tasks and on the two
postural model tasks. The results of both simple effects analyses were
statistically significant (p < .001). The overall group effect, for the surface tasks,
was further broken down by MCCTs. These MCCTs indicated that (a)
the difference between the organically impaired retardates (X̄ = 8.2) and
nonorganically impaired retardates (X̄ = 10.5) was significant (p < .005);
and (b) the difference between the nonorganically impaired retardates and
normals (X̄ = 12.4) was significant (p < .005). Similarly, the group effect,
for the postural model tasks, was broken down by MCCTs. These MCCTs
indicated that (a) the difference between the organically impaired retardates
(X̄ = 8.2) and the nonorganically impaired retardates (X̄ = 10.7) was
significant (p < .005); and (b) the difference between the nonorganically
impaired retardates and normals (X̄ = 13.2) was significant (p < .005).

The Group X Body Area X Body Schema interaction is graphically pre- 
sented in Figure 1. Examination of Figure 1 shows that each group made 
more correct responses on the IHA tasks than on the HAL tasks. In addition, 
the normal group made more correct responses on the IF tasks than on the 
FL tasks. The converse is true for the organically impaired retardates and 
nonorganically impaired retardates. That is, each group of retardates made 
more correct responses on the FL tasks than on the IF tasks. 


FIGURE 1
GROUP X BODY SCHEMA X BODY AREA INTERACTION
D. DISCUSSION


The results of this investigation have shown that retarded intellectual
functioning is highly associated with deficits in body schema. Piaget would
argue that a causal relationship exists between the two characteristics;
however, such a relationship cannot be substantiated by this study.

The finding that the surface and postural model schemata are significantly
better developed on the more proximal body area (hands-arms) than
on the more distal body area neither clearly supports nor contradicts Witkin’s
(13) contention that the development of the body schema proceeds from the
more proximal to the more distal body areas. Since performance of each group
on the experimental tasks was beyond a chance level of success, it is possible
that the two subdivisions of the body schema began to develop at the same
time on each body area but developed more quickly on the hands-arms than
on the fingers.

The significant three-way interaction does seem to confirm Witkin’s for- 
mulation that the surface schemata develop earlier than the postural model 
schemata. Witkin states that surface schemata must reach a certain level of
development before movements based on the postural model schemata can be 
effectively carried out. In essence, the child’s ability to move body parts effec- 
tively in relationship to each other is limited by his inability to distinguish 
body parts from one another. Thus, for each group of Ss, at the hands-arms 
body area, the surface schemata were well enough developed so that move- 
ments based on the postural model schemata were not hampered. However, 
at the fingers-body area, the surface schemata of the organically impaired 
retardates were so poorly developed that difficulty in distinguishing between 
their fingers interfered with the ability to execute finger movements. This 
was also the case with the nonorganically impaired retardates but to a lesser 
degree, for their surface schemata had reached a higher level of development 
than that of the organically impaired retardates. Finally, the surface schemata 
of the fingers were highly developed in normal children and had reached a 
developmental level sufficient not to hamper movements based on the postural 
model schemata of the fingers.

The finding that organically impaired retardates have an overall body
schema deficit in comparison to nonorganically impaired retardates is not 
congruent with the results of Benton’s studies (2, 3, 4). Examination of some 
of the methodological differences between Benton’s and the present study may 
help in explaining this discrepancy. In Benton’s studies, the distinction is 
drawn between familial and brain injury causes of retardation. Yet, it 
is not always clear which criteria for the classification of familial retardation
Benton has used. In essence, it seems that the least contaminated classification 
of whether or not the child is organically impaired is that used in the present 
study—i.e., the results of neurological examinations. 

Even if it is assumed, from Benton’s study, that his classification system 
was accurate, examination of his data suggests that his conclusion that there
was not a significant difference between performance of the two retarded groups 
is questionable. For example, the nonsignificant differences between the re- 
tarded six-year old subjects were derived from the exceedingly small sample 
of three brain injured children, and their performance was quite variable. 
Thus, the small sample in conjunction with the large variability probably 
would make it extremely difficult to detect significant differences between the 
retarded groups, even if they did exist. 

Moreover, Benton’s measurement of the retarded child’s MA was based
on “the last intelligence test administered” (2, p. 584) by the institution.
Benton does not provide information on the period of time that lapsed from
this testing until the child was administered the experimental tasks.
Consequently, the two retarded groups did not seem to have been matched
carefully for MA.

One final difference between the methodology of Benton’s and
the present study also has to do with the MA of Ss; whereas Benton’s
subjects were school age, the present study investigated children of preschool
MAs. Possibly, the body schema deficit of the organically impaired retardates
in comparison with the nonorganically impaired retardates, which is present
at the preschool MAs, is lessened significantly by the time the children achieve
an MA of six or more years. However, there is no known theory or research
that would tend to support this conclusion. In fact, it would be contrary to
some of the existing theory in the area, such as Piaget’s. Therefore, in
consideration of the methodological difficulties of Benton’s studies, it seems
that the most acceptable conclusion is that organically impaired retardates
have an overall body schema deficit in comparison to nonorganically impaired
retardates.

Benton’s conclusion that the postural model schemata develop earlier than
the surface schemata is not congruent with the results of the present
investigation. Examination of Benton’s tasks indicates that he failed to take into
account the effect that body area had on the development of body schema skills.
His surface schemata measure was derived from studies of the fingers-body
area, whereas his postural model schemata measure was derived from studies
of the movements of the hands-arms. Therefore, Benton’s failure to control
for body area may have led to the erroneous conclusion that postural model
schemata developed more quickly than surface schemata.


VERBAL OPERANT CONDITIONING AS A MEASURE
OF VALUE STRENGTH: II*


The University of Georgia 


Edith Weisskopf-Joelson and Walter Floyd Heiney, Jr.*


A. INTRODUCTION 


The present study is designed to investigate the validity of the provocative 
statement made by Skinner (14) that “Any list of values is a list of rein- 
forcers . . .” (p. 33). The statement suggests the practical possibility that 
conditioning can be used as an objective and quantitative tool for the measure- 
ment of value strength, thus helping to bridge the gap between the social- 
humanistic and the experimental approach to the study of human psychology. 
The most direct test of the notion that personal values function as reinforcing 
stimuli obviously lies in the domain of verbal operant conditioning (VOC). 
Thus, this study specifically hypothesizes that verbal stimuli pertaining to 
personal values of high strength will increase the frequency of verbal re- 
sponses more than verbal stimuli pertaining to personal values of low strength. 


The study of reinforcers with regard to their relative efficacy is an im- 
portant area of research which could open many unexplored vistas. Literally, 
such knowledge tells what makes an individual “tick” or what “turns him 
on.” Therefore, it is difficult to understand why previous literature on VOC 
limits itself mostly to verbal reinforcers expressing interest, approval, or 
agreement in different manners. For example, Greenspoon (7) compared the 
stimulus “mm-hm” with the stimulus “huh-uh.” He found that the former 
stimulus increased the number of emitted plural nouns, while the latter de- 
creased them. Kennedy and Zimmer (9) found that “mm-hm,” “good,” and 
a short paraphrase increased the number of self-reference statements emitted 
by the subjects (Ss), while “mm-hm” with an affirming head nod and smile 
and “I see” did not do so. Verplanck (19) found paraphrasing more effective
in increasing statements of opinion than “That's so,” or “Yes, you’re right.” 

Few studies have employed relatively complex reinforcement comparable 
to the complexity of values. Adams, Noblin, Butler, and Timmons (2) found 
that psychoanalytic interpretations of the sentences that Ss emitted increased 
the emission of the reinforced response class, first person pronouns. Similarly,
Adams and Frye (1) were able to increase personal references by clinical 
interpretations; in contrast, reflections of feelings and hostile statements de- 
creased personal references. 

These studies are, however, only tangentially related to the present question
of the relative effectiveness of verbal stimuli that pertain to personal
values of varying strength. 

In contrast, the present study is closely related to the investigation by
Golightly and Byrne (6), in which it was shown that conditioning occurs
when the critical responses are followed by statements reflecting attitudes
similar to Ss’, while all other responses are followed by statements reflecting
dissimilar attitudes.


B. METHOD
1. Selection of Value-Related Reinforcing Statements


Since the study (see below) used college students as Ss, the high strength 
values chosen were those that, according to previous investigations (8, 20),
were rated as most meaningful by over 300 undergraduate students at the
same institution from which the subjects for the present study were selected.
These values were Self-Interpretation and Self-Actualization. Ten
statements, which on the basis of content seemed to reflect these values and had
been empirically chosen as preferred most frequently by students in the above
studies, were used as reinforcing statements in the present study. Typical of
such high value statements are the following:

1. If you will try harder to understand why you act and feel as you do,
you will be able to solve many of your problems (Self-Interpretation). 

2. You would profit from developing many of your capacities which have 
thus far been neglected (Self-Actualization). 

3. You are the kind of person who would profit from analyzing your 
own needs and motives (Self-Interpretation).

The value chosen to represent low strength was that of Conformity. This 
value was shown (8, 20) to rank the lowest in the hierarchy of the nine 
values presented in the above studies to the college population used in the 
present study. Ten statements which, again on the basis of content and 
empirically determined preferences (8, 20), embody this value were chosen as
low strength value reinforcers. Typical of these statements reflecting the
low strength value of Conformity are the following:

1. You need to allow yourself to be influenced more by the standards and
norms of your friends.

2. You could gain a great deal by bringing your ideas more in line with
the ideas of your peers.

3. You will find it helpful to try to be more like other people your age.

To establish the fact that the statements actually represent the values in
question, a definition of all nine values (20, p. 302) was given to five judges
together with all the statements believed to represent these values. Ninety-six
percent of the judgments made agreed with the authors’ intentions.


2. Subjects 


Forty male and 40 female undergraduate students participated in the
experiment to fulfill course requirements in Introductory Psychology or in the
Psychology of Adjustment. Their ages ranged from 17 to 25, with a median 
of 19 years and a mean of 19.48 years. They participated in the experiment 
before VOC was discussed in their courses. 


3. Procedure 


The experiment was conducted in a Taffel-type (18) experimental situation.
One hundred twenty 5 X 8 inch cards were used as materials. Three
personal pronouns were typed in capital letters in a horizontal row near the
top of each card. Below the set of pronouns there was a horizontal row of
three different common past-tense verbs in random order, also typed in capital
letters. The personal pronouns were randomly selected from the list, “I,” “you,”
“he,” “she,” “we,” “they,” with the restriction that “I” appeared on only 60
cards and “we” appeared on the remaining 60 cards only. “I” as well as “we”
occupied the first, second, and third position in the row of pronouns an equal
number of times; with the exception of this restriction the order of pronouns
was random. Each verb was used twice on the series of 120 cards; thus, there
was a total number of 180 verbs. The cards were presented in random order
which was different for each S.
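
The constraints on the card set can be made concrete with a short Python sketch. It is one possible reading of the description, not the authors' procedure: it assumes that every card carried exactly one first person pronoun plus two different pronouns drawn from the remaining four, and the verbs are placeholders because the actual 180 verbs are not listed. All names are illustrative.

```python
import random

FIRST_PERSON = ("I", "WE")
OTHER_PRONOUNS = ("YOU", "HE", "SHE", "THEY")
# Placeholder verbs; the actual 180 past-tense verbs are not given in the text.
VERBS = [f"VERB{i:03d}" for i in range(180)]


def build_deck(rng):
    """Return 120 cards, each a (pronouns, verbs) pair of 3-tuples."""
    # 60 "I" cards and 60 "WE" cards; each target pronoun sits in each of
    # the three positions on exactly 20 cards.
    layouts = [(target, position)
               for target in FIRST_PERSON
               for position in range(3)
               for _ in range(20)]
    rng.shuffle(layouts)

    # Every verb is used twice (360 slots); reshuffle until no card
    # carries the same verb twice.
    while True:
        pool = VERBS * 2
        rng.shuffle(pool)
        verb_rows = [tuple(pool[i:i + 3]) for i in range(0, len(pool), 3)]
        if all(len(set(row)) == 3 for row in verb_rows):
            break

    deck = []
    for (target, position), verbs in zip(layouts, verb_rows):
        pronouns = rng.sample(OTHER_PRONOUNS, 2)
        pronouns.insert(position, target)
        deck.append((tuple(pronouns), verbs))
    return deck


deck = build_deck(random.Random(0))
print(len(deck), deck[0])
```

Shuffling a fresh copy of the deck for each S would then reproduce the individually randomized order of presentation. The count of 180 verbs follows directly from the design: 120 cards with three verbs each give 360 verb slots, and each verb fills two of them.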

The following instructions were given: 

We are attempting to develop a new method which is to be used to diagnose 
one’s personality. You are going to look at some cards one at a time. For each 
card I would like you to make up a sentence using one of the three upper 
words as the first word in the sentence and one of the three lower words as 
the second word in the sentence. You may use any one of the upper words
and any one of the lower words. It is important that you respond with the 
first sentence that occurs to you. We feel that your sentences can be used 
to understand your personality. I may interrupt you whenever I have reached 
a diagnostic conclusion and communicate the conclusion to you. Go ahead. 

What Ss were led by the instructions to perceive as “diagnostic conclu- 
sions” were actually reinforcers contingent upon Ss’ use of a first person 
pronoun—either “I” or “we.” Thus, whenever Ss constructed a sentence 
using “I” or “we,” the experimenter (E) delivered a verbal reinforcement. 
After the termination of the experiment Ss were told that the diagnostic 
conclusions did not apply to them, since they were the same for all Ss. 

Ss were randomly divided into four groups of 20 Ss each, with the restriction
that each group contained the same number of males and females. Group
I was reinforced by verbal stimuli pertaining to high values, namely
Self-Interpretation and Self-Actualization. Each time an S in Group I
constructed a sentence beginning with the pronouns “I” or “we,” one of the 10
statements discussed above, pertaining to the values of Self-Interpretation or
Self-Actualization, was delivered to him by E. Similarly, Ss in Group II
were reinforced by one of the 10 statements pertaining to a low value,
Conformity, each time they made up a sentence beginning with “I” or “we.”
Group III was reinforced by the mild affirmatory words “good,” “fine,” and
“ok” for the emission of first person pronouns. Group IV received no
reinforcement.


The order of presentation of all reinforcers within Groups I, II, and III 
was random and was different for each S. 

The possible bias according to which E might, for example, have emitted
high value statements “with greater enthusiasm” than low value statements
was not controlled systematically; but E tried to keep his tone of voice
constant for all reinforcers and to eliminate all other linguistic, paralinguistic,
and kinesic cues that could confound the experiment (4).

The operant level of first person pronoun emission was determined in
Trials 1 to 24. The experimental phase, during which reinforcement was
given to Groups I, II, and III, was administered over Trials 25-96.
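The trial structure can be made explicit with a short sketch. It assumes the phase boundaries stated in the text (operant Trials 1-24, conditioning Trials 25-96, extinction Trials 97-120) and the blocking used in the analysis, in which the 24 operant trials form one block and the remaining trials form blocks of 12, giving nine blocks in all. The function name `phase_and_block` is hypothetical.

```python
def phase_and_block(trial: int) -> tuple[str, int]:
    """Map a trial number (1-120) to its phase and analysis block (1-9)."""
    if not 1 <= trial <= 120:
        raise ValueError("trial must be between 1 and 120")
    if trial <= 24:
        # The 24 operant trials are treated as a single block.
        return ("operant", 1)
    # Blocks 2-7 cover conditioning, blocks 8-9 cover extinction.
    block = 2 + (trial - 25) // 12
    phase = "conditioning" if trial <= 96 else "extinction"
    return (phase, block)


if __name__ == "__main__":
    for t in (1, 24, 25, 73, 96, 97, 120):
        print(t, phase_and_block(t))
```

Under this numbering, Trials 73-84 fall in block 6, which is the fifth conditioning block.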

Ss’ awareness of reinforcement contingencies was ascertained by the
following four oral questions:

1. What do you think it was all about?
2. Did you usually give the first sentence which came to your mind?
3. How did you go about deciding which of the words to use?
4. What do you think the purpose of the diagnostic comments (things I
said) was?

The answers were recorded verbatim. Aware Ss were defined as those who
held a hypothesis that would enable them to bring about reinforcement at 
each trial if they wished to do so. On the basis of the answers to the above 
questions all Ss were judged unaware except three Ss in Group III. They 
were replaced by three new Ss assigned to Group III who were subsequently 
judged to be unaware. 

A long questionnaire, such as the one used by Spielberger (16, p. 86) and 
others, was avoided because the authors felt it might have created awareness 
through suggestion. Nevertheless, in consideration of the controversies in this 
area, a questionnaire based on the one reported by Spielberger (16, p. 86) was 
used in a later modified replication of the study. The modified questionnaire 
contained 16 questions; it was administered before the extinction period. The 
answers were submitted to four judges, who were asked to determine which 
Ss were aware—i.e., held an hypothesis which, if acted upon, would produce 
reinforcement on 100% of the trials. The judges agreed unanimously on 98% 
of the judgments made. The results showed that 89% of Ss were unanimously 
judged unaware.

In the present study, as well as in the replication, Ss were told that the 
administration of the questionnaire was not a part of the experiment proper 
and that they would not be “diagnosed” on the basis of their responses to it. 

It appears that the perception by Ss of the experiment as a diagnostic
procedure might have reduced awareness. The attention of Ss may have been
focused on the contents of the “diagnostic conclusions” which may have been
of great interest to them, rather than on the contingencies. This view is
supported by the fact that the three aware Ss in the present study belonged to
Group III, which was reinforced by mild affirmatory words, rather than to 
the groups reinforced by bogus diagnostic conclusions. In addition it seems 
plausible that Ss interpreted the “diagnostic conclusions” as based on a large 
number of their sentence constructions, rather than only on the last sentence 
immediately preceding the conclusions; therefore, Ss can be expected to have 
been less attentive to contingencies. 

The extinction period, during which no reinforcement was given, included
Trials 97-120.

E recorded the responses in such a way that S could not see the recording
sheets. For every S the pronoun he selected was recorded at each trial.

The experiment was conducted in a 10 × 8 × 10 foot room in which S
and E (Heiney) sat facing each other across a table. The first 10 minutes of
the experiment were taken up by a friendly conversation. The inclusion of
this pretest interchange between S and E was based on findings by Solley and
Long (15) according to which a warm-up period of this kind facilitates
conditioning. Then E presented the cards, one at a time, to S. The entire session
lasted approximately 40 minutes.


C. RESULTS


Figure 1 shows that the group reinforced with high values was consistently
superior to all other groups in the production of first person pronouns. The
group reinforced by mild affirmatory words was superior to the group
reinforced by a low value and to the control group. The group reinforced by a
low value did not emit appreciably more first person pronouns than the control
group, except in one trial block (Trials 73-84).


The data were analyzed by a group × sex × blocks of 12 trials each (4 ×
2 × 9) design with 20 Ss (10 males and 10 females) in each group. (The
24 trials that determined the operant level were treated as one trial block.)
A computer-programmed multivariate analysis of variance was used to assess
overall treatment and sex effects over the nine trial blocks, yielding a
nonsignificant F ratio (Hotelling’s T²) of .856 for sex main effects (3, 13). A
likelihood ratio statistic (chi square series) of 83.71 was obtained for group
effects (3, 13); with 27 degrees of freedom this statistic was significant (p <
.01), indicating that some differences due to treatment were significant.


TABLE 1
F VALUES FOR EACH OF THE NINE TRIAL BLOCKS
(table body not recoverable; surviving headings: Source of variation, Operant)

A repeated measures analysis was not used because the assumptions of homo- 
geneity of variance and covariance were not met. Therefore, a multivariate 
analysis of variance which requires fewer assumptions was employed. 

To examine treatment effects by trial blocks, a two-way univariate analysis 
of variance was performed at each of the nine trial blocks (Table 1). 

As can be seen in Table 1, which summarizes all these two-way analyses of 
variance, group (treatment) effects were not significant at the operant level, 
but were significant (p < .05 to p < .001) at every treatment block. The
treatment effects persisted through the two extinction blocks (p < .05), but
the magnitudes of group mean differences were not nearly as great as in the
four preceding conditioning trial blocks.

Correlations between the trial blocks and the discriminant function among 
the experimental groups (5) showed that as Ss proceeded through the operant 
and conditioning blocks, they could be more and more conspicuously discrimi- 
nated on the basis of the reinforcement they received. The correlation was 
low during the operant phase (.059), then it increased consistently during the 
conditioning trial blocks (.275, .468, .643, .754, .754, .774), and it dropped 
markedly during extinction (.257, .292) in comparison to its previous level. 

Table 2 shows mean differences per S of first person pronouns among all
groups at each conditioning trial block. The Newman-Keuls procedure was
used to test these differences. As can be seen in this table, the means of the
group reinforced by high values (Group I) were significantly higher than the
means of the control group (IV) in every conditioning trial block; they were
higher than the means of the low value reinforced group (II) for all
conditioning trial blocks except the first one, and higher than the means of the
group reinforced by mild affirmatory words (III) for all except the first two
trial blocks. The means of the group reinforced by mild affirmatory words
were significantly higher than the means of the group reinforced by a low value
and of the control group for all trial blocks except the first one. The means of
the group reinforced by a low value and the means of the control group
differed only at the fifth conditioning block, for which the mean of the former
group was higher.


TABLE 2
MEAN DIFFERENCES OF FIRST PERSON PRONOUNS BETWEEN GROUPS
AT EACH TRIAL BLOCK OF THE CONDITIONING PHASE

                          Mean Differences
Groups        1        2        3        4        5        6
I-II         .96     1.32*    3.48**   4.08**   3.00**   3.72**
I-III        .36      .12     1.44*    1.92**   1.56**   1.32*
I-IV        1.32*    1.92**   3.12*    3.60**   4.20**   3.60**
III-II       .60     1.20*    2.04*    2.16**   1.44*    2.40**
II-IV        .36      .60     −.36     −.48     1.20*    −.12
III-IV       .96     1.80*    1.68*    1.68**   2.64**   2.28**

* p < .05.
** p < .01.


For the extinction period Newman-Keuls tests were conducted on group 
mean differences. In both extinction trial blocks the mean of the group rein- 
forced by high values was significantly larger (p < .05) than the mean of the 
group reinforced by a low value. 


D. Discussion 


The hypothesis tested in the present study stated that Ss reinforced by 
verbal stimuli pertaining to Self-Interpretation and Self-Actualization would 
demonstrate more effective conditioning than Ss reinforced by verbal 
stimuli pertaining to Conformity. This hypothesis was based on previous 
findings that Self-Interpretation and Self-Actualization have relatively high 
value strength, while Conformity has relatively low value strength for the 
population from which Ss were selected. The hypothesis was distinctly con- 
firmed. Starting with the second conditioning block of trials, Ss reinforced 
with high value statements emitted significantly more first person pronouns 
than Ss reinforced with low value statements; this superiority was main- 
tained throughout the conditioning series. 

The results also suggest that the responses to high value statements showed
some resistance to extinction. Even during the extinction periods Ss previously 
reinforced by statements pertaining to high values emitted significantly more 
critical responses than Ss previously reinforced by statements pertaining to a 
low value. 

Thus, as predicted, statements pertaining to high values are more effective
reinforcers than those pertaining to a low value. 

No specific hypotheses were formulated regarding the efficacy as reinforcers 
of high and low values as compared to the control group and to the group 
reinforced by mild affirmatory words because the previous investigations by 
Weisskopf-Joelson et al. (20) and by Katkin and Weisskopf-Joelson (8) 
yield suggestions regarding only the relative strength of values in comparison
to each other: they indicate that Self-Interpretation and Self-Actualization are
higher values than Conformity for the population from which Ss were selected,
but they do not show whether these values are high or low in an absolute sense,
for example, in comparison with the control group. However, the results of the 
present study indicate that Self-Interpretation and Self-Actualization are high 
values for Ss under consideration also in an absolute sense, that they result in 
an emission rate of critical responses that is significantly higher than the emission 
rate that occurs when no reinforcement is given. In contrast, Conformity, while 
being a low value in relation to Self-Interpretation and Self-Actualization, is 
a neutral value in relation to the control group for Ss of this study: it neither 
increases nor decreases the number of critical responses that occur without 
reinforcement. (Trials 73-84 are an exception to this rule, since during these 
trials Ss reinforced by Conformity emitted significantly more critical responses 
than Ss in the control group.) 

The results indicate that mild affirmatory words, a well-established rein- 
forcement, are less effective reinforcers than high values and more effective 
reinforcers than a low value. 

Kerlinger (10), Mogar (12), and others have criticized research on values 
because it often focuses on verbally professed rather than behaviorally mani- 
fested values. With regard to the values investigated in this study one might 
ask: To what extent do students “self-actualize” in their overt behavior? To 
what extent do they conform? Such questions cannot be answered within the 
present investigation. However, students’ responses in the present study would
seem to be more closely related to behavioral manifestations of values than
are their responses to paper-and-pencil questionnaires.

It is conceivable, for example, that the effect of social desirability might 
be decreased when VOC is used as a measure of value strength. Thus, the 
present investigation may represent one step toward the goal of studying 
values on the basis of behavior in simulated or real life situations.

However, it seems quite possible that the statements pertaining to values
of high strength could have encouraged Ss to talk about themselves (use first
person pronouns) independent of the reinforcing potency of the statements.
On the other hand, the statements concerning the low strength value could
have encouraged Ss to talk about others. Thus, further research is needed in
which the reinforced response classes are other than first person pronouns. The
same argument could be applied to all Taffel-type VOC studies that use first
person pronouns as the reinforced response class, and stimuli as reinforcers which
can be perceived by Ss as expressing interest (mm-hm), approval (good, fine),
or agreement (that’s right).

The confounding mentioned in the previous paragraph would predict a
stronger effect for “I” than for “we” in the present study, when high values are
used as reinforcers. However, “I” and “we” do not differ significantly with
regard to the frequency with which they have been selected by Ss reinforced
with high values. Furthermore, in the replication mentioned above in the
Method section, the reinforced response class was the use of second and third
person pronouns rather than first person pronouns; yet the results of the
replication were in the same direction as those of the present study.

Moreover, if the results shown in Figure 1 were due not to conditioning
but merely to the fact that Self-Actualization statements and
Self-Interpretation statements induce Ss to speak about themselves (i.e., to use
first person pronouns), then the delivery of high value statements should increase the
frequency of first person pronoun selection also if high value statements are 
delivered noncontingently. However, two subsequent studies (11, 17) suggest 
that this is not the case. Kersey (11) showed that contingent reinforcement 
of first person pronoun emission by high value statements produces condition- 
ing, while noncontingent (yoked) reinforcement does not produce condition- 
ing. Likewise, Steele (17) showed that a group of Ss that was contingently 
reinforced by high value statements selected significantly more first person 
pronouns than a yoked control group that was noncontingently reinforced. 


E. SUMMARY


The hypothesis tested was that verbal stimuli pertaining to high values 
(SHV) are more effective reinforcers than verbal stimuli pertaining to low 
values (SLV). Three groups of 20 undergraduates each were reinforced for 
the emission of first person pronouns in a Taffel-type situation. A control 
group received no reinforcement. Group I, reinforced by SHV, showed the 
highest degree of conditioning. Group II, reinforced by SLV, did not condi- 
tion appreciably. Group III, reinforced by mild affirmatory words, took an 
intermediate position between Group I and Group II with regard to the degree 
of conditioning. Thus, the hypothesis was confirmed. The detected relation- 
ship between value strength and degree of conditioning may lead to a new 
way of measuring values through verbal operant conditioning.


GROUP SANCTION SEVERITY AS A FUNCTION OF
DEGREE OF BEHAVIOR DEVIATION AND 
RELEVANCE OF NORM: REPLICATION 
AND REVISION OF MODEL* 


Smoke Psychological Laboratories, Gettysburg College 


S. A. MUDD


A. INTRODUCTION 


A model describing an empirically determined higher order relationship 
between group norm relevance, degree of behavior deviation from a norm of 
given relevance, and the severity of sanction for such a deviation was pre- 
sented (3) in the form Y = .32 log X + .45, where Y was the slope of the 
regression of sanction severity as a function of degree of deviation from a 
norm, and X was the relevance of that norm. This expression reflected the 
fact that, as norm relevance increased, the severity of sanctions assigned by 
college student leaders per unit deviation from the norm increased, but at a 
negatively accelerated rate. 

Such a model has utility only to the extent that the constants in the 
equation remain stable across repeated samples and across time. The study 
reported here was an exact replication of the original study on a new genera- 
tion of students at Gettysburg College where the original data were taken. 

The original study involved three behavioral norms: speech, cleanliness, 
and honesty. Sanctions assigned to various deviate behaviors in each of the 
norm areas were found to vary linearly as a function of degree of deviation.
That is, deviations from each norm were so sanctioned that a unit increase
in deviation from the norm led to a constant increment of severity of sanction
judged to be appropriate for that behavior. The slope of the deviation
function for each norm was unique: speech (−.21); cleanliness (.47); and honesty
(.65). It was the change in these slope values as a function of the scaled
relevance of their corresponding norms [speech (.01); cleanliness (1.46);
and honesty (3.35)] that was described by the proposed model.

The 1969 replication reported here was run to test the hypothesis that the
slopes of the three deviation functions, on which the model was based, would 
not differ from those obtained in the original 1962 study (3). 
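As a worked check on the original model, the sketch below evaluates Y = .32 log X + .45 at the three scaled relevance values and compares the result with the three reported slopes. It assumes that "log" means the base-10 logarithm, which the text does not state explicitly; the function name `predicted_slope` is hypothetical.

```python
import math


def predicted_slope(relevance: float, a: float = 0.32, b: float = 0.45) -> float:
    """Slope of the deviation function predicted by Y = a * log10(X) + b."""
    # Assumption: "log" in the model is the common (base-10) logarithm.
    return a * math.log10(relevance) + b


# (scaled norm relevance, slope reported for the 1962 data)
ORIGINAL_1962 = {
    "speech": (0.01, -0.21),
    "cleanliness": (1.46, 0.47),
    "honesty": (3.35, 0.65),
}

if __name__ == "__main__":
    for norm, (relevance, observed) in ORIGINAL_1962.items():
        predicted = predicted_slope(relevance)
        print(f"{norm:12s} observed {observed:+.2f}  predicted {predicted:+.2f}")
```

With a base-10 logarithm the predicted slopes fall within about .04 of the reported ones, which suggests that this reading of the formula is the intended one.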


B. METHOD


The procedures used in the original study were used without modification 
in this replication. Briefly, items scaled for use in the original study were 
presented to 20 fraternity and 19 sorority leaders (office holders) all of 
whom were Gettysburg College students. Nine scaled behavior deviation 
items (e.g., “cheats at cards”) were presented in each of the three norm areas 
involved. Subjects assigned what they judged to be appropriate sanctions from
a list of 11 sanctions scaled for the original study in terms of severity (e.g.,
“being asked to give up membership in the group”). Each of the 27 deviation 
items was presented separately in a randomized order. 


C. RESULTS


Data were in the form of scaled magnitudes of sanction severity as assigned 
by each of the 20 men and 19 women to each of the nine deviant behaviors 
at each of three levels of norm relevance. These 1969 data were compared 
with the data from the 1962 sample (24 men, 24 women) in separate three-
factorial analyses of variance (sex [2] × year [2] × deviation [9]) for
each norm. The F tests for the three analyses are summarized in Table 1.

The sources of variance crucial to the hypothesis that the slopes of the 
deviation functions would not vary from 1962 to 1969 were the two- and 
three-way interactions involving the year and deviation variable. It can be 
seen from an inspection of Table 1 that deviations did not interact signifi- 
cantly with any other variables. Graphically, this meant that plots of the 
mean sanctions assigned to the deviations did not differ significantly from 
1962 to 1969 (Deviations × Year), from men to women (Deviations ×
Sex), nor as a function of both sex and year (Deviations × Sex × Year).
In fact, the mean sanction values showed remarkable stability across all vari- 
ables as reflected by the fact that all sources of variability for all three norms 
were nonsignificant, except for the main effects of deviations for all norms 
and the main effects of sex for the honesty norm. 

A significant effect of deviations was expected at all norm levels. A sex
difference for the honesty norm was not expected, since such an effect had
not been obtained in the 1962 data. The significance of the sex difference
was based on the tendency of men to assign more severe sanctions across all
honesty deviations. This main effect operated across all honesty deviations
so that the intercept of the deviation function was involved, but the slope of
the line was not.


TABLE 1
SUMMARY OF F RATIOS OBTAINED IN ANALYSES OF VARIANCE OF SANCTIONS AS A
FUNCTION OF SEX, YEAR, AND DEVIATIONS AT EACH OF THREE
LEVELS OF NORM RELEVANCE

                                            Behavioral norm
Source of variance              df    Speech   Cleanliness   Honesty
Between subjects                86
  Year (Y)                       1      .63       3.37         2.36
  Sex (S)                        1      .00       1.00         5.51*
  Y × S                          1      .24    [illegible]      .18
  Persons within groups         83
Within subjects                696
  Deviations (D)                 8    17.68**    12.03**      58.34**
  D × Y                          8     1.50       1.28         1.55
  D × S                          8     1.84       1.64      [illegible]
  D × Y × S                      8      .75       1.36          .32
  D × persons within groups    664
Total                          782

* Indicates an F ratio significant at p = .05.
** Indicates an F ratio significant at p = .01.

Since the slopes of the deviation functions did not vary from 1962 to 1969, 
the data from both years were combined, and curves were fitted with use of 
the pooled data. All three functions were fitted and tested for linearity (2, 
p. 525). The resulting functions and F tests for linearity were as follows: 


1. Speech:      Y = −.36X + 2.07  (F = 13.05; p < .01; df 1, 781).
2. Cleanliness: Y =  .38X +  .77  (F =  3.86; p < .05; df 1, 781).
3. Honesty:     Y =  .60X + 2.46  (F = 66.76; p < .01; df 1, 607).


The slopes of the three deviation functions were then fitted to the scaled
relevance values of the three norms and found to vary as a linear function
of log values of norm relevance: Y = .36 log X + .37, where Y equals the
slope of a given deviation/sanction function, and X is the magnitude of norm
relevance. A test of the goodness of fit of that line (1, p. 43) yielded
t = 12.80 (df = 1), which was significant (p < .05) and provided a basis
for the rejection of the null hypothesis that no linear relationship existed.
The confidence interval (p = .05) for the obtained slope was .36 ± .40.
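The fit of the revised model can be sketched as an ordinary least-squares line through three points: the pooled slopes (−.36, .38, .60) regressed on the log of the scaled relevance values (.01, 1.46, 3.35). This is an illustration and not necessarily the author's procedure. It assumes base-10 logarithms and works from the rounded published values, so it should only approximate the reported constants. The helper name `fit_line` is hypothetical.

```python
import math


def fit_line(xs: list[float], ys: list[float]) -> tuple[float, float]:
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Scaled norm relevance and pooled 1962 + 1969 deviation-function slopes
# for speech, cleanliness, and honesty, as printed in the text.
RELEVANCE = [0.01, 1.46, 3.35]
POOLED_SLOPES = [-0.36, 0.38, 0.60]

# Assumption: the model's "log" is the base-10 logarithm.
slope, intercept = fit_line([math.log10(x) for x in RELEVANCE], POOLED_SLOPES)

if __name__ == "__main__":
    print(f"Y = {slope:.2f} log X + {intercept:.2f}")
```

From the rounded inputs the fit gives a slope and an intercept both close to .37, in line with the reported Y = .36 log X + .37. The small difference in the slope is presumably due to rounding of the published values.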
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D. Discussion 


The basic finding of the replication was that the three deviation function 
slopes were found to be invariant over a seven-year period during which there 
was a complete turnover of the college population from which the two 
samples were drawn. This finding plus the fact that the honesty deviation 
function for women, combined across both years, retained the same slope, 
while the overall level of the function (Y intercept) differed significantly 
from that for men, is considered to be strong evidence for the stability of such 
functions. The obtained sex difference in overall level of the honesty function 
without a change in slope opens up the possibility that the effects of various 
classes of variables on these deviation functions can be differentiated with 
respect to whether they influence the slope, the Y intercept, or both parameters 
of the functions. For example, Sellin and Wolfgang (4, pp. 263-268) found
that the slopes of rated seriousness of juvenile crimes did not vary from one 
class of rater to another (e.g., juvenile court judges to police officers), nor 
did the slopes change with age of alleged offender (e.g., from 13 to 27 
years). 

The present study confirmed the earlier finding that the slope of the 
deviation/sanction function varied as a function of the relevance of the
behavior norm involved, but not as a function of sex, even though the function
for the honesty norm showed a sex difference in the general level of sanction 
assigned. One might speculate that tolerance as a dynamic social process 
operates to raise or lower the threshold of deviation from a norm tolerated, 
but that it does not modulate the rate at which sanctions are increased as 
behavior deviations increase above that threshold. 

Aside from such speculation, it is appropriate here to revise the original 
estimation of the parameters of the model from Y = .32 log X + .45 to
Y = .36 log X + .37, this revision being based on the pooled data for both
years. At the present time plans are underway to develop deviation scales
for a new sample of norms in order to test the internal validity of the 
model. 

The applicability of this model to populations other than college age Ameri- 
cans has not been tested. Further studies are required to establish the extent 
to which the equation can be generalized to other groups and other cultures.


NOTE ON THE RELATIONSHIP OF ORDINAL
POSITION, DOGMATISM, AND PERSONAL
SEXUAL ATTITUDES: A REPLY*


Psychology Service, Veterans Administration Hospital, Charleston, South Carolina;
and Department of Psychiatry, Medical University of South Carolina


DEAN G. KILPATRICK


Swanson (3) recently published an article commenting on the statistical
methodology of an article by Kilpatrick and Cauthen (2). The point of
Swanson’s article appeared to be that many of the significant findings of
the Kilpatrick and Cauthen study resulted from their using the Duncan
multiple range procedure (1) instead of the more conservative Newman-
Keuls procedure (4) to compare group means. Specifically, Swanson reported
that when the Newman-Keuls procedure was used on the data, only four of
the dogmatism and sexual attitude means were significantly different, as
contrasted with the seven significantly different means said to have been
reported by Kilpatrick and Cauthen. By implication, Swanson appeared to say
that the conclusion that dogmatism and sexual attitudes are related to ordinal
position is not supported by his reanalysis of the data.

Swanson is correct in saying that the Newman-Keuls is a more conservative
multiple comparisons procedure than the Duncan. However, certain
inaccuracies and misrepresentations contained in Swanson’s article should be
corrected.

First, Swanson inaccurately reported the original findings. For Sexual
Liberality Scales (SLS) scores, Kilpatrick and Cauthen reported (a) males
had higher SLS means than females; (b) only and firstborn males had lower
means than middle and last borns; (c) firstborn females had lower means than
last borns. For dogmatism scores (D scores), Kilpatrick and Cauthen found
(d) males had higher means than females; (e) male firstborns had lower
means than only, middle, and last borns; (f) male only children had higher
means than firstborns, but lower means than middle and last borns; (g) middle
borns had higher means than only and firstborns, but lower means than last
borns; (h) last-born males had higher means than all other male ordinal
position groups; (i) middle-born females had higher means than only and
last borns; (j) last borns had lower means than all other female ordinal
position groups; and (k) only children had lower means than middle borns
and higher means than last borns. Swanson, in summarizing these findings,
stated that the only significant findings were that (a) last-born females had
higher SLS means than firstborns, and (b) last-born males had higher D
score means than all other ordinal position groups. This was clearly a
misrepresentation of the Kilpatrick and Cauthen findings.

Secondly, Swanson evidently misinterpreted the results of his own Newman- 
Keuls procedure. Swanson stated that last-born females had higher D score 
means than all other female ordinal position groups while, in fact, an analysis
of the mean D scores contained in Table 2 of the original article (2) reveals 
that the exact opposite was the case. Last-born females were the least dog- 
matic of any group. Using impeccable statistical methodology is still no 
substitute for careful interpretation of the data after their analysis. 

Finally, even if fewer significant mean differences were found when the 
Newman-Keuls was used as Swanson reported, the conclusions to be drawn 
from the revised findings would not be substantially changed. Kilpatrick and 
Cauthen concluded that (a) the effects of ordinal position differed for males 
and females on both sexual attitudes and dogmatism; (b) there should be 
separate classifications for male and female only, first, middle, and last borns; 
and (c) there was a striking difference in dogmatism between male and fe- 
male last borns which, if replicated, might represent a source of confounding 
in research on authoritarianism and dogmatism. In the absence of any data from
Swanson that are inconsistent with these conclusions, it is maintained
that they are still tenable.


ANXIETY, SELF-CONCEPT, AND THE
SEMANTIC DIFFERENTIAL*


Washington University and John Cochran Veterans
Administration Hospital, St. Louis


LEON COWEN

A. INTRODUCTION 


Various authors have demonstrated an inverse relationship between anxiety 
and self-concept viewed at a single point in time (2, 5, 6, 7). Although some 
attempts have been made to look at antecedents (6) and components (2) of 
self-concept in its relationship to anxiety, the writer knows of no one who has 
employed the pervasive dimensionality of the Semantic Differential (3) to 
clarify which aspects of self-concept are preponderant in this relationship. 
However, it is highly suggestive that the Evaluative (E) dimension of the 
Semantic Differential has been viewed as a generalized attitude measure (3), 
and these studies usually assess self-concept by global self-report (attitude 
toward the self). 

To measure anxiety, studies of anxiety and self-concept have often made 
use of the Manifest Anxiety Scale (MAS), which is known to be highly cor- 
related with underlying personality factors, such as Eysenck’s Neuroticism 
(N). Peterson (4) suggested that such verbally defined personality dimen- 
sions may simply be variants of the general Semantic Differential structure. 
For personality trait ratings, Hallworth (1) did find that Evaluative scales 
marked the first underlying personality factor (akin to anxiety), Activity 
(A) scales the second, and Potency (P) the fourth. 

Consequently, this study sought to confirm the hypothesis that the
correlation of anxiety with the Semantic Differential rating of self on the dimensions
of E, P, and A would be sizable only in the case of E. In addition, an attempt
was made to investigate the effects of change over time in both anxiety and
self-concept on these relationships. Finally, the inverted U-shaped
relationship frequently found between measures of drive-arousal-anxiety and measures
of performance suggested that the supposed increase in performance attendant
on reduction of high anxiety might be reflected on the remaining self-rating
dimensions (A, P), as well as on E, perhaps as a component of improved
self-confidence (better [E] and stronger [P]), as suggested by the findings
of Weitzner, Stallone, and Smith (7).


TABLE 1
PRE AND POST CORRELATIONS OF ANXIETY (AACL) WITH THE E, P, AND A
DIMENSIONS OF THE SEMANTIC DIFFERENTIAL
MEASURE OF SELF-CONCEPT

Dimension     Pre      Post
E            −.66     −.63
P            −.04     −.69
A            +.02     +.15


B. METHOD


Subjects presumably high in anxiety (and low in self-evaluation) were 
selected and tested by the author. They were 16 male patients entering a 
crisis-intervention day hospital program. Anxiety and self-concept were assessed 
upon admission and discharge, with a median interval between administra- 
tions of 27.5 days. 

Anxiety was measured by the Anxiety scale (AACL) of the Multiple 
Affect Adjective Check List, Today form (8, 9), since past results indicate 
that the AACL is sensitive to short-term change (10). Eleven Semantic 
Differential scales representing the Evaluative, Potency, and Activity dimen- 
sions were used to assess self-concept. 




C. RESULTS


Pearson product-moment correlation coefficients were computed between
the AACL scores and the E, P, and A scores of the Semantic Differential
for both administrations (Table 1). As expected, initial assessment revealed
a high negative correlation between anxiety and E self-concept (-.66), and
zero correlations between anxiety and P (-.04), and anxiety and A (+.02).
The only change on the second administration was the appearance
of a high negative correlation between anxiety and P self-concept (-.69).
To verify that change in anxiety and self-concept did occur, t tests for
correlated means were run on the pre and post AACL and Semantic Differential
dimension (E, P, A) mean scores (Table 2). AACL anxiety diminished
significantly (p < .01), while self-concept increased on E (p < .05) and
P (p < .01), but A remained about the same.


TABLE 2
PRE AND POST MEANS AND DIFFERENCES FOR ANXIETY (AACL) AND THE E, P, AND A
DIMENSIONS OF THE SEMANTIC DIFFERENTIAL
MEASURE OF SELF-CONCEPT

Measure      Pre      Post     Change
AACL        12.4      6.9     -5.5***
E            4.4      5.5     +1.1*
P            3.6      4.8     +1.2**
A            4.6      4.4     -0.2

* p < .05.
** p < .01.
*** p < .01.
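
The two statistics named in this section can be sketched in a few lines. The following is a minimal illustration only: the function names are hypothetical, and the scores are invented for the example and are not the study's data. It computes a Pearson product-moment correlation and the t statistic for correlated (paired) means.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlated_means_t(pre, post):
    """t statistic and df (n - 1) for the mean of paired post-minus-pre differences."""
    diffs = [b - a for a, b in zip(pre, post)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)
    return mean_d / math.sqrt(var_d / n), n - 1


# Hypothetical scores for five patients (illustration only, not study data).
anxiety_pre = [14, 12, 15, 10, 11]
anxiety_post = [8, 7, 9, 6, 4]
evaluative_pre = [3.5, 4.0, 3.0, 5.5, 5.0]

r = pearson_r(anxiety_pre, evaluative_pre)
t, df = correlated_means_t(anxiety_pre, anxiety_post)
print(f"r = {r:.2f}, t({df}) = {t:.2f}")
```

A negative t here indicates that the post scores are lower than the pre scores; the probability level would then be read from a t table at the printed degrees of freedom.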


D. DISCUSSION


The present results confirm the hypothesized negative relationship between
anxiety and Evaluative self-concept, and extend it to actual change over time.
In revealing only negligible correlations between anxiety and self-ratings on
the Semantic Differential dimension of Activity, which on the surface bears
a conceptual relationship to anxiety, this study tends to support Peterson’s
(4) contention that verbally defined personality dimensions, such as the
anxiety-loaded Neuroticism factor of Eysenck, are variants of the general
Semantic Differential structure.

The finding of a high negative correlation between anxiety and Potency
self-concept, only after anxiety had been reduced from a high level, suggests
that uniform changes in anxiety are sufficient to bring about changes in aspects
of self-concept, viewed perhaps in terms of a combination of E and P
dimensions, i.e., self-confidence (7). However, with respect only to P, these
findings tentatively support Mukherjee’s hypothesized etiology of anxiety
in Perseverance (2). Repeated measurements at shorter intervals would be
necessary to establish firmly that anxiety reduction does precede increase on
self-concept.

Future researchers in this area might employ behavioral and physiological
measures of anxiety to see whether the relationship between anxiety and
self-concept can be clearly extended beyond the semantic level. They might also
investigate whether it is possible to disengage temporarily verbalized anxiety
from E self-concept by such manipulations as induced failure experiences,
positive or negative feedback from others about the self (on the E dimension),
administration of antianxiety drugs, or posthypnotic suggestion.




E. SUMMARY

A Semantic Differential measure of self-concept and an adjective check
list measure of anxiety were administered to 16 patients in a day hospital
program upon admission and discharge. Correlations between anxiety and the
three Semantic Differential dimensions (Evaluative, Potency, and Activity)
were significant for E at first testing (-.66) and for E (-.63) and P
(-.69) at the post testing. The emergence of a high negative correlation
between anxiety and P on the second testing suggested the possibility that
change in anxiety may be etiologically prior to change in certain aspects of
self-concept. Suggestions for future research were given.


THE PSYCHODYNAMICS OF HUMOR, AS SEEN IN
INSTITUTIONALIZED RETARDATES* 


Willowbrook State School, Staten Island, New York 


GABRIEL PUSTEL, MANNY STERNLICHT, AND LOUIS SIEGEL


A. INTRODUCTION 


Freud (5) postulated that the function of humor is relief from the burdens
of reality, via a primitive and transient regression to childish triumphs.
Bergler (1) and Reik (8) have emphasized the self-aggressive component
in humor; Bergler also has asserted that the basis of all humor is a denial of
one’s fears. The butt of a joke is an artificially created masochistic victim
who is a projected image of the listener himself (thus resulting in a Pyrrhic
victory by the ego over the superego). Every joke represents a kind of
“information, please” type of game which reactivates a childhood situation of
lack of self-possession, and the understanding of the joke proves to the “child”
in the adolescent and adult that he has mastered a “forbidden” knowledge
and is thereby accepted into the confidence of the adult world. In this vein,
Kris (6) conceived of humor as a way of mastering former fears.

Perhaps the most elaborate psychodynamic formulation of children’s humor
has been offered by Wolfenstein (11), who postulated three stages of emotional
development as a background for joke preferences: (a) at ages 4-5,
there is word play on proper names as ambiguously used, emphasis on motoric
acts, concern with being loved and not loved, and problems of sexual identity;
(b) at ages 6-11, a concern with riddles reflecting a need to know (rather
than to look), avoidance of dangerous thoughts through disguise, reduction
of Oedipal wishes to absurdity, and deflation of parental authority; (c)
at ages 12-21, a concern with rhymes and anecdotes reflecting a devaluation
of the need for answers, and parents viewed as fallible. Although noting
that there may be considerable overlapping of these stages in adults, Wolfenstein
also concluded that, in children’s humor, the main thrust is to transform
pain into pleasure by partial mastery of fear and enjoyment of scary feelings.

Most of the experimental studies of humor have not been concerned with
the underlying psychodynamics, but rather with either cognitive developmental
processes (12), personality dispositions (4, 9, 10), or affect arousal
through sex or aggressive facilitators (2, 3, 7). All of these studies have
utilized prepared jokes or cartoons, in a passive fashion, and with one exception
(12) have dealt primarily with psychiatric or intellectually normal
populations.

* Received in the Editorial Office on October 18, 1971, and published immediately

The present study utilizes a Freudian type of approach in an attempt to 
analyze the productions of humor by institutionalized retardates, following 
Wolfenstein’s formulations of the underlying psychodynamics; we were not 
concerned with personality-societal preferences or affect-arousal facilitation. 


B. SUBJECTS AND PROCEDURES


This experiment employed 60 mildly mentally retarded institutionalized Ss
(at Willowbrook State School). The 60 Ss consisted of 30 adolescents (15
females and 15 males) and 30 adults, also dichotomized according to sex.
Within each age group, the males and females were matched for CA, MA,
and IQ. The mean adolescent CA was 16.3 (range from 12-20), with an
average IQ of 59.8 (range from 50-71) and a mean MA of 8.5 (range from
5-12), while the mean adult CA was 33.7 (ranging from 21-54), with an
average IQ of 62.9 (ranging from 50-74) and a mean MA of 10.9 (ranging
from 8-13).

Each S was seen individually (by the senior author) for approximately
15-30 minutes, during which time the investigator tried to establish some
degree of rapport with the resident and attempted to elicit a joke (“Tell
me a joke.”) from the S. (An effort was made to obtain more than one joke
from each participant, but this proved fruitless as most of the Ss were unable
to recall more than one joke.) Each joke was categorized basically according
to Wolfenstein’s classification (with some slight modification), and to Freud’s
psychosexual developmental stages and psychodynamic concepts.


TABLE 1
AN ANALYSIS OF RETARDATES’ HUMOR AS RELATED TO REBELLION:
A FREQUENCY AND PERCENTAGE DISTRIBUTION

                 Rebellion     Rebellion with
Subjects         succeeds      [illegible]       [illegible]
Male
  Adults          9 (60%)       5 (33%)           1 ( 7%)
  Adolescents     9 (60%)       3 (20%)           3 (20%)
Female
  Adults         10 (67%)       5 (33%)           [illegible]
  Adolescents    11 (72%)       4 [illegible]     [illegible]

TABLE 2
AN ANALYSIS OF RETARDATES’ HUMOR AS RELATED TO MISFORTUNE:
A FREQUENCY AND PERCENTAGE DISTRIBUTION

                 Misfortunes   Misfortunes   No
Subjects         of others     of self       misfortunes
Male
  Adults         10 (67%)       3 (20%)       2 (13%)
  Adolescents    12 (80%)       1 ( 7%)       2 (13%)
Female
  Adults         12 (80%)       2 (13%)       1 ( 7%)
  Adolescents     8 (53%)       2 (13%)       5 (33%)

Z = 5.38, p < .01.


C. RESULTS AND DISCUSSION


Table 1 illustrates that the jokes of the mentally retarded individuals
more often displayed an unconscious wish for successful rebellion than a
fantasy that rebellion would end in some type of retaliation from the
environment. These results are significant at the .002 level of confidence and
seem to support Freud’s contention that the essential unconscious motive in
joke enjoyment is rebellious irreverence: one feels oppressed by adults, and
thus compensates by hoping for an infantile narcissistic victory.

Table 2 indicates that the mentally subnormal individuals displayed an
unconscious projected wish for the tribulations of others, as opposed to either
an unconscious component of masochism or the neutralization of the nature
of the misfortune. These results are statistically significant at the p < .01
level. These findings again seem to support Freud’s assertion that the
satisfaction of jokes comes from the misfortunes of others as a compensatory
partial triumph (temporary regression to childish triumphs) over distress.

Table 3 presents results that suggest that the mentally retarded individuals
did not follow the developmental trends described by Wolfenstein, i.e., that
younger children have a propensity toward riddles, and adolescents favor
anecdotes-rhymes. These results indicate that, on the whole, mentally
subnormal individuals performed atypically relative to normals.


TABLE 3
AN ANALYSIS OF RETARDATES’ HUMOR, ACCORDING TO [remainder of title illegible]

                               Anecdotes-
Subjects         Riddles       Rhymes        Total   Significance
Male
  Adults         [illegible]   [illegible]    15     N.S.
  Adolescents     2            13             15     Significant at .01
Female
  Adults          4            11             15     Significant at .05
  Adolescents     8             7             15     N.S.


TABLE 4
AN ANALYSIS OF RETARDATES’ HUMOR, CLASSIFIED ACCORDING TO FREUDIAN
STAGES OF DEVELOPMENT: A FREQUENCY AND PERCENTAGE DISTRIBUTION

                 Oral        Anal        Phallic
Subjects         themes      themes      themes      Total
Male
  Adults         3  20%      4  26%       8  54%      15
  Adolescents    2  13%      1   7%      12  80%      15
Female
  Adults         6  40%      1   7%       8  53%      15
  Adolescents    1   7%      1   7%      12  86%      14

Table 4 depicts results that demonstrate that the mentally retarded
adolescents, but not the adults, revealed a propensity toward jokes with phallic
intonations. This broad finding would seem to suggest an unconscious
striving for masculinity. Their preoccupation with phallic symbols may be
a reaction formation against feelings of inferiority.

In summary, we found support for Freud’s postulate that humor is not
resigned but represents a triumph of the pleasure principle over adverse
circumstances. Our results basically show that mildly retarded (IQ ranging
from 50 to 74) institutionalized individuals preferred jokes that unconsciously
reflect motives of irreverent rebelliousness. This is not to say that unconscious 
self-aggressive motives were not to be found in their humor, but it was not 
their major preference. We cannot, therefore, discount entirely the view- 
points of Reik and Bergler that the superego is not very permissive to the 
ego, resulting in self-aggressive humor, since our population did engage in 
some masochistic humor. 

We also found that Wolfenstein’s developmental levels were significantly
applicable to our male adolescents and female adults, but not to our male
adults and female adolescents. Possibly the contradictory findings depicted
in Table 3 suggest that, as is the case with physical maturation, the
developmental progressions in institutionalized retardates are so muted as to
be highly overlapping, with no sharp lines of demarcation. Thus, high grade
retardates reacted atypically from normal populations in relation to
Wolfenstein’s riddle vs. anecdote-rhyme category.

Lastly, their humor reflected unconscious concern with phallic symbolism, 
and this may indicate a reaction formation against feelings of inferiority. 

Thus, these findings show that the humor of mildly retarded institutionalized
individuals represents psychodynamically a striving for compensatory
superiority due to feelings of lack of self-possession.

This primitive-temporary denial or re-evaluation of reality by the pleasure
principle as a compensatory technique was the major method used by our
population, but was not the exclusive type of humor used. Although the results
are not statistically significant, we duly take note also of the masochistic side
(superego component) of humor as seen in the fear of retaliation for rebellion
and even outright failure of rebellion.
PERSONALITY DIFFERENCES IN THE RELATIONSHIP
BETWEEN FAMILIARITY AND LIKING* 


University of Maine 
COLIN MARTINDALE


A. INTRODUCTION 


Research by Zajonc (10) has shown that a high positive relationship exists 
between frequency of exposure to, or mere familiarity with, a stimulus and 
subjects’ ratings of liking for the stimulus. This relationship seems to form 
the basis for the large portion of variance in interpersonal attraction accounted 
for by propinquity. Harrison (1) ascribes the relationship between familiarity 
and liking to the negative affect consequent upon the increased tension pro- 
duced by the response competition which is elicited by novel stimulation. If 
Harrison’s hypothesis is correct, the relationship between familiarity and liking 
should vary as a function of the average basal level of arousal of the subject 
population under study. If we accept an equilibration hypothesis, we should 
expect individuals with subnormal levels of arousal to tolerate and even seek 
out larger increases in tension than those with supranormal levels of arousal.
The tendency for familiarity and liking to covary should therefore be greater
for the latter group, since novelty with its concomitant increases in level of
tension should be more aversive to this group. Martindale (4) suggests that
creative subjects are characterized by lower basal levels of arousal than are
uncreative subjects. If this is true, the relationship between familiarity and
liking should be less in creative than in uncreative subjects.

The above line of reasoning would not be expected to hold with subjects 
exhibiting more extreme deviations in basal level of arousal. According to the 
law of initial values (9), reactivity is an inverse function of the basal level 
of arousal. Thus, subjects with a very low level of arousal should show large
increases in tension with a degree of response competition that would produce
a small increase in tension in subjects with very high levels of arousal who are
already near the upper limit of arousal. Novelty should, then, cause great
discomfort for the former type of subjects, but be neither particularly
pleasurable nor unpleasurable for the latter. Venables (8) and others have
argued that acute schizophrenics are characterized by low levels of arousal,
while chronic schizophrenics are characterized by high levels of arousal. We
should expect, then, that the relationship between familiarity and liking
should be greater in acute schizophrenics than in chronic schizophrenics. The
relationship should be greater than in a normal population in the case of
acute schizophrenics, and less for chronic schizophrenics.

It might be argued that the relationship between familiarity and liking is 
the result of simple conditioning. People are probably characterized by a posi- 
tive hedonic tone more often than not. Stimuli more frequently encountered 
have more chances to be paired with positive affect and are thus better liked
than novel stimuli. Since creative individuals admit to more anxiety (3) and
since chronic schizophrenics probably have experienced a good deal more
negative affect than most people, the expected pattern of findings might be
more simply explained as due to more frequent association of familiar stimuli
with negative affect in these groups. If this argument is correct, then psychi- 
atric patients suffering from depression would be expected to exhibit a very 
low or perhaps even a negative relationship between familiarity and liking. 


B. METHOD


Creative and uncreative subjects were selected from students in a large 
introductory class at the University of Maine on the basis of scores on the 
Remote Associates Test (5). High creative subjects were defined as those in 
the top quartile of the distribution, and low creatives as those in the bottom 
quartile. There were 33 high creative subjects with an average Remote
Associates Test score of 23.1. There were 30 low creative subjects with an
average Remote Associates Test score of 10.8.

Psychiatric subjects were successive admissions to a ward at the Veterans 
Administration Hospital, Palo Alto, California. Assignment to schizophrenic 
and depressive groups was on the basis of psychiatric diagnosis. Acute
schizophrenics were defined as schizophrenics for whom this was their first
admission, chronics as those with one or more previous admissions. Fourteen
chronic schizophrenics with an average age of 44, nine acute schizophrenics 
with an average age of 23, and 10 depressive (neurotic and psychotic) patients 
with an average age of 44 were tested. 

The relationship between familiarity and liking in each group of subjects
was assessed by having subjects make ratings of liking on a seven-point scale
of 10 fruits, flowers, trees, cities, and countries. Subjects were asked how
well they would like to live in the cities and countries. The 10 items in each
category were the same as those used by Zajonc (10), who selected items
such that the frequency of appearance in the English language, based on the
Thorndike and Lorge word counts (7), could be arranged in increasing
constant log units. The rationale behind the procedure is that more frequent
words should be more familiar. For each group, Spearman rank correlations
were computed between average liking rating and frequency of usage.
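
The rank correlation described above can be illustrated with a short sketch. The function names are hypothetical, and the frequencies and ratings below are invented for the example; they are not the Thorndike and Lorge counts or the ratings obtained in the study. Tied values are given the mean of the ranks they span, and the coefficient is then the ordinary product-moment correlation of the two sets of ranks.

```python
def average_ranks(values):
    """Rank values from 1 (smallest); tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical word frequencies and mean liking ratings (1-7) for 10 items.
frequency = [2, 5, 9, 18, 30, 61, 120, 250, 480, 1000]
mean_liking = [3.1, 2.8, 3.9, 4.2, 4.0, 4.8, 5.1, 5.0, 5.9, 6.2]
print(round(spearman_rho(frequency, mean_liking), 2))
```

Because only ranks enter the computation, spacing the items in constant log units of frequency, as in the procedure described above, does not by itself change the coefficient.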


C. RESULTS


Table 1 gives the correlations between frequency and liking for each of 
the groups. The correlations obtained by Zajonc (10), using a large number 
of normal raters, are included for comparison. 


TABLE 1
SPEARMAN CORRELATIONS BETWEEN FREQUENCY AND LIKING

                                         Category
Sample                   Trees   Flowers   Countries   Fruits   Cities   X̄z
High creative             .84      .82       .77         .69     -.42    .64
Low creative              .91      .95       .84         .79     -.38    .78
Normal (10)               .84      .89       .89         .81      .85    .86
Chronic schizophrenic     .92      .79       .69         .69     -.03    .70
Acute schizophrenic       .94      .92       .88         .55      .09    .80
Depressive                .87      .93       .88         .45     -.25    .73


The hypotheses suggested above were tested by converting the correlations
to z scores and testing the significance of the difference between average z
scores (2, p. 140). For the hypothesis that the relationship between familiarity
and liking is higher in uncreative than in creative subjects, z = 1.22 (p <
.12). However, if the aberrant correlations involving cities are excluded, z =
1.32 (p < .10). All of the correlations for creative subjects are lower than
those for uncreative subjects. If we use the sign test (6, pp. 68-75) to assess
the probability of this under the null hypothesis of no difference, x = 0 (p =
.03). For the schizophrenic groups, the relationship between frequency and
liking is significantly lower in chronic schizophrenics than in normals: z =
1.78, p < .05. However, the difference is no longer significant if the
correlations for cities are left out. Correlations for the acute schizophrenics are
not significantly different from those for chronic schizophrenics or normals.
Correlations for the depressive patients are not significantly different from
those of normals, suggesting that the relationship between frequency and liking
is not simply a matter of conditioning.
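
The averaging step in this procedure can be sketched as follows, on the assumption that the z scores referred to are Fisher's transformation, z = atanh(r); the function name is hypothetical. The significance test for the difference between average z scores (2, p. 140) is not reproduced here, since its exact form is not given in the text.

```python
import math


def mean_correlation(rs):
    """Average correlations by way of Fisher's z = atanh(r), then back-transform."""
    zs = [math.atanh(r) for r in rs]
    return math.tanh(sum(zs) / len(zs))


# Low creative row of Table 1: trees, flowers, countries, fruits, cities.
low_creative = [0.91, 0.95, 0.84, 0.79, -0.38]
print(round(mean_correlation(low_creative), 2))
```

Applied to the low creative row, this gives .78, which agrees with the tabled mean for that group; the agreement is consistent with, but does not prove, the assumption that the tabled means were obtained in this way.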


D. DISCUSSION


For all of the groups tested, the correlations between frequency and liking
of cities are markedly lower than those obtained by Zajonc. In part, this may
be due to the fact that Zajonc’s subjects were midwesterners; only four of
the 10 cities rated were coastal ones. However, an avoidant reaction to large
cities would be understandable on the basis of the pairing of such cities with
all manner of aversive stimuli in the news media in the several years since
Zajonc’s studies. Such conditioning is not readily possible with neutral stimuli,
such as trees and flowers. Thus the discrepant correlations point to an
interesting pattern in the other correlations: the correlations between
familiarity and liking are higher for all of the groups for neutral stimuli. With
ego-involving instructions (asking subjects to rate how much they would like
living in a country or a city) and with fruits (where subjects have consumed
at least the more common ones) the correlations are lower for all groups than
those for flowers and trees. Table 2 gives average correlations for flowers and
trees vs. fruits and countries. In all cases the former are larger than the latter.


TABLE 2
MEAN SPEARMAN CORRELATIONS BETWEEN FREQUENCY AND LIKING FOR NEUTRAL
AND PERSONALLY RELEVANT CATEGORIES

                                     Category
                         Trees and     Countries
                         flowers       and fruits
Sample                   X̄z            X̄z            Difference
High creative            [illegible]   [illegible]   [illegible]
Low creative             [illegible]   [illegible]   [illegible]
Normal (10)              .87           .86           .01
Chronic schizophrenic    .87           .69           .18
Acute schizophrenic      .93           .76           .17
Depressive               .91           .74           .17


With use of the sign test (6), x = 0 (p < .05, two-tailed test). Further,
the degree of discrepancy is markedly larger in the clinical groups than in the
normal ones (normals, creative, uncreative): with use of the Mann-Whitney
U test (6), U = 0 (p = .10, n1 = n2 = 3, two-tailed test). Thus, personal
involvement attenuates the relationship between familiarity and liking, and
more so with clinical than normal samples. Given that the clinical groups are
more characterized by a negative hedonic tone than the normals, the pattern
of results does suggest the importance of conditioning for the relationship.
The discrepant correlations involving cities cloud inferences concerning the
major hypotheses, but there does seem to be a fairly clear trend for chronic
schizophrenics and creative subjects to exhibit attenuated relationships between
familiarity and liking, the latter because they value novelty and the former
because of their lessened reactivity to both novel and familiar stimuli. It is
interesting to speculate whether some of the deviant interpersonal relations of
these two groups (the withdrawal of the schizophrenics and the independence
of judgment and lessened conformity of the creatives) are due to the fact
that propinquity and resultant familiarity do not lead to interpersonal
attraction to the extent that they do in other groups.
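
The probability levels quoted for the sign test and the Mann-Whitney U test can be checked by direct enumeration. The sketch below is illustrative only: the function names are hypothetical, the exact Mann-Whitney routine assumes no tied values, and the difference scores passed to it are invented (three small and three large values) rather than taken from Table 2.

```python
from itertools import combinations
from math import comb


def sign_test_p(n_pairs, n_minority, two_tailed=True):
    """Exact binomial probability of n_minority or fewer minority signs."""
    tail = sum(comb(n_pairs, k) for k in range(n_minority + 1)) / 2 ** n_pairs
    return min(1.0, 2 * tail) if two_tailed else tail


def mann_whitney_exact(a, b):
    """Smaller U and its exact two-tailed probability, assuming no tied values."""
    pooled = list(a) + list(b)

    def smaller_u(first_idx):
        chosen = set(first_idx)
        first = [pooled[i] for i in chosen]
        second = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        u = sum(1 for x in first for y in second if x > y)
        return min(u, len(first) * len(second) - u)

    observed = smaller_u(range(len(a)))
    # Enumerate every way of assigning len(a) of the pooled values to group a.
    splits = list(combinations(range(len(pooled)), len(a)))
    extreme = sum(1 for s in splits if smaller_u(s) <= observed)
    return observed, extreme / len(splits)


# Six samples, all differing in the same direction (two-tailed sign test).
print(sign_test_p(6, 0))
# Hypothetical difference scores: three nonclinical vs. three clinical groups.
print(mann_whitney_exact([0.01, 0.10, 0.11], [0.16, 0.17, 0.18]))
```

With six of six differences in the same direction the two-tailed sign test probability is 2/64, or about .03, and with complete separation of two groups of three the two-tailed Mann-Whitney probability is 2/20 = .10, in line with the levels reported above.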
EFFECTS OF A SELF-IMPROVEMENT COURSE ON
SELF-CONCEPTS OF ADOLESCENT FEMALE CLIENTS* 


Sunbeam Home and Family Service, Oklahoma City 


CORLISS COLLINS


A. INTRODUCTION 


Although many data appear in the literature concerning the self-concepts 
of delinquents, mental patients, the aged, addicts, criminals, handicapped 
people, those with high intelligence and academic accomplishment, and people 
in various social classes, relatively little documented effort has been directed 
toward attempts to induce changes in these self-concepts. Some of the methods 
that have been used, in addition to counseling and psychotherapy, include 
occupational and physical therapy, music and art therapy, business speaking 
courses, and self-improvement courses. 

With respect to the latter and on the basis of a demonstrated relationship
between a person’s self-concept and the way he views his body (1, 6), attempts
have been made to alter self-concepts by presenting educational material
covering personal grooming, posture and poise, dating etiquette, etc. For example,
Biles (2) obtained significant changes in the sorting of self-referent statements
(a Q-Sort design by Page and Pattinato) following each of two different
types of exposure of college freshmen women to a basic physical education
course (physical self-image, relaxation, strength, cardiovascular endurance,
posture, balance, rhythm, and other topics); a control group showed no
significant change.
In another study, James, Osborn, and Oetting (5) exposed a group of
“acting-out adolescent girls” to a series of 10 self-improvement classes (hair
grooming, make-up, posture, etc.). Pre- and posttesting involved the
Draw-A-Person test, a check list of personal characteristics, and a special
sentence-completion test. Although the only significant change was an increase in the
size of female figures on the Draw-A-Person test, the authors concluded that
all of the tests showed changes suggesting greater feminine identification, less
hostility to authority, and greater openness to criticism.

Finally, Cole, Oetting, and Miskimens (3) compared pre- and posttreatment
measures of self-concept, ideal self-concept, and anxiety in a study of 14
adolescent females with histories of delinquent and acting-out behaviors. Five
subjects were in group therapy led by professional mental health workers; nine
subjects were in a similar group run by community volunteers. A third
(control) group contained eight subjects of comparable age and academic
performance, but with no record of acting-out behavior. The therapy groups were
exposed to a 10-week socialization program (make-up, hair styling, exercises 
and diet, clothes and accessories, etc.) designed to produce positive changes in 
self-concept. All subjects were given tests that provided measures of the con- 
cepts “Me as I am today” and “My ideal-self.” Although there were no 
significant differences among the pre- and posttest means for the three groups, 
the variances of the two therapy groups were significantly greater than that of 
the control group during the pretesting; during posttests, however, the vari- 
ances among the groups were not significantly different because of a reduction 
in extreme scores made by subjects in the two therapy groups. The reduction 
was accounted for “primarily by changes in the level of value placed on the 
self-concept rather than by ideal self changes.” There were no significant 
differences among the groups in pre- and posttesting of measured anxiety (The 
Anxiety Differential). 

The lack of significant group differences in mean scores in the study by Cole
et al. (3), and the single significant change obtained by James et al. (5) 
regarding the size of female figures on the Draw-A-Person test, may be 
attributed, in some part at least, to the types of subjects studied and the types 
of scales used by those authors to measure areas of change in self-concept after 
a self-improvement program. The present study is specifically interested in 
the premise that the self-concepts of female adolescents with varied combina- 
tions of socioeconomic and emotional problems can be altered after the girls 
have gained knowledge of and skill in personal health, hygiene, grooming, and 
social behavior. It reports the results of a self-improvement program supporting 
casework therapy and aimed at producing positive self-concept changes as 
measured by the Tennessee Self Concept Scale (4).


B. METHOD

1. Subjects

The subjects were 30 female adolescents between 13 and 18 years of age,
all of whom had some combination of financial and behavioral, academic,
truancy, or emotional problems. They were divided into three groups,
approximately equal in average age, as follows.

(a) Self-Improvement (S-I) Group. Twelve subjects were enrolled in a
self-improvement course. The group comprised teenagers whose families, in
part or in whole, were involved in casework therapy with a social worker
(the author) at the Sunbeam Home and Family Service, Oklahoma City,
Oklahoma. The families had been referred to the Family Service Agency
by various schools, the welfare agency, and other social organizations. Five
of the girls were being seen individually in counseling; four were siblings
of the five; and three girls were children of an emotionally disturbed
client. These girls represented all of the available adolescent female
clients of the Agency, during a particular block of time, for whom it was felt
that a self-improvement course would be of benefit.

(b) Casework-Therapy (CW-TH) Group. Eight subjects, all of whom
had been referred to the same Family Service Agency, were involved only in
casework therapy; four were in group therapy, and four in individual therapy.
These girls represented all the available adolescent female clients of the
Agency, during a particular block of time, for whom it was deemed that a
self-improvement course would be of benefit, but none received such a course.

(c) Control (C) Group. Ten subjects, similar to the S-I and CW-TH
groups in age, socioeconomic level, academic performance, and truancy,
were selected as a control group from the public schools. These teenagers
were identified as girls with school-behavior problems who could benefit
from casework therapy and a self-improvement course; they represented all
such females who were available for pre- and posttesting in three approved
schools.

2. Test Instrument

Nine areas of the Tennessee Self Concept Scale (4) provided measures in
four general and five specific categories. The former comprised Identity
(“This is what I am.”); Self-Satisfaction (“This is how I feel about
myself.”); Behavior (“This is what I do.”); and Self-Criticism (a measure of
defensiveness). The five specific areas included Physical Self (view of body,
state of health, physical appearance, skills, and sexuality); Moral-Ethical
Self (moral worth, relationship to God, feelings of being a “good” or “bad”
person, and satisfaction with religion or the lack of it); Personal Self (sense
of personal worth, feeling of adequacy as a person, and evaluation of
personality apart from body or relationship to others); Family Self (feelings of
adequacy, worth, and value as a family member); and Social Self (self as
perceived in general relation to others). A total “value” score, derived for each
subject, was readily compared to percentile norms established for the test.

The test instructions were read aloud to the subjects with the exception
of the paragraph asking for the reporting of “time started” and “time
finished.” The girls were allowed as much time as needed for test completion
and were additionally encouraged to give honest answers to the questions.
Subjects were told that the purpose of the testing was to help them to express
how they felt about themselves and to give them an opportunity to think in
terms of several “self” areas; the purpose of the posttest was to provide a
second opportunity for self-evaluation.

3. Procedure 
A five-week interval separated pre- and posttesting sessions. For the C
Group, testing was conducted in two junior high schools and one senior high
school. Since the project was officially approved by the Research Committee, 
Oklahoma City Public Schools, the subjects were released from classes specifi- 
cally for testing. There was no other contact with these students between 
testing. 

The members of the CW-TH Group were tested at the Family Service 
Agency during their regularly scheduled counseling sessions. These girls were 
seen weekly in therapy between the pre- and posttesting period. 

The S-I Group went through a five-week self-improvement program (one 
three-hour session per week). The pretest was given during the first meeting; 
the posttest, during the last meeting. The self-improvement program was 
designed to cover several areas including personal hygiene, make-up, hair care, 
clothing, visual poise, manners, and proper conduct with boys; there were 
also group discussions initiated by the girls. A professional model served as 
the volunteer instructor; the social worker observed and assisted in group 
discussions. The owner of a beauty college donated his services on proper
hair care and style. Group members who were in individual therapy also 
continued in those sessions on a one-hour-per-week basis. 


C. Results


Means and standard deviations for scores in each of the nine categories were 
computed for the first and second administrations of the Scale. These values 
are in Table 1. Mean scores (pre- and posttests) for all categories except
Self-Criticism fell below the 50th percentile norms for the Scale, which sub-
stantiates that these females had poor self-concepts. Mean Self-Criticism
scores (pre- and posttests) were above the 50th percentile norms (between
50-72), indicating that all three groups of adolescents were as open to self-
criticism as the upper half of the standardization population.

Differences between mean scores were evaluated statistically; t tests for
correlated data were calculated by categories for each pair of administrations,
as follows.
TABLE 1
MEANS AND STANDARD DEVIATIONS FOR SCORES MADE BY THE THREE GROUPS
IN EACH TEST CATEGORY OF THE TENNESSEE SELF CONCEPT SCALE
FOR EACH OF THE TWO ADMINISTRATIONS OF THE TEST

                                     Test administration
                            S-I Group       CW-TH Group      Controls
                            (N = 12)         (N = 8)         (N = 10)
Test category               1st    2nd      1st    2nd      1st    2nd
Self-Criticism        M     52.2   53.8     55.4   50.5     51.9   50.9
                      SD     7.4    9.8      8.2   12.0      9.4    9.2
Identity              M     32.5   40.2**   27.5   32.4     32.8   31.1
                      SD    13.6   16.0     10.4   16.9     12.4    8.5
Self-Satisfaction     M     [?]    [?]      [?]    [?]      [?]    [?]
                      SD    [?]    [?]      [?]    [?]      [?]    [?]
Behavior              M     33.4   36.2     29.8   34.4     33.9   31.?
                      SD    11.2   12.8      7.7   14.6     12.2    8.3
Physical Self         M     34.3   42.?     32.6   37.4     36.2   35.9
                      SD    11.0   11.4      7.2   14.7     17.1   12.6
Moral-Ethical Self    M     37.5   43.0*    34.5   37.9     31.7   29.5
                      SD     8.0   11.9      9.0   16.9      9.8    7.0
Personal Self         M     43.0   45.5     37.8   40.8     38.7   41.4
                      SD    15.2   14.9     [?]    17.7     12.6    9.0
Family Self           M     36.8   37.6     25.3   30.6     32.2   33.7
                      SD     9.9   13.6     10.6   14.0     11.2   11.4
Social Self           M     39.3   45.8     34.3   41.6*    38.7   39.2
                      SD    11.2   13.4     10.5   15.8      7.7    6.4
 * p < .05.
** p < .01.


(a) Self-Improvement Group. Each response category showed pre- to
posttest increases in scores for the S-I Group. Four of the increases were
statistically significant, viz., Identity (t = 3.75), Self-Satisfaction (t =
2.23), Physical Self (t = 3.00), and Moral and Ethical Self (t = 2.24).

(b) Casework-Therapy Group. While the scores for the CW-TH Group
increased in all categories except Self-Criticism, only one change, in Social
Self, was significant (t = 2.43). The Self-Criticism score decreased
insignificantly.

(c) Control Group. The Self-Satisfaction scores of the C Group showed
a significant increase (t = 4.22) from the pre- to posttest period. Of scores
in the other categories, there were five that decreased slightly and three that
increased slightly, a random variation that might be expected for a group where
no changes were introduced between testing dates.
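The t tests for correlated data reported above compare each subject's first and second scores. As a minimal sketch, assuming the usual paired-difference formula (the article does not print its computations) and using made-up scores rather than data from the study, the statistic can be computed as follows:

```python
import math


def correlated_t(pre, post):
    """Return (t, df) for a correlated (paired) t test on two score lists."""
    if len(pre) != len(post) or len(pre) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    diffs = [b - a for a, b in zip(pre, post)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    # Sample variance of the differences (n - 1 in the denominator).
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)
    se = math.sqrt(var_d / n)  # standard error of the mean difference
    return mean_d / se, n - 1


# Hypothetical pre/post scores for five subjects (not data from the study).
pre = [30, 32, 35, 28, 40]
post = [33, 36, 34, 31, 45]
t, df = correlated_t(pre, post)
print(f"t = {t:.2f}, df = {df}")
```

With 12, 8, and 10 subjects, the three groups would have 11, 7, and 9 degrees of freedom respectively under this formula.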


D. Discussion 


Despite the relatively small number of subjects tested, these results indicate
significant improvement in self-concepts for girls participating in the self-
improvement course. For the C Group, only the Self-Satisfaction score showed
a significant change. Similarly, only one score, that for Social Self, increased
significantly for the CW-TH Group (although, with an N of 8, this grouping
may not have been adequately evaluated). With respect to the latter, it had
been anticipated that, since all of these teenagers had been involved in therapy
for several weeks prior to pretesting, no especially marked changes were
likely to occur over the five-week pre- to posttest interval. However, the
significant improvement in the Social Self category can be attributed to dis-
proportionate increases in Social Self scores by the four girls who were in-
volved in group therapy (the other four were in individual counseling).
Apparently, the group experiences were effective in increasing the area of
social confidence measured by the Scale.

It was anticipated that scores in the following categories might show signi- 
ficant positive change in the S-I Group after the self-improvement course: 
Identity, Self-Satisfaction, Physical Self, and Social Self. The first three of 
these, as well as Moral-Ethical Self, showed the improvement, but Social 
Self did not. The fact that the positive change obtained for scores in the Social
Self category was not significant may be attributed primarily to one girl whose
score decreased markedly (18 points) between pre- and posttesting. This sub-
ject had been involved in overt conflict within the group two weeks prior to
taking the posttest, and the conflict may have made her feel less acceptable
socially. When this subject’s test score is excluded, the pre-to-posttest increase
is statistically significant in the Social Self category (t = 2.88; p <
.05).

The significant increase in scores of the S-I Group in the Moral-Ethical
Self area had not been anticipated. Two explanations may be considered.
First, one of the goals of the program was to make each girl feel “important”
and “accepted.” These attempts may have helped the teenagers to regard
themselves more positively. And, secondly, during one session, the adolescents
initiated an informal discussion around moral and ethical behavior. At the
time, stress was placed on their exerting energy to “begin anew” and “look
ahead” instead of dwelling on past mistakes.

In general, these results support the notion that a self-improvement course 
used as an adjunct to casework therapy may be beneficial in improving the 
self-concepts of adolescent females with varied family, social, or emotional 
problems. Indeed, many of the girls commented spontaneously that they felt 
the course had benefitted them and had increased their feelings of social ade-
quacy. It is uncertain, of course, how long the self-concept improvement
might last, since the posttesting was done immediately after the course. How-
ever, the present study shows that such improvement can be made. It should
be noted that active participation, follow-through, and apparent growth oc-
curred in a program that involved a relatively homogeneous socioeconomic
grouping of girls. Moreover, our experience suggests that therapy following
the self-improvement course is probably helpful in solidifying and expanding
the self-concept gains made by adolescent clients.


E. Summary 


The self-concepts of 30 female adolescents with behavioral problems were
measured by nine categories of the Tennessee Self Concept Scale at the
beginning and at the end of a five-week period. Twelve subjects, whose
families, in part or in whole, were involved in casework therapy, participated
in a self-improvement course designed to produce positive self-concept changes.
Eight other subjects were involved in casework therapy only and were not
enrolled in the self-improvement program. Ten subjects, comparable in age,
socioeconomic level, academic performance, and truancy history, were selected
as a control group from the public schools. Results indicate that self-concepts
of female adolescents with certain combinations of social/emotional problems
were improved in therapy situations by exposure to, and participation in, self-
improvement courses. The addition of this technique to the more usual forms
of therapy appears to hold promise for improving the adjustment of at least
some types of adolescent girls.
THE SERIAL ORDERING OF ITEMS ON THE
PEABODY PICTURE VOCABULARY TEST* 


University of South Florida; and Exceptional Child Education, 
Pinellas County Public Schools, Pinellas Park, Florida 


STUART I. RITTERMAN AND ELIZABETH E. TALLEY
A. INTRODUCTION 


The serial ordering of vocabulary items on the various intelligence test 
instruments has traditionally been determined solely on the basis of empirical 
observations of population performance. That is, only those permutations or 
series of items for which a steep linear growth curve between age groups is 
demonstrated are selected. Thus, the rank order of any given item is established 
as a function of percent passing by age. 

Terman and Merrill (15, p. 14) concur that locating items on a scale is 
purely an empirical procedure and note that “. . . curves of percent passing by 
age affect the variability of IQs.” That is to say, the distribution of hard and
easy items at any one age level will have a corresponding effect on the percent
passed. The variability of derived scores at a given age level can be compared
to others only in terms of the distribution of the percent passing, which in 
turn is a function of the relative concentration of items of varying difficulty. 
McNemar’s (13) analysis of the 1937 Stanford-Binet Scale indicates the 
degree to which item difficulty may influence test variability. 

In IQ tests such as the Stanford-Binet (15) the term item may be equated
with subtest. In a single-measure test, such as the Peabody Picture Vocabulary
Test (6), item is synonymous with stimulus, and linear growth curves are
obtained between stimuli rather than between subtests. Item validity on the
Peabody Picture Vocabulary Test (PPVT) was determined by selecting
words for which the percentage of subjects passing increased from one age
group to the next.

Perhaps the two most salient characteristics of a verbal stimulus in their
effect on a subject’s likelihood to attach it to a particular referent are mean-
ingfulness and familiarity. Noble (14, p. 89) theorized that “. . . the familiarity
of a stimulus (or response) may be regarded as some function of its frequency
of occurrence in the organism’s history.” Noble found a curvilinear correlation
between meaningfulness and familiarity. Coinciding with this is Underwood
and Schulz’s (17) hypothesis indicating that the priority of recall of
stimuli is some function of the past frequency of experience.


* Received in the Editorial Office on October 21, 1971, and published immediately
These postulates do not differ markedly from Hull’s (10) reaction potential
(sER) or Estes’ (9) response probability (P,) formulations. Few researchers 
will quarrel with the concept that m facilitates performance. 

Extending the above postulates of verbal learning to verbal testing, it may 
be logically assumed that relative ease or difficulty of any given test item is 
some function of its commonality or rarity in the linguistic environment.

In the construction of a 150-item test, such as the Peabody Picture Vocabu- 
lary Test, it is obviously impossible to test all possible permutations or orders 
of stimulus presentation in the generation of relative growth curves. Since, 
however, the identification of verbal stimuli appears to be some function of 
relative familiarity, word frequency of occurrence in the language might pro- 
vide an objective scaling reference for the construction of permutations to be 
tested. 

By smoothing out the progressions of easy to difficult items through the age 
levels tested, better distribution of percent passing, with a corresponding 
reduction in variability, should be achieved. 

While the employment of word frequency tables is somewhat restricted, 
inasmuch as they are traditionally derived from lexical corpora, Ausherman 
and Black (2) found some degrees of correspondence between the frequency 
of occurrence of words on the Thorndike and Lorge (16) lists and the fre-
quency of occurrence of words in the formal spoken vocabularies of college
students.

Word frequency counts may be divided into two broad categories: those 
constructed from adult lexicons (e.g., 11) and those whose construction in- 
cluded juvenile sources (e.g., 16). Utilizing these counts, the researcher may 
observe the correspondence of any given sequence of vocabulary items with an 
objective estimate of familiarity. As an example, the relative frequencies of 
occurrence of all 150 items of Form A of the PPVT (6), based on both a
juvenile and an adult word frequency count, are illustrated in Table 1. 

Inspection of the table reveals that the greater variability in word frequency
of occurrence lies within the first 50 items of the test. When the serial order-
ings of these first 50 items of the PPVT were compared with a serial order-
ing of the same 50 words arranged according to rank on an adult word fre-
quency count (11), a correlation of .44 was obtained. When compared to an
ordering based on a juvenile count (16), a correlation of .40 was obtained;
a correlation of .81 was obtained between the A- and J-lists. In all instances,
p < .05.
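The correlations above compare two serial orderings of the same 50 words. The article does not name the coefficient used; the sketch below assumes a Spearman rank-order correlation with no tied ranks, and the two five-item orderings are hypothetical illustrations rather than PPVT data.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank correlation for two untied rankings of the same items."""
    n = len(ranks_a)
    if n != len(ranks_b) or n < 2:
        raise ValueError("need two equal-length rankings with at least 2 items")
    # Sum of squared differences between paired ranks.
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical example: position of five words in the test versus their
# position when sorted by descending word frequency.
test_order = [1, 2, 3, 4, 5]
frequency_order = [2, 1, 4, 3, 5]
rho = spearman_rho(test_order, frequency_order)
print(f"rho = {rho:.2f}")
```

Word frequency counts often contain ties, in which case a tie-corrected formula (or a Pearson correlation computed on the ranks) would be the safer choice.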

If word frequency of occurrence as represented by rank on a word frequency
list accurately describes some function of familiarity, gains in performance
should accrue to orders of progressively descending frequencies of occurrence
over more randomly distributed permutations.

TABLE 1
RELATIVE FREQUENCY OF OCCURRENCE OF ITEMS ON THE PPVT UTILIZING AN ADULT
WORD FREQUENCY COUNT AND A JUVENILE WORD FREQUENCY COUNT

                                       Occurrence of items
                          Column A(a)    Column B       Column C        Total
                          Items 1-50     Items 51-100   Items 101-150   Items 1-150
Word frequency             N     %        N     %        N      %        N      %

Juvenile word frequency count based on Thorndike and Lorge (16)
Over 100 per million      10   20.83      2    4.00      0      -        12    8.10
50-99 per million         12   25.00      0     -        0      -        12    8.10
20-49 per million         10   20.83      1    2.00      0      -        11    7.43
10-19 per million          1    2.08      1    2.00      0      -         2    1.35
6-9 per million            0     -        0     -        0      -         0     -
Below 6 per million(b)    15   31.25     46   92.00     50   100.00     111   75.00

Adult word frequency count based on Kucera and Francis (11)
Over 100 per million       4    8.33      1    2.00      0      -         5    3.37
50-99 per million          5   10.41      1    2.00      0      -         6    4.05
20-49 per million         10   20.83      8   16.00      0      -        18   12.16
10-19 per million         11   22.91      2    4.00      1     2.00      14    9.45
6-9 per million            4    8.33      4    8.00      2     4.00      10    6.75
Below 6 per million(c)    14   29.16     34   68.00     47    94.00      95   64.18

(a) Two items in Column A were not counted because of lexical ambiguity.
(b) Of the last category (Below 6 per million) 107 items had a frequency of
oc­currence of less than once per million: 12 in Column A, 45 in Column B,
and 50 in Column C.
(c) Of the last category (Below 6 per million) 47 items had a frequency of less
than once per million: 4 in Column A, 13 in Column B, and 30 in Column C.
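The percentages in Table 1 can be checked against the item counts. The sketch below assumes, as an inference from the printed figures rather than anything the authors state, that percentages were truncated (not rounded) to two decimals and that the two ambiguous items were left out of the Column A and Total denominators, giving 48 and 148 counted items.

```python
def truncated_percent(count, total):
    """Percentage of `count` in `total`, truncated (not rounded) to 2 decimals."""
    # Integer arithmetic avoids floating-point surprises at the cut-off.
    return (count * 10000 // total) / 100


# Counted items per column: two Column A items were excluded as ambiguous.
counted = {"A": 48, "B": 50, "C": 50, "Total": 148}

print(truncated_percent(10, counted["A"]))       # juvenile count, over 100 per million
print(truncated_percent(12, counted["Total"]))   # same row, Total column
print(truncated_percent(95, counted["Total"]))   # adult count, below 6 per million
```

Under rounding, 12 of 148 would come out as 8.11 rather than the printed 8.10, which is why truncation is assumed here.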

This study investigated the performance of a preschool population on two
serial orderings by word frequency of occurrence of the first 50 items on
Form A of the PPVT. One ordering was arranged according to a juvenile
frequency count (16) and the other to an adult count (11). To the extent
that frequency counts are an estimate of familiarity, differences in performance
favoring the juvenile permutation were hypothesized.


B. Methods


1. Subjects 
Subjects were 26 white children (11 males and 15 females) attending day
nurseries in the Tampa, Florida, area. Ages ranged from 27 to 50 months with
a mean chronological age of 40.73 months (SD = 5.42). Subjects placed
from lower middle to upper middle class on the Index of Status Characteristics
(18), were from monolingual homes, and had successfully passed an audio-
metric screening examination.
2. Test Materials


Two permutations of the first 50 items of Form A of the PPVT were 
constructed. Serial ordering of the J-list was based upon a juvenile word fre- 
quency count (16), while the ordering of A-list was based upon an adult (11) 
count. The ordering of words on both lists was arranged on a progressively 
descending frequency scale so that item 1 had the highest and item 50 the 
lowest frequency of occurrence. 

Stimuli were identical to those in the original PPVT. The entire page of 
the PPVT on which a stimulus appeared was removed to be placed in its 
new serial order rank. Each stimulus used, therefore, appeared in its original
context with respect to the adjacent S. 


3. Procedures 


The first 50 items for Form A of the PPVT and each of the two permuta- 
tions were administered individually to each subject during a single session at 
the day nursery. The pretest instructions were those from the Examiner's 
Manual of the PPVT (6). Order of presentation of the three lists was deter- 
mined according to a table of random numbers. 

All subjects began each list at item 1 and continued until a ceiling of six 
errors in any eight consecutive permutations was reached. Subjects who had 
not reached their ceiling within the first 50 items of any of the lists were 
administered items 51 and above of the PPVT, in their regular serial order, 
until a ceiling was obtained. Any subject making five or fewer errors on the
first 50-word list presented was eliminated from the study.

Performance on all lists was evaluated according to the scoring procedures 
and normative data in the Examiner’s Manual of the PPVT (6). 
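The ceiling rule described above (six errors within any eight consecutive items) can be sketched as a sliding-window check. The function names and the response record are hypothetical, and the raw-score rule used here (ceiling item minus errors) is an assumption about the Examiner's Manual scoring, which the article cites but does not reproduce.

```python
def find_ceiling(responses, window=8, max_errors=6):
    """Return the 1-based item number at which the ceiling is reached.

    `responses` is a list of booleans (True = correct) in order of
    administration. The ceiling is the first item at which the most recent
    `window` items contain at least `max_errors` errors; None if never reached.
    """
    for end in range(1, len(responses) + 1):
        recent = responses[max(0, end - window):end]
        if recent.count(False) >= max_errors:
            return end
    return None


def raw_score(responses):
    """Ceiling item minus errors up to the ceiling (assumed scoring rule)."""
    ceiling = find_ceiling(responses)
    if ceiling is None:
        return None
    return ceiling - responses[:ceiling].count(False)


# Hypothetical response record: ten correct items, then a run of misses.
record = [True] * 10 + [False, False, True, False, False, True, False, False]
print(find_ceiling(record), raw_score(record))
```

Under this assumed rule a list can raise both the Ceiling and the Error count without changing the Raw Score, since the two enter the score with opposite signs.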


C. Results


Mean scores by list for each of five scoring criteria are presented in Table
2. The results of a three-part analysis of variance (7) indicating significant
interlist mean differences are presented in Table 3. Significant differences (p <
.05) were obtained in all scoring categories with the exception of Errors.
These intralist differences simply indicate that, as would be expected within
normal populations, subjects show significant differences in mean performance.
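The F ratios in Table 3 follow from the sums of squares and degrees of freedom in the usual way: each mean square is a sum of squares divided by its df, and F is the effect mean square over the residual mean square. As a small check, using the Raw Score figures from Table 3 (the function name is illustrative only):

```python
def f_ratio(ss_effect, df_effect, ss_residual, df_residual):
    """F = (SS_effect / df_effect) / (SS_residual / df_residual)."""
    ms_effect = ss_effect / df_effect
    ms_residual = ss_residual / df_residual
    return ms_effect / ms_residual


# Raw Score entries from Table 3: columns = lists, rows = subjects.
f_lists = f_ratio(259.68, 2, 1822.36, 50)
f_subjects = f_ratio(5251.50, 25, 1822.36, 50)
print(round(f_lists, 2), round(f_subjects, 2))
```

With 26 subjects and 3 lists there are 78 scores, so the total df is 77, split into 2 (lists), 25 (subjects), and 50 (residual).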

Tests of the significance of the differences in means for individual pairs of
treatments by use of an a posteriori procedure described by Lindquist (12)
are presented in Table 4.

Significant differences (p < .05) between the PPVT and J-list were noted
both with respect to Ceiling and Error means, but not for Raw Score. Inspec-
tion of the means will reveal that while significantly higher Ceilings were
obtained on the J-list, so too were significantly higher Error scores. These
offset one another and consequently produced nonsignificant mean Raw Score
differences. With the exception of the foregoing, no other significant dif-
ferences between the PPVT and either the J-list or the A-list were obtained.
Significant differences, however, in all scoring categories were obtained be-
tween the A-list and the J-list.

An analysis of rank score performance on each of the lists yielded signifi-
cant correlations (p < .05) of .62 between the PPVT and J-list, .60 between
the PPVT and the A-list, and .73 between the A-list and the J-list.

Presentation order did not appear to be a significant factor. Of the 26
subjects, six obtained their highest performance on the first list presented,
10 on the second list presented, and 10 on the third list presented.

TABLE 2
MEAN SCORES BY LIST FOR EACH OF THE FIVE SCORING CRITERIA
[Column headings and several entries are illegible in the source; surviving
entries are listed in their original left-to-right order.]

Scoring criteria
Ceiling        Mean = 43.42
               SD   = 13.03
Errors         Mean = 12.80    Mean = 17.03
               SD   =  3.84    SD   =  4.56
Raw Score      Mean = 23.92    Mean = 26.38
               SD   =  7.90    SD   =  9.11
Mental Age     Mean = 33.00    Mean = 35.00
               SD   =  5.60    SD   =  8.36    SD   =  7.08
IQ             Mean = 84.23    Mean = 81.34    Mean = 88.00
               SD   = 14.44    SD   = 20.04    SD   = 15.61

TABLE 3
THREE-PART ANALYSIS OF VARIANCE OF INTERLIST MEAN DIFFERENCES
FOR ALL SCORING CRITERIA

Source of variation      Sum of squares    df    Mean square     F
Raw Score
  Between columns             259.68        2      129.84       3.56
  Between rows              5,251.50       25      210.06       5.76
  Residual                  1,822.36       50       36.44
  Total                     7,333.54       77
Errors
  Between columns             170.41        2       85.20       4.34
  Between rows                879.60       25       35.18       1.79*
  Residual                    979.99       50       19.59
  Total                     2,030.00       77
Ceiling
  Between columns             870.33        2      435.16       4.43
  Between rows              9,559.73       25      382.38       3.89
  Residual                  4,910.74       50       98.21
  Total                    15,340.80       77
Mental Age
  Between columns              95.10        2       47.55       2.24
  Between rows              2,882.72       25      115.30       5.44
  Residual                  1,059.63       50       21.19
  Total                     4,037.45       77
IQ
  Between columns             578.97        2      298.47       3.59
  Between rows             17,830.73       25      713.22       8.85
  Residual                  4,026.78       50       80.53

* Insignificant (p = .05).

TABLE 4
t TESTS OF THE SIGNIFICANCE OF THE DIFFERENCE IN MEANS
FOR INDIVIDUAL PAIRS OF TREATMENTS

List            Ceiling        Errors         Raw Score      IQ
PPVT vs.        t = .042       t = 1.0347     t = 1.9646     t = 1.1612
A-list          (p > .05)      (p > .05)      (p > .05)      (p > .05)
PPVT vs.        t = 2.8604     t = 3.5199     t = 1.4693     t = 1.304
J-list          (p ≤ .01)      (p < .01)      (p > .05)      (p > .05)
A-list vs.      t = 2.8161     t = 2.411      t = 2.7858     t = 2.6759
J-list          (p ≤ .01)      (p < .05)      (p < .01)      (p < .01)


D. Discussion 


The hypothesis of significant gains accruing to those permutations of
stimuli most closely approximating the word frequency count appropriate to
the experimental population was confirmed. Significant mean Raw Score dif-
ferences (p < .05) of 4.46 items, yielding significant mean IQ differences
of 6.66 points (p < .05), were obtained between the A-list and the J-list
favoring the J-list.

The observation of no significant (p = .05) interlist differences in derived
MAs would seem to indicate that MA is the least sensitive index of performance
differences.

A tentative explanation of this finding may lie in the derivation of the MA
and IQ from Raw Scores, and the correlation between these indices.

The present authors correlated Raw Scores from 5 to 57 of Form A of the
PPVT with the corresponding MAs and IQs. As a reference, the CA of 3
years 4 months was used in derivation of IQs, as this CA corresponded to the
mean CA of the subjects in the present study.

A significant correlation of .9715 (p = .001) was obtained between Raw
Score and MA, while a correlation of .9997 (p = .001) was obtained be-
tween Raw Score and IQ.

By means of a z′ transformation of r (8) the significance of the difference
between the correlations for Raw Score and MA, and Raw Score and IQ
was examined. A significant difference (p = .001) was obtained.
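A z′ comparison of two correlations can be sketched as follows. The article does not give its formula, so this assumes the textbook test for two independent correlations, each based on the 53 Raw Scores from 5 to 57. Because both correlations here share the same Raw Scores, the independence assumption is only an approximation, and the result should be read as illustrative.

```python
import math


def fisher_z(r):
    """Fisher z' transformation of a correlation coefficient."""
    return math.atanh(r)  # equals 0.5 * ln((1 + r) / (1 - r))


def z_difference(r1, n1, r2, n2):
    """Normal deviate for the difference between two independent r values."""
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return (fisher_z(r1) - fisher_z(r2)) / se


n_scores = 57 - 5 + 1  # Raw Scores 5 through 57
z = z_difference(0.9997, n_scores, 0.9715, n_scores)
print(f"z = {z:.2f}")
```

A deviate above about 3.29 corresponds to p < .001 (two-tailed), which is consistent with the significance level the authors report.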

While the correlational differences between Raw Score and IQ (.02) ap-
pear statistically rather small, clinically these may be of greater significance.
When part list correlations are examined, the variability of MA becomes
evident. The MA differences for the 12 Raw Scores ranging from 14 through
25 on Form A, for example, are 6 months (2-3 through 2-9), while the IQ
differences (using a CA of 3-4 years as a reference), are 17 points (67-84).
For the 12 Raw Scores ranging from 53 through 64 (again using a CA of
3-4 years as a reference) the IQ differences are again 17 points (127-144).
The MA differences between these 12 Raw Scores are 2 years 1 month (5-7
through 7-8 years). The conclusion the present authors draw from the fore-
going is that, for this particular instrument, IQ is the less variable and there-
fore more valid index of performance differences.

While significant interlist differences for number of errors were obtained,
no such difference was obtained for the intralist errors. This would seem to
indicate that, for this population, differences separating high and low per-
formance on the PPVT are a function of increments in correct responses with
no concomitant increase in the number of incorrect responses.

The role of the distribution of difficult items in its effect on the raw score
and the measurements derived from it becomes increasingly evident.

Item analysis of A- and J-lists revealed that while there was an interlist
correlation of .8068, the lists differed chiefly in the distribution of what
Dunn (6) has termed gerunds. In the J-list, six of the 10 gerunds appeared
within the first 11 items.

The following observations were made concerning the gerunds:

(a) While numerically comprising only 20% of the 50-item lists, they
accounted for 25.66% of the J-list errors and 25.22% of the A-list
errors.

(b) Item analysis of the words constituting the eight ceiling criteria for
each of the 26 subjects revealed that gerunds represented 33.33% of
the total A-list ceiling items and 21.79% of the total J-list ceiling
items.
The gerund, a verb functioning nominally, consists of two components: a
free base plus the inflectional morph -ing. The gerundive form class, in addi-
tion, occupies a given position in English paradigms that sets it apart from the
present progressive. Thus in the paradigm, running is healthy, running, as
the subject of the sentence, is a gerund. With use of any of the carrier phrases
indicated in the PPVT administration manual (6), e.g., point to running,
the stimulus word appears in a present progressive slot in the paradigm.

The ability to discriminate the referent of a present progressive stimulus
from the three competing stimuli presented with it therefore is dependent
upon (a) comprehension of the base, (b) recognition of the verbal inflection,
and (c) the differentiation of base inflection from those of the competing
stimuli.

Bellugi (3), in her study of the emergence of English inflections in the
speech of two children, observed that the present progressive appeared earlier
than any other inflection (1934 and 28 months). In addition, it was second
only to the plural inflection /-s/ in the speech models presented by the mothers.

Berko (4), in examining the morphology of preschool and first grade chil-
dren, observed that while 97% of first graders correctly responded to the
present progressive inflection, only 72% of the preschoolers were able to do
so. This difference was significant at p = .01. Berko (4, p. 168) concluded
that with the present progressive inflection “. . . there was no question of
choice, there is only one allomorph of the present progressive morpheme and
the child either knows this -ing form, or does not.” Her results suggested that
they do.

Adams (1), comparing the verb forms in the speech of four-year-olds with
those of adults, found that while the -ing inflection was the fourth most [...]

With regard to the child’s represented ability to respond differentially to
pictorial referents illustrating the present progressive from those illustrating
other form classes, Carrow’s (5) results indicated that 60% of her [...]-
year-old subjects were able to differentiate the present progressive is riding
from the past and future tenses, rode and will ride. However, it was not until
age six years that they were able to respond differentially to present progres-
sive and present progressive perfect, e.g., is eating vs. has eaten. It is sug-
gested the obtained differences between A- and J-list performance are at-
tributable in part to the rank order differences of the present progressive
items. With respect to performance on the present progressive items, it is
felt that results reflect difficulties at all three of the above-mentioned levels.


E. Summary 


Two serial orderings of the first 50 items of Form A of the Peabody 
Picture Vocabulary Test (6), one arranged according to a juvenile word 
frequency count (16) and the other to an adult count (11), were presented 
to 26 preschool children. 

Results indicated that (a) the PPVT is relatively insensitive to changes in
serial ordering based on word frequency counts, and (b) population-ap-
propriate word frequency counts are reliable estimates of word familiarity.
CHILDREN’S RESPONSE SPEED AS A FUNCTION OF
OMISSION AND DELAY OF A 
CUSTOMARY REWARD* 


Marquette University 


ANEES A. SHEIKH AND DONALD C. COOK


A. INTRODUCTION 


Both absence and delay of reward in a situation where it had been present 
previously are considered frustrating events (1). Studies of children’s response 
speeds have been conducted where a few immediately rewarded trials were 
suddenly followed by nonrewarded trials (2, 7), or where a sudden shift 
from immediate reward to delayed reward occurred (4, 5, 9). The results of 
these studies do not clearly indicate that omission and delay of reward have 
identical effects on children’s performance. Since these studies were conducted 
by different experimenters using different subjects and tasks, one cannot 
conclude with certainty that withholding and delaying of reward are two
completely different variables, and their effects cannot be interpreted in the
same manner. To have a better understanding of these two variables, the
present study manipulated the two conditions within the same experiment.


B. Method


The Ss were 45 third and fourth grade children (16 girls and 29 boys) 
from a public school in Milwaukee, Wisconsin. All Ss were given 30 trials on 
a lever-pulling, marble-dispensing apparatus similar to the one described by
Ryan and Cantor (6). The Ss were invited to play a “game” and were told
that if they won enough marbles, they could trade them for a toy of their
choice. Each S was shown a collection of eight toys and was asked to choose
the toy he would like to try to win. 

The Ss were divided into three subgroups: (a) the Delayed Reinforce-
ment Group (DL) received 20 immediately reinforced trials that were fol-
lowed by 10 trials with delayed (15-second) reward; (b) the Partial Rein-
forcement Group (PR) received 20 immediately reinforced trials that were
followed by 10 nonrewarded trials; (c) a Control Group that was immedi-
ately rewarded (Group IM) on all 30 trials. The distribution of sexes and
third and fourth grade children was approximately equal in all three groups.

The interval between successive stimuli (signal light to pull the lever)
was kept constant and was about 20 seconds in duration. On each trial, E
recorded the starting time (the interval from the onset of the stimulus light
to the initial movement of the lever) and the movement time (the time taken
to pull the lever all the way down).


C. Results


The starting and movement times were converted to speeds (1/t second) 
and combined into blocks of three trials each. The first seven blocks were the 
preshift blocks, and the last three were the postshift blocks. In the case of 
Groups DL and PR, the first delay trial or the first nonreward trial (21st 
trial) was included among the preshift trials. Since the S presumably did not
know until after he had responded that the reward was delayed or omitted, 
it was, in effect, an immediate reward trial. 
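
The conversion and blocking just described can be sketched in Python. This is only an illustration under stated assumptions: the latencies are invented, the helper names `to_speeds` and `block_means` are hypothetical, and the block value is assumed to be the mean of the three reciprocal latencies, which the text does not state explicitly.

```python
from statistics import mean

def to_speeds(latencies):
    """Convert latencies in seconds to speeds (1/t)."""
    return [1.0 / t for t in latencies]

def block_means(speeds, block_size=3):
    """Combine consecutive trials into blocks (assumed here: the block mean)."""
    return [mean(speeds[i:i + block_size])
            for i in range(0, len(speeds), block_size)]

# Hypothetical starting times (seconds) for one S over the 30 trials.
latencies = [0.5] * 21 + [0.4] * 9

blocks = block_means(to_speeds(latencies))

# Trials 1-21 form the seven preshift blocks; trials 22-30 form the
# three postshift blocks, so the 21st trial counts as preshift.
preshift, postshift = blocks[:7], blocks[7:]
```

With 30 trials and blocks of three, the split into seven preshift and three postshift blocks follows directly from treating the 21st trial as an immediate-reward trial.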

In order to compare the changes in speed over trial blocks of the three 
subgroups, separate Lindquist Type I (3) analyses of variance were performed
on the seven preshift blocks and the three postshift blocks of trials.
The starting and movement speeds were subjected separately to these analyses. 
For between-Ss the main effect was reinforcement condition, and for within-Ss 
the main effect referred to trial blocks. 

In addition, for each group, correlated t tests compared the speeds on the
last preshift block (7th block) with the first postshift block (8th block). 

No significant effects were obtained on the preshift blocks of trials for either
starting or movement speed measures. The results on postshift measures for
the starting and movement speeds are presented below separately.


1. Starting Speeds 


On the three postshift blocks of trials, a significant main effect was obtained 
for the reinforcement condition (F = 3.193, df 2/42, p < .05). Subsequent
individual comparisons indicated that the only significant difference occurred
between Groups PR and DL (p < .05). Group IM did not differ from
either of the other two groups. Neither the main effect for trial blocks (F =
2.589, df 2/84) nor the trial blocks × reinforcement condition interaction
(F = 2.410, df 4/84) was significant.

The t tests, comparing the speeds on 7th and 8th trial blocks, indicated that 
Group DL decreased and Group PR increased their speeds significantly from 
the 7th to the 8th trial block (p < .05).


2. Movement Speeds


The only significant result on the movement speed measures was a main 
effect for the trial blocks (F = 4.900, df 2/84, p < .05).


D. Discussion 


The results concerning starting speeds are generally in agreement with 
previous findings (5, 8, 9). When the Ss are used to immediate reward, the 
delay of reward leads to a decrement, and nonreward leads to an increment 
in starting speeds. The faster starting speeds following nonreward for Group 
PR are readily explainable in terms of Amsel’s theory (1). He postulated that 
nonattainment of an expected reward leads to the occurrence of frustration, 
which in turn results in increased motivation: hence, faster speeds. 

In accordance with Amsel’s (1) suggestion, one would also expect that 
delayed attainment of an immediately expected reward would also be frus- 
trating and lead to faster speeds. However, the present data do not support 
this notion. It is possible that, in the case of delay, whatever increment in 
drive level is produced is dissipated by the arrival of the delayed reward. 
Consequently, when the S makes the next response, no evidence of this frustration
is present. In the case of Group PR, on the other hand, no reward
arrives, and this frustration-produced motivation is probably carried over to 
the next trial. 

One other major difference between the two reinforcement conditions has 
been pointed out (8, 9). In a delayed reinforcement situation, whatever 
competing responses are produced during the delay interval are reinforced by 
the delayed reward. These competing responses become conditioned to the 
apparatus cues and are elicited during the subsequent trials and interfere with 
the response speed. Such competing responses in a nonreward situation are 
not followed by any reinforcement; hence, they do not become stronger. 

Apart from a significant main effect for trial blocks, none of the other 
effects was significant on the movement speed measures. However, a closer 
examination of the mean speeds of all subgroups for each postshift block of 
trials indicated a trend similar to that for starting speeds. No explanation for 
this discrepancy in the starting and movement speed results is advanced here. 


E. SUMMARY 


The effects of omission and delay of a customary reward on children’s 
starting and movement speeds in a lever-pulling task were investigated. The 
results indicate that the omission of reward has a facilitating effect on starting 
speed, whereas the delay tends to inhibit the same speed. Movement speed 
measures did not reveal a significant difference, although the results pointed
in the same direction.


REFERENCES 
1. Amsel, A. The role of frustrative nonreward in a noncontinuous reward situation.
Psychol. Bull., 1958, 55, 102-119.


2. Horton, R. B. Amplitude of an instrumental response following cessation of
reward. Child Devel., 1962, 32, 107-116.


3. Lindquist, E. F. Designs of Experiments in Psychology and Education.
Houghton Mifflin, 1953.


4. Rabinowitz, F. M. Children’s lever-pulling behavior as a function of shift in
delay of reinforcement interval: A modified replication. Psychol. Rep.,
693-646.


5. Rieber, M. Shifts in response reward interval and its effect upon response speed.
Psychol. Rep., 1961, 9, 393-398.


6. Ryan, T. J., & Cantor, C. N. Response speed in children as a function of reinforcement
schedule. Child Devel., 1962, 33, 871-878.


7. Ryan, T. J., & Voorhoeve, A. C. A parametric investigation of reinforcement
schedule and sex of S as related to acquisition and extinction of an instrumental
response. J. Exper. Child Psychol., 1966, 4, 189-197.


8. Sheikh, A. A. Response speed as a function of different reinforcement conditions
and a ready signal. Child Devel., 1967, 38, 857-867.


9. ——. Children’s response speed as a function of delay of reward at different
distances from the goal and incentive value. Psychon. Sci., 1968, 11, 201.


Department of Psychology 

Marquette University 

617 North Thirteenth Street 
Milwaukee, Wisconsin 53233 


Published as a separate and in The Journal of Psychology, 1972, 80, 103-109. 


DO SHIFTS IN ATTENTION CHANGE 
PERCEIVED LOCUS OF SELF ?** 


University of Florida 


J. C. Dixon


A. Introduction


Self will be used in two related ways. In the reflexive statements, “I am 
talking to myself” and “I pinched myself,” and in the self-referent statements, 
“I feel my heartbeat” and “I am aware of my feelings,” the term “I” will 
refer to the observer, experiencer, agent, or judge at the moment of the reflective
act. Reciprocally, “myself,” “my heartbeat,” and “my feelings” will refer
to that which is reflectively observed, experienced, acted upon, or judged. For
convenience, “I” will be called subject-self; and “myself,” “my heartbeat,”
etc., will be called object-self or aspects of object-self.


Like all percepts, the experienced body is projected onto three-dimensional 
space, and the experience of body changes is detected through the maintenance 
of a particular locus of the subject-self at the time of perception. Since subject-self
is apprehended as more central than what is perceived, including
perceived object-self, the locus of subject-self provides information about a
person’s subjective point of reference. Sherrington (10) located this central
subjective focal point in the mid-vertical of the forehead as a sort of cyclopean
eye. Claparede (1) and Critchley (2), among others, have made similar observations.
The word, viewpoint, is the most common expression for a personal
reference point whether used literally or figuratively, and since we ordinarily
apprehend space visually and binocularly, this locus seems intuitively correct. 

Possibly because sight and hearing provide our dominant cues for spatial 
localization, the major testimony favors a “head” localization. But what 
about the blind-deaf or others who depend more on other senses? Helen 
Keller (8), deprived of both sight and hearing, notes that when she first 
learned finger language, if someone had asked her where her self was, she would 
most assuredly have located it in her hands. The present investigator (1957)
asked her in her later years if she still felt her “self” in her hands. She replied
that, as she acquired more abstract concepts and was able to organize her world
more conceptually, she largely lost this spatial frame of reference and her
self became more conceptual or spiritual. She also reported that she did not 
use finger language as much in talking to herself as she had before. Her 
secretaries noted, however, that throughout her life there was often a vigorous 
accompaniment of finger language when she was alone and seemed to be 
talking to herself. 

William James (7) reported that his self of selves seemed to be located 
either in the head region or somewhere between his head and throat. Claparede 
(1) said that two young ladies responded, “dans mon coeur” (“in my heart”), when asked
where they found their most intimate selves. He also noted that some did not 
understand the question. Horowitz (6) asked a college class, “If you had to 
locate yourself at some point either within or external to your body, where 
would that point or area be?” He found that all gave an internal locus, most 
frequently in the head, but with many localizations in other parts of the body. 
Himelstein (5) made the same inquiry in a similar classroom situation, con- 
firming the individual differences found by Horowitz, and also noted a possible 
sex difference with relatively more females reporting a thoracic localization. 

Although the above reports are not based on an explicit distinction between 
subject-self and object-self, it seems reasonably clear that subject-self is not 
inalterably and in every person experienced in the same place. As a class home- 
work exercise the investigator has demonstrated individual differences many 
times, often to the surprise of students as well as investigator, as follows: 
While alone with eyes closed, to concentrate better on somatic stimulation, 
the student is asked to engage in a series of exercises that involve self-stimula- 
tion of various body points. The rationale behind the exercises is that stimula- 
tion of various body points will provide a three-dimensional fix on locus of 
self. One example will give the gist of the procedure. 

“Press your fingertips against your navel so that you have the experience, 
‘I feel my navel.’ Using ‘I’ in this sentence as your point of reference (subject-self),
locate your navel (object-self) as well as you can, then open your
eyes and check one blank in each of the three sets of locations. From the 
vantage point of ‘I,’ does your navel seem to be 


1. above ___, below ___, other ___
2. in front ___, behind ___, other ___
3. right ___, left ___, other ___




Now describe in your own words where ‘I’ seemed to be when you were feeling 
your navel.” 

In addition to individual differences the reference point seemed to vary for 
some students. One student, Lauren Deldin (3), decided to manipulate body 
position to determine if this might be a change-of-locus variable. The very 
slight change involved in standing up and lying down was selected as the 
independent variable. Besides confirming subject-self changes and a possible 
trend for females to use head localization relatively less frequently than males, 
she also found that 27 of 60 students said they experienced, to some degree, a 
shift of subject-self either up or down in one or more of the five exercises as 
they changed body position. Women did not shift significantly more than men. 

Since the reference point seemed to vary from exercise to exercise for some 
students, the present study is addressed more directly to the question of the 
relation between changes in site of self-stimulation and perceived changes in 
self-localization. Shift in focus of attention is assumed to be induced by changes 
in site of self-stimulation along with the instructions to concentrate on these 
different loci. 


B. Method
1. Subjects 


Forty male and 40 female students, mainly sophomores, juniors, and seniors 
from psychology and related courses at the University of Florida, volunteered 
to participate in the study. Each student was seen individually. 


2. Procedure 


After a general orientation to the purpose and method of investigation and 
the role of the student in the study, the self-localization exercises were pre- 
sented along with several other tasks not relevant to the present question. 

Locus of subject-self was determined indirectly by a series of exercises that 
required the student to concentrate on a particular part of his body by applying
tactual pressure on the part, locate the part as above or below subject-self, and 
then shift his attention to the point of reference he used to locate that par- 
ticular body area as above or below. During the exercises he remained in an 
upright seated position. Since previous experience showed no appreciable variation
in the front-back and right-left dimensions, all body areas were stimulated
in the median plane. In order to emphasize a somatic rather than visual orienta- 
tion, the student’s eyes remained closed during the exercises, which consisted 
entirely of somatic stimulation. All references to vision, such as “point of 
view,” were avoided. It is not pretended, of course, that this eliminates the
visual component. It is only hoped that it may have reduced somewhat our
strong visual dominance so that individual differences could be better discerned.

Two orders of presentation, alternated by subject and sex, helped control 
for order effects. Form “A” order was (a) finger pressure on navel, (b) 
tongue against palate, (c) joined hands on lap, (d) joined hands extended
directly overhead. Form “B” was (c), (d), (a), (b). Two student interviewers,
one male and one female, presented the exercises to half of each sex
and to half of each order of presentation. 

Locus of subject-self was categorized as Head or as Other (trunk or whole
body). Thus a person could score from none to four Head localizations.
Changes in locus from exercise to exercise were scored on a scale from 1 to 4,
where 1 indicates no change whatsoever in locus and 4 represents changes
that include both Head and Other localizations, as well as changes within one
or both of these categories. A score of 2 represents changes within one category,
and 3 represents change from one category to another without change within
either category.
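
The scoring rule above can be written out as a short Python sketch. It is an illustration only: the function name `score_subject`, the representation of each report as a (category, site) pair, and the site labels are assumptions introduced here, not part of the original procedure.

```python
def score_subject(loci):
    """Score one student's four reported loci.

    loci: list of (category, site) pairs, one per exercise, where category
    is "Head" or "Other" and site is a free-text label (hypothetical coding).
    Returns (number of Head localizations, change score from 1 to 4).
    """
    head_count = sum(1 for category, _ in loci if category == "Head")

    # Collect the distinct sites reported within each category.
    sites_by_category = {}
    for category, site in loci:
        sites_by_category.setdefault(category, set()).add(site)

    between_change = len(sites_by_category) > 1
    within_change = any(len(sites) > 1 for sites in sites_by_category.values())

    if not between_change:
        change = 2 if within_change else 1   # 1: no change; 2: within one category
    else:
        change = 4 if within_change else 3   # 3: between only; 4: between and within
    return head_count, change


# Example: Head ("eyes") on two exercises, Other ("chest") on the other two.
example = [("Head", "eyes"), ("Head", "eyes"), ("Other", "chest"), ("Other", "chest")]
print(score_subject(example))
```

Keeping the site label separate from the category is what lets the sketch tell a score of 3 from a score of 4.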


C. Results


Tabular data are presented for frequency of Head and Other (trunk and
whole body) localizations of subject-self under the four stimulating conditions.


TABLE 1
Frequency of Head and Other Localizations of
Males and Females Under Four Stimulating Conditions:
(N)avel, (L)ap, (P)alate, (O)verhead


                   Localizations
                Head          Other
Conditions     M     F       M     F
N             20    21      20    19
L             20    17      20    23
P             33    34       7     6
O             27    33      13     7


Table 1 shows that a lower body (navel, lap) site of stimulation resulted in
about equal numbers of Head and Other localizations of subject-self, whereas
upper body (palate, hands overhead) stimulation led to a much greater frequency
of Head localizations. Table 1, converted into a fourfold table (Head,
Other; upper and lower body sites of stimulation) for test of statistical significance
of this difference, yielded a chi square of 32.76, which is significant
beyond the .01 level. Sex differences and order effects were not significant, but
there was a tendency toward more Head localizations for female students in
the presence of the male interviewer during the navel exercise (13 compared
to 8 Head localizations for the female interviewer).
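
The collapse of Table 1 into a fourfold table can be sketched as follows. This is a minimal illustration, not the author's computation: it assumes an uncorrected Pearson chi square, the function name `chi_square_2x2` is hypothetical, and the navel condition is taken as 41 Head and 39 Other localizations of the 80 students. Under these assumptions the result falls close to, though not exactly on, the reported 32.76; the source does not say whether a correction or a different rounding was used.

```python
# Head and Other counts summed over sex, from Table 1.
lower = {"Head": 41 + 37, "Other": 39 + 43}   # navel + lap
upper = {"Head": 67 + 60, "Other": 13 + 20}   # palate + hands overhead

def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi square for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

chi2 = chi_square_2x2(lower["Head"], lower["Other"],
                      upper["Head"], upper["Other"])
print(round(chi2, 2))
```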

A count of those who changed locus of subject-self shows that 26 males and
30 females changed at least once, while 14 males and 10 females maintained
the same locus during the exercises. It is apparent that more than twice as many
changed locus at least once (56) as did not (24). Possibly because one
of the interviewers made more intensive inquiries about locus, which led to
more frequent reports of shifts within an area, change is defined grossly
as shift between rather than within areas. Thus a shift from “beneath
sternum” to “in the middle of the girdle area” is not classified in the change
group, since it involves only a shift within the Other category. The test for
significance of a difference between two proportions yielded a value of 2.79,
which is significant beyond the .01 level. The tendency for females to shift
more is not statistically significant.

With Head localizers defined as those who reported a Head locus in at least 
three of the four exercises, the result is that 20 males and 20 females are 
classified as Head localizers.


D. Discussion 


Since the investigator is a consistent Head localizer (around the eyes), the 
implications of the findings will probably not be seen from all vantage points, 
and probably least from a navel point of view. The findings certainly confirm
the results of the classroom exercises and Deldin’s findings that some
people are consistent Head localizers and others are consistent Other localizers
in these exercises. The question posed by the present study is answered positively:
locus of subject-self definitely tends to change as a function of site of
stimulation. Thus, the exercises designed to obtain a “fix” on self demonstrate
that locus may shift in a predictable way and that perceived self does not
necessarily have a fixed locus.

Upon first encounter with an Other localizer, the investigator’s first thought
was that he must have misunderstood the instructions or failed to grasp the
subject-object distinction. This is no doubt true in some cases, but interviews
with several Other localizers suggest that this does not account for the
majority of cases. Likewise, interviews suggest that some Head localizers are
conceiving rather than perceiving. That is, they reason that the head is where
the brains are and that is where the self (or the seat of the soul) should be,
ergo . . ., but it is unlikely that this accounts for the majority of cases either.
What else, then, may account for these individual differences aside from
chance, perversity, and instrument artifact?
Some ancient Greeks believed that the spirit of some troubled souls wandered
from site to site within the body seeking an outlet. It may be true that
many college students are also troubled souls, but whether this helps to account
for the relation between site of stimulation, direction of attention, and perceived
locus of subject-self remains to be determined. Since most students are
cooperative, obliging, and also curious and intelligent, it is quite possible that
the different sites of stimulation and the repeated requests for locus of subject-self
could lead some students to comply with what they perceived to be the
change-of-locus hypothesis of the investigator. Nevertheless, the likelihood
remains that perceived locus of subject-self really does change under these
conditions. Why?
Apparently, subject as well as object perception is dependent on point of
reference. Some maintain an invariant reference, possibly analogous to the fact
that some perceptual constancies are more invariant in some people than in
others, while those who do shift show a definite tendency for subject locus to
follow object focus. It may be fruitful to relate these findings to those of
Witkin et al. (12) and to Wapner and Werner (11) on body perception
under varying stimulational conditions, to Natsoulas and Dubanoski (9) on
the inferred locus of the perceiver, and to Fisher (4) on body boundary.


E. Summary 


Subject-self and object-self were defined analogous to the reflexive pronoun
distinction in grammar. Then a methodology was described for inferring
locus of subject-self. Subjects were 80 university students. Under
tactual self-stimulation in the vertical midline axis, it was found that locus of
subject-self tended to change consistent with site of stimulation and assumed
shift in attentional focus.


TIME-SAMPLING CARETAKER AND INFANT BEHAVIORS
IN THE FIRST FIVE WEEKS OF LIFE**
University of North Carolina at Chapel Hill and Western Carolina University 


Kathleen Bloom and Eugene E. McDowell


A. Introduction


The importance of early experience has been stressed in studies of infant
development. In such studies, some investigators have viewed the infant as being
entirely reactive, that is, as responding passively to the various impinging
stimuli. More recently, however, the infant has been viewed as playing an
active role in its environment. The infant not only reacts to the stimuli in its
surround, but also acts as a stimulus within its environment. In this context
both infant and caretaking behavior have been investigated.


Various parameters of mother-infant interaction have been experimentally
investigated in nonhuman organisms (2). An experimental analysis of the
behavior of human infants and caretakers entails many problems that would
appear insurmountable. One of the few experimental studies of human mother-infant
interaction was done by Rheingold (4). She manipulated the amount
and quality of caretaking in a group of institutional infants.

For the most part, however, studies of maternal and infant behaviors have 
been restricted to those that are descriptive rather than experimental. Rheingold 
(5), for example, used the method of time-sampling in studying infant and 
caretaker activities. She devised a checklist of 42 items which included the
activities of the caretakers, the infant, and the location of the infant. Each
subject was observed for eight hours, and the behaviors were sampled once
every 15 seconds for 10 minutes of each quarter-hour. Maternal care of three-month-old
infants in the home was compared with that in the institution.

The purpose of the research reported herein was to attempt to use the
time-sampling technique in a longitudinal design that measures changes in
infant and maternal behavior during the first five weeks of life. Two questions
were asked in this study: (a) Will observations within the chosen time
unit detect changes over time in the behavior and interactions of the caretaker
and the young infant? (b) Will sex differences also be revealed by this technique?


B. Method


1. Subjects 


Twelve infants (six males and six females) from a nursery of an adoption
agency were chosen on the basis of normal health at birth and present physical 
condition. This was determined by hospital reports and the judgment of the 
attending doctors and nurses at the nursery. Typically, the infant came to the 
nursery from the hospital when he was five or six days old. Observations were 
begun at seven days of age, thus giving the infant a day or two to adjust to 
the new environment. The infants were observed, therefore, at seven, 14, 21, 
28, and 35 days of age. 


2. Environmental Setting


The nursery of the Children’s Home Society at Greensboro, North Carolina, 
provided an ideal opportunity to study young infants. The nursery housed
about 20 infants ranging in age from five days to six weeks. There were four 
large rooms which contained from four to six cribs each, and two small rooms 
with two cribs each. Caretaking activities were performed by a total of five 
registered nurses, one licensed practical nurse, eight nurses’ aides, and occasional
volunteers. This staff was divided into three eight-hour shifts, with at
least one registered nurse residing at the nursery throughout the 24-hour day.


3. Method of Observation 


Two infants per day were studied during two three-hour periods (9 a.m. to 
12 p.m., and 1 p.m. to 4 p.m.). Each hour was divided into six 10-minute
periods, and each infant was observed during alternating 10-minute periods
(a total of 30 minutes per hour). During a 10-minute period, 40 observations
were made (one every 15 seconds). For each observation the observer looked 
at the infant-environment for one second, and during the following 14 seconds 
recorded what she had seen and heard during that observation. This method 
permitted four observations per minute and assured a total of 720 observations 
for each week of each infant’s first five weeks of life. 
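
The arithmetic behind the figure of 720 observations can be checked with a short Python sketch. The variable names are introduced here for illustration; the quantities themselves come from the description above.

```python
SECONDS_BETWEEN_OBSERVATIONS = 15
MINUTES_PER_PERIOD = 10
PERIODS_PER_HOUR = 6          # six 10-minute periods in each hour
HOURS_PER_DAY = 3 + 3         # morning and afternoon three-hour periods

obs_per_minute = 60 // SECONDS_BETWEEN_OBSERVATIONS      # four per minute
obs_per_period = obs_per_minute * MINUTES_PER_PERIOD     # 40 per period

# Two infants alternate, so each infant is watched in half of the periods.
periods_per_hour_per_infant = PERIODS_PER_HOUR // 2      # 30 minutes per hour

obs_per_weekly_session = (obs_per_period
                          * periods_per_hour_per_infant
                          * HOURS_PER_DAY)
print(obs_per_weekly_session)
```

Because each infant was observed on one day in each of the five weeks, the same count applies to every weekly session.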


4. Type of Observation 
The categories of behavior chosen for observation were essentially those
developed by Rheingold (5), with modifications determined by preliminary
studies of the one- to five-week-old infant. These categories, which are listed
in Table 1, included 12 items of maternal behavior, three items that described the
physical location of the infant, and 10 categories of infant behavior. In addition,
the infant’s weight was recorded each week. The categories were arranged in
a checklist, and for the most part they are self-explanatory.

All but the most necessary conversation was avoided so that the observer 
would not be increasing the amount of auditory stimulation in the nursery. 
The observer positioned herself so that she could see the infant’s face and that 
of the caretaker when the latter was administering to the infant. The ob- 
server remained outside the infant’s line of vision and out of the way of the 
nurses’ activities in order to minimize the effect of her presence. Also, observing 
two infants for alternating 10-minute periods made it less obvious to the 
caretakers which infant was being observed at any given time. 


5. Reliability 


Reliability was determined at the beginning of the study by the two ob- 
servers, who simultaneously and independently recorded items of behavior 
according to the method described above. All ages of infants and times of observation
used in this study were spanned in determining the reliability. Eight
infants were observed for a total of 1200 independent observations. Reliability
was reported as the percent of agreement for each category of behavior. Disagreement
was scored when the responses were not the same for any particular
time period. Agreement was scored when identical responses were recorded by
both observers in the same observational time-interval. No-response periods
were not included in the calculation of agreement in order to avoid artificial
inflation of reliability in the categories of behavior that occurred infrequently. 
The median percent of agreement was 81.00. 
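
The agreement index described above can be sketched in Python for a single category of behavior. The function name `percent_agreement`, the coding of each interval as True or False, and the example records are assumptions made for illustration; the original records were checklist entries.

```python
def percent_agreement(observer_a, observer_b):
    """Percent agreement for one behavior category.

    observer_a, observer_b: per-interval booleans (True = behavior recorded).
    Intervals in which neither observer recorded the behavior are skipped,
    so that rarely occurring categories do not inflate the index.
    Returns None when no interval contains a response.
    """
    agreements = disagreements = 0
    for a, b in zip(observer_a, observer_b):
        if not a and not b:
            continue                  # no-response interval: excluded
        if a == b:
            agreements += 1
        else:
            disagreements += 1
    scored = agreements + disagreements
    return 100.0 * agreements / scored if scored else None


# Hypothetical records for five intervals.
a = [True, True, False, False, True]
b = [True, False, False, True, True]
print(percent_agreement(a, b))   # 2 agreements out of 4 scored intervals
```

Had the shared no-response interval been counted as an agreement, the same records would yield 60 percent rather than 50 percent, which is the inflation the exclusion is meant to avoid.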


C. Results and Discussion


This study was designed to test the time-sampling technique as an instrument
for measuring caretaker and infant behaviors during the first five weeks
of life. It was hypothesized that developmental differences could be detected
by this technique. In addition, this study attempted to determine whether there
was differential treatment of the male and female infants by the caretakers, as
well as sex differences in the behavior of the infants.

In order to determine whether there were developmental changes and sex
differences in the observed items of behavior, all data were examined by
means of the trend analysis described by Edwards (1). An analysis of variance
for repeated measures on the same subjects was used in a 2 (sex) × 5 (weeks)
design. The significance of the main effects of sex and weeks was determined,
as well as their interaction. A trend analysis was performed for linear, quadratic,
cubic, and quartic relationships. This analysis revealed the slope and
curvature of the overall trend, as well as that of the interaction of sex and
weeks. The analysis of variance was performed for each item of behavior, and
the trend analysis was performed for each item that differed significantly over
weeks (see Table 1).
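
How linear, quadratic, cubic, and quartic components are separated for five equally spaced weeks can be illustrated with a brief Python sketch. It uses the standard tabled orthogonal polynomial coefficients for five levels; the weekly means are invented, the function name `trend_components` is hypothetical, and the sketch stops at the contrast values rather than the F tests, which would also require the error terms from the analysis of variance. It is not a reproduction of the computations in Edwards (1).

```python
# Standard orthogonal polynomial coefficients for five equally spaced levels.
CONTRASTS = {
    "linear":    [-2, -1,  0,  1,  2],
    "quadratic": [ 2, -1, -2, -1,  2],
    "cubic":     [-1,  2,  0, -2,  1],
    "quartic":   [ 1, -4,  6, -4,  1],
}

def trend_components(weekly_means):
    """Return the contrast value of each trend component for five weekly means."""
    if len(weekly_means) != 5:
        raise ValueError("expected one mean for each of the five weeks")
    return {name: sum(c * m for c, m in zip(coefs, weekly_means))
            for name, coefs in CONTRASTS.items()}

# Hypothetical weekly means that rise by a constant amount each week.
means = [10, 12, 14, 16, 18]
components = trend_components(means)
print(components)
```

For means that rise by a constant amount, only the linear contrast is nonzero, which is the sense in which the analysis reveals the slope and curvature of the trend over weeks.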

In this study, the trend analysis is used as a means for describing the data, but 
not as a tool for prediction. The limitation of this analysis to a descriptive use
is warranted in two related ways. First, a separate analysis of variance is performed
for each of the 26 items of behavior. One could expect some of these
categories to be significant by chance. Second, this chance factor is increased 
by the fact that these categories are not considered to be independent. (A 
statistical analysis revealed that 39 of 325 intercorrelational coefficients were 
significant at less than the .05 level.) Therefore, the trend analysis numerically 
describes that which can be visually represented by graphs. 
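
The counts behind this caution can be worked through in a few lines of Python. The sketch assumes the nominal .05 level for every test and uses only the figures given above; the variable names are introduced here for illustration.

```python
from math import comb

N_ITEMS = 26
ALPHA = 0.05

# Number of distinct pairs among 26 items, matching the 325 coefficients.
n_pairs = comb(N_ITEMS, 2)

# Expected number of nominally significant results if every null hypothesis held.
expected_significant_correlations = n_pairs * ALPHA
expected_significant_anovas = N_ITEMS * ALPHA

print(n_pairs, expected_significant_correlations, expected_significant_anovas)
```

On these assumptions about 16 of the 325 coefficients would reach the .05 level by chance, against the 39 reported, and roughly one of the 26 analyses of variance would do so for any single effect. These expected counts do not depend on the tests being independent of one another.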


1. Caretaking Activity 


It can be seen in Table 1 that the number of caretakers in the infant’s room 
increased over time in a linear trend which was significant. Since the number 
of caretakers increased over time, it is not surprising that the amount of talking
in the infant’s room increased in the same way. The amount of talking
directly to each infant, however, did not increase over weeks. The other caretaking
activities that showed significant increase over time were: “Looks at
Face,” “Holds,” and “Feeds.” There was a significant overall linear trend
for each of these categories, as well as a quartic curvilinear overall trend 
within the category “Holds.” 

It appears from the results in Table 1 that the caretakers in this institution 
differentiated between sexes in their treatment of the infants. This occurred 
in the categories of “Feeds” and “Pats.” Male infants were fed and patted 
more than female infants. With regard to sex-week interactions in caretaking 
activities, a significant F value resulted in the categories of “Holds” and
“Looks at Face.” In both categories the scores of the male infants across
weeks show a quartic curvature. This is demonstrated in Table 1 by the
significant quartic component of the interaction. 

Over the first five weeks of life the infant spent significantly less time in his 
crib and significantly more time in the caretaker’s arms. Both of these events
show overall significant linearity, and the latter, “In Arms,” also shows overall
significant quartic curvilinearity. The scores of male infants over weeks show


TABLE 1


Analysis of Variance and Trend Analysis
of Caretaker and Infant Behaviors


Item  Sex  Weeks  Sex × Weeks  Linear  Quadratic  Cubic  Quartic
Caretaking 1.54 219 La 
Nof Caretakers 0.25 12.19% 0,54 8 eu, 7088 6.27 0,01 
r 0.01 1.46 0.02 
Looks at Face 0.23 sue 2.78 0 32,9200 0.81 0.47 
I 0.00 0.82 10! 
Talks 0.62 3.79% 0.39 © 12.7480 0,68 
I 0.06 0.63, 
Talks to Infant 1,08 2.30 2.20 
Pata S.0088 2.50 Ln 
‘Shows Affection 1.28 Ls 2.06 
Holds 1 9.4gee 3,06¢ O 31.68 1.44 0.12 
I 0.06 3.09 0.16 
Diapers 0.10 0.53 0.88 
Feeds 5.378 dee Ooh. ise 
- tie 2N6 
AMjusts Position 0.22 1.87 1a 
Other 0.03 1.02 0.: 
In Crib ; 764 i oe 0.26 0.38 0,28 
0.80 3.76 2.678 dans it ee} were 
ag . uae 1.38 0.2 was 
2.22 1.516 2.08 On ty a see 
Seated 2,06 1.29 1.00 
Asleep 2.28 4.1908 1.88 ° 8.708 nc} 
5 er 1.00 
Hiecough 0.05 2.30 1.26 
Vooalizes 1.00 1.73 4.0608 
rad . ‘ .6uee i; 1.60 4,02, 
0.15 3.728 4. Gat 2 hs oon Tow 
Winger in Mouth 0.59 0.04 2.93 
Rottle in Mouth 0.03 0.79 0.52 
locks at ‘Toy a 0.69 1. 1.16 
86 2.68¢ 1.7% s ae ee Pare 116 
Pacifier in Mouth 0.16 0.75 0.62 
Nitoes 9.70 0.3% 0.89 
fate 0.02 0.02 
2.08 Fi 2.58 0 25.858¢ 
ey I 0.01 2.56 sherman 
Wetght o. 10.9408 6.97% 
* $3 45.3208 2788 0 972.3208 
205.3208, osTz.a eet 0.30 
a Overall trend 
b Interaction 
Loge 
a quartic trend which resulted in a significant sex-week F value within the
category of “In Crib.”
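
The linear and quartic components referred to above can be illustrated with the standard orthogonal polynomial coefficients for five equally spaced occasions. The sketch below is only an illustration of that general technique: the function name and the example means are hypothetical, and the article does not state how its own trend analysis was computed.

```python
import numpy as np

# Standard orthogonal polynomial coefficients for five equally spaced
# levels (here, weeks 1 through 5).
CONTRASTS = {
    "linear": np.array([-2, -1, 0, 1, 2]),
    "quadratic": np.array([2, -1, -2, -1, 2]),
    "cubic": np.array([-1, 2, 0, -2, 1]),
    "quartic": np.array([1, -4, 6, -4, 1]),
}


def trend_components(weekly_means):
    """Return the contrast value of each polynomial trend for five weekly means."""
    means = np.asarray(weekly_means, dtype=float)
    if means.shape != (5,):
        raise ValueError("expected one mean for each of five weeks")
    return {name: float(coefs @ means) for name, coefs in CONTRASTS.items()}


# Hypothetical weekly means (not data from the study): a purely linear rise.
example_means = [1.0, 2.0, 3.0, 4.0, 5.0]
components = trend_components(example_means)
```

With a purely linear series such as the hypothetical one above, only the linear contrast is nonzero; a nonzero quartic contrast would indicate the kind of curvature described for the male infants' scores.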


2. Infant Behavior 


It can be seen from Table 1 that the infant slept less as a function of age.
This trend was both linear and curvilinear (quartic). In addition, the results
show a significant increase over time within the categories of “Cries,” “Looks
at Toy,” “Eats,” and “Weight.” There was a significant linear relationship in
each category and a quartic and cubic trend, respectively, for “Eats” and
“Weight.” Although there were no significant sex differences, “Vocalizes,”
“Cries,” and “Weight” did show significant interaction between sex and weeks.
The scores for male infants in the category “Eats” showed a quartic trend.


3. Feeding Behavior 


From these results and from general observations of the infants and
caretakers in this institution, it can be noted that the events that vary most
significantly during the first five weeks of life and the events by which the sex of
the infant can be differentiated are, for the most part, those involving the feeding
of solid food to the infant. Caretaker-infant interaction was more pronounced
and pervasive in this event than at any other time. Feeding was accomplished by
the caretaker while she sat in a rocking chair holding the infant in her arms.
As the caretaker fed the infant, she looked at his face. The categories that
were scored during the feeding of solid food most often included: “Looks at
Face,” “Pats,” “Holds,” “Feeds,” “In Arms,” and “Eats.”

It is reasonable to assume that as the infant grew older he was fed more
often, and this resulted in a longitudinal increase of frequency of the categories
of feeding behavior. The results show, however, that there were sex differences,
as well as age differences, in the feeding of the infants by these caretakers. The
response “Feeds” was recorded when the caretaker placed a bottle or a spoonful
of food into the infant’s mouth. Since the bottle was generally propped,
“Feeds” was recorded more frequently during the observation of a solid-food
feeding period. For males, the addition of solid food to the infant’s diet occurred
one week earlier than for most female infants. Figure 1 shows that male
infants were fed more frequently at three weeks of age than females. Figure
1, however, includes both spoon and bottle feeding; therefore, Figure 2 is
presented to demonstrate the differential treatment of the sexes over weeks,
since “Eats” included only the consumption of solid food. At three weeks of
age, eating behavior was observed in five of the six male infants and in
the six female infants. At five weeks of age, eating behavior was recorded for
five male and five female infants.

It is interesting to note that there is no significant sex difference within the
category “Eats” despite the earlier onset of this activity in males. Apparently,
the relatively equal occurrences of this behavior at all but the third week of
life prevented the sex difference in frequency from reaching statistical
significance. The earlier solid-food feeding of males accounts for the sex differences


FIGURE 1
FREQUENCY OVER WEEKS OF OBSERVATION: “FEEDS”
(Mean frequency plotted by week for males, females, and total Ss)
in the categories “Feeds” and “Pats.” The latter was a behavior that the
caretaker exhibited during the feeding of solid and liquid food. This differential
treatment of the sexes also accounts for the significant interaction of sex and
weeks in the categories of “Looks at Face,” “Holds,” and “In Crib.” In
addition, this effect is demonstrated by the quartic curve of male scores for these
categories and for the categories of “In Arms” and “Eats.”

Upon questioning the nurses after the completion of this study, it was
discovered that there was no policy at this institution with regard to the addition
of solid food to the infant’s diet. Moreover, the caretakers were unaware of
their earlier solid-food feeding of male infants. Although there was no
criterion for the addition of solid food, the caretakers felt as though their decision
to include solid food was based on the need of the child as expressed by the
infants’ crying and fussing. The results are consistent with this explanation.
It can be seen in Figure 3 that males did cry more than females at three weeks
of age and that they were fed solid food (see Figure 1) more often than females
at the three-week period.

The method and categories used in this study are similar to those of Moss
(3). Contrary to what Moss suggested, the caretakers in this institution did
treat infants differently according to the sex of the child. It is interesting to
note, however, that there were no significant sex differences in behaviors of
the infants.


D. Summary 


The behaviors of six male and six female infants and their caretakers in an
adoption agency were time-sampled on each of the first five weeks of the
infant’s life. Developmental changes in infant behavior were found, as well as
differential treatment of the sexes by the caretakers. This treatment occurred
in a complex of caretaking activities and was the result of the earlier addition
of solid food to the male infants’ diets. It was concluded that the time-sampling
technique can be employed in a longitudinal design to describe the behavior
of very young infants and their caretakers.
USE OF THE SEMANTIC DIFFERENTIAL TECHNIQUE
IN RESEARCH: SOME PRECAUTIONS


Counseling Center, Brigham Young University 


RICHARD A. HEAPS


A. INTRODUCTION


The semantic differential technique has become an increasingly popular
method of cognitive measurement with broad application (13). It is important
to note that, as happens with many well-known measurement instruments,
certain assumptions about the use and interpretation of the semantic
differential have become widely accepted. For example, there are several articles
in the literature suggesting the relative invariance of the three factors
discovered by Osgood and Suci (12) and the desirability of using factor scores
when employing a semantic differential instrument (3, 4, 5, 8, 11).
Consequently, many researchers using the semantic differential technique might be
tempted to select adjectives from Osgood and Suci’s (12) lists and compute
mean or factor scores on the basis of the original factor structures or factor
loadings for the scales being used (e.g., 2, 5, 10, 14). The purpose of the
present study was to assess the advisability of such a procedure by examining
the current factor structure of a semantic differential test used under typical
research conditions.


B. METHOD
1. Subjects and Measures Used 


The subjects were 33 boys and 30 girls from three fifth-grade classes in
Salt Lake City, Utah. Each subject was given four semantic differential
measures stapled together into one test booklet. Each of the four measures
comprised the same 28 bipolar adjectives (14 seven-point scales) and
was headed by a different concept. The four concepts rated were: (a) My
bes; (b) My Parents; (c) My Best Friend’s Parents; and (d) My
Father.
2. Procedure


Test booklets were given separately to each class as a group, the
instructions were read aloud, and the subjects’ questions answered. After completing
the booklet the subjects were asked to turn it over and indicate their sex but
not their name. The procedure for each class took 20 minutes.


3. Data Analysis 


The scales were given numerical values from one to seven, inclusive, and
factor analyzed as follows: Scale scores were summed over both subjects
and concepts, generating a single 14 × 14 intercorrelational matrix of each
scale with every other scale. Since each scale was used by all 63 subjects to
rate the four concepts, every scale was paired with every other scale 252
times (63 subjects × 4 concepts). These correlations were then analyzed
with the use of the principal components method of factor analysis (6) and
rotated to simple structure with the use of Kaiser’s varimax solution (9).
It was decided beforehand that only those scales with factor loadings ≥
.50 would be discussed.
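
The analysis just described can be sketched as follows. This is a minimal illustration under stated assumptions, not the original computation: the ratings are random placeholders, the function names are hypothetical, four components are retained only because four factors are reported in the results, and the rotation is a raw varimax without Kaiser normalization.

```python
import numpy as np


def principal_component_loadings(corr, n_factors):
    """Unrotated loadings: leading eigenvectors scaled by sqrt(eigenvalue)."""
    eigvals, eigvecs = np.linalg.eigh(corr)
    order = np.argsort(eigvals)[::-1][:n_factors]
    return eigvecs[:, order] * np.sqrt(eigvals[order])


def varimax(loadings, max_iter=100, tol=1e-8):
    """Orthogonally rotate loadings toward simple structure (raw varimax)."""
    n_scales, n_factors = loadings.shape
    rotation = np.eye(n_factors)
    previous = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        column_ss = (rotated ** 2).sum(axis=0)
        target = rotated ** 3 - rotated * column_ss / n_scales
        u, s, vt = np.linalg.svd(loadings.T @ target)
        rotation = u @ vt
        current = s.sum()
        if previous != 0.0 and current / previous < 1.0 + tol:
            break
        previous = current
    return loadings @ rotation


n_subjects, n_concepts, n_scales = 63, 4, 14
n_pairs = n_subjects * n_concepts  # each scale is used 252 times

# Placeholder seven-point ratings (assumption: random, not the study's data).
rng = np.random.default_rng(0)
ratings = rng.integers(1, 8, size=(n_pairs, n_scales))

corr = np.corrcoef(ratings, rowvar=False)  # 14 x 14 intercorrelation matrix
unrotated = principal_component_loadings(corr, n_factors=4)
rotated = varimax(unrotated)
salient = np.abs(rotated) >= 0.50  # loadings that would be discussed
```

Because the rotation is orthogonal, each scale's communality (the sum of its squared loadings) is the same before and after rotation; only the distribution of loadings across factors changes.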


C. RESULTS


The rotated factor loadings are reported in Table 1.
The first factor is comparable to Osgood’s “activity” factor (12), but the
content of the three scales with the highest loadings for this factor
(slow-fast, .71; weak-strong, .70; dull-sharp, .58) emphasizes a general energy


TABLE 1
ROTATED FACTOR LOADINGS

Scale            I      II     III
heavy-light     .17   -.75   -.02
hard-easy      -.04   -.29    .52
slow-fast       .71   -.17     …
loud-soft       .11   -.72    .21
awful-nice      .21   -.15    .66
bad-good        .27   -.14    .65
weak-strong     .70    .12    .25
sour-sweet      .29   -.30    .63
hot-cold       -.01    .08   -.07
cruel-kind      .32    .02    .76
soft-hard        …     .05   -.54
unfair-fair      …      …     .76
dull-sharp      .58     …      …
level or potential for activity more than actual movement or activity per se.
For this reason the first factor was labeled “energy.”

The second factor is directly comparable to Osgood’s “potency” factor and 
was so labeled here. The scales in this factor are heavy-light (.75) and loud- 
soft (.72). 

The basic content of the third factor is included in Osgood’s “evaluative” 
factor, although one scale (soft-hard) was previously in Osgood’s “potency” 
factor and one scale (hard-easy) was not used by Osgood at all. In the present 
analysis, the emphasis of this factor seems to be behaviorally related to parental 
discipline. Consequently, the third factor was labeled “discipline.” The scales 
in this factor are unfair-fair (.76), cruel-kind (.76), awful-nice (.66), bad- 
good (.65), sour-sweet (.63), soft-hard (.54), and hard-easy (.52). 

The fourth factor contained only one scale (hot-cold, .90), but since this
scale was found to have a comparatively low reliability when used with
children in a previous study (3), this factor was not considered to be a
meaningful one.


D. Discussion 


The studies referred to in the introduction of this article enable one to see
why investigators frequently perform research using the original semantic
differential factors discovered by Osgood and Suci (12). However, the present
study identified enough variations from Osgood’s original factor structure that
a word of caution seems warranted. That is, the order of factors found in the
present study is just the reverse of that reported by Osgood, and two of the
scales (weak-strong and soft-hard) which belonged in Osgood’s “potency”
factor shifted to the present “energy” and “discipline” factors. In addition, the
present “energy” and “discipline” factors seemed to have a slightly different
emphasis than Osgood’s “activity” and “evaluative” factors. The
factor loadings for the present factors were also different from Osgood’s.

This analysis suggests that the three semantic differential factors discovered
by Osgood are relatively stable in general content only. The specific content,
factor loadings, and order of the extracted factors may be expected to vary
depending on the particular combination of rating scales, concepts rated, and
subjects used. This is consistent with Brinton’s (1) comment that “the
… and evaluative strengths of adjectives can change from one concept
to another” (p. 92).

On the basis of this information it would seem that the best way to keep
factor scores appropriately related to the particular sample being used would
be to perform a factor analysis on one’s own data and to compute factor scores
using the resultant factor loadings, rather than introducing an additional
source of error by using the factor structure and/or numerical values (i.e.,
factor loadings) obtained from another sample.

Heaps (7) suggested that factor scores may be obtained by multiplying the 
ratings for the most highly loaded scales in each factor by their factor loadings 
and summing these values, providing a weighted combination of scale scores 
for each factor. This is simpler than a multiple regression equation method; 
however, the two methods were found to correlate .93 on a sample of the 
subjects used. 
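
The weighted combination described above can be sketched in a few lines. The loadings are the three “energy” loadings reported in the results; the ratings are a hypothetical subject's responses, and the function name is illustrative only.

```python
# Loadings of the three scales that define the "energy" factor (from the results).
ENERGY_LOADINGS = {"slow-fast": 0.71, "weak-strong": 0.70, "dull-sharp": 0.58}


def weighted_factor_score(ratings, loadings):
    """Sum of each highly loaded scale's rating multiplied by its loading."""
    return sum(loadings[scale] * ratings[scale] for scale in loadings)


# Hypothetical seven-point ratings for one subject on one concept.
ratings = {"slow-fast": 6, "weak-strong": 5, "dull-sharp": 4, "hot-cold": 3}
energy_score = weighted_factor_score(ratings, ENERGY_LOADINGS)
```

Scales that do not load highly on the factor (here, hot-cold) are simply ignored, which is what makes the method simpler than a regression-based score.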


E. Summary 


This study examined the advisability of selecting and using adjectives from
Osgood’s original semantic differential factors, as frequently occurs in
research, by examining the current factor structure of a semantic differential
instrument. The present findings suggested that the three factors discovered
by Osgood may be relatively stable in general content only. The specific
content, factor loadings, and order of the extracted factors differed from those
originally reported. A simplified method for computing factor scores was
suggested.
REACTIONS OF INNER-CITY AND SUBURBAN
ADOLESCENTS TO THREE MINORITY
GROUPS


Georgia State University 


DAVID L. PATTERSON AND STANLEY J. SMITS


A. INTRODUCTION 


Researchers and theoreticians have tended to approach prejudice from two 
distinct points of view. One group has viewed prejudice as a generalized 
tendency on the part of certain types of people to regard people dissimilar to 
themselves as less desirable. This approach has produced correlational data 
showing that prejudice against one minority group is related to prejudice 
against other minority groups (6, 9). Attempts to pinpoint the etiology of 
this generalized tendency led to descriptions of the close-minded, authoritarian,
ethnocentric personality (1, 2, 7) and to the experiences that produced him
(16, 17, 19). The second group has concentrated its efforts on the unique
aspects of specific prejudices, such as anti-Negro and antisemitic attitudes.
Their efforts led to improved descriptions of the ingredients of specific
prejudices, as well as to a clarification of their etiologies and consequences (3, 4,
5, 8, 11, 13, 14, 15, 18, 21).

The implications of the previous research and theorizing about the nature
of prejudice may be summarized very briefly as follows: “Prejudice” is a
complex phenomenon which has affective, cognitive, and behavioral
components; which has part of its etiology within the respondent and is therefore
generalized from outgroup to outgroup; and which has part of its etiology in
experience and therefore is a learned, specific response to a given stimulus.
This study investigated responses to prejudicial statements about three
minority groups whose origins are found in external, inherited characteristics
(racial); internal values and beliefs (religious); and actions (behavioral).
The purposes of this study were (a) to assess the impact of demographic
characteristics and experiences, such as integrated versus segregated
education, on the generalized tendency to respond to outgroups in a prejudiced
manner and (b) to investigate the specificity/generality of prejudicial
responses when three types of minority groups are used, including one with
which the subjects were assumed to have a positive affiliation.


B. METHODOLOGY
1. Subjects 


The subjects of this study were high school seniors from two schools in the 
Atlanta area. The two schools were selected because of their differences, as 
follows: 

School A is located in a middle-class suburb. It is a public school with an 
enrollment of 1583. At the time of this study, there were only seven black 
students in the school (.4%), while 23% of the students were Jewish. 
Eighty-six volunteers, from a total of 137 who were contacted, completed 
the research instruments. 

School B is located in the inner city near a large ghetto area and near the 
hippie community. It is a Catholic parochial school with a total enrollment 
of 402. At the time of this study, 28% of the students were black. Ten per- 
cent of the total enrollment was Protestant. Fifty-six volunteers, from the 
senior class of 101 students, completed the research instruments. 

The subjects were identified with respect to sex, race, the nature of their
educational experience, religious affiliation, and social class. Students who
were in their fourth year of attendance at School A were operationally defined
as having had a segregated educational experience, whereas students in their
fourth year of attendance at School B were defined as having had an integrated
high school experience. Subjects were assigned a social class label on the basis
of their parents’ educational attainments. Subjects were labelled as “upper
class” if both parents had been graduated from college, as “middle class” if
both parents had been graduated from high school, and as “lower class” if
one or neither parent had been graduated from high school.
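
The labelling rule above can be written as a small function. This is a sketch under one stated assumption: a parent who was graduated from college is also counted as a high school graduate. The level names and function name are hypothetical.

```python
# Ordered educational attainment levels (names are illustrative only).
EDUCATION_LEVELS = {"no_diploma": 0, "high_school": 1, "college": 2}
CLASS_LABELS = ("lower class", "middle class", "upper class")


def social_class(parent_a, parent_b):
    """Label a subject by the lower of the two parents' attainments."""
    lowest = min(EDUCATION_LEVELS[parent_a], EDUCATION_LEVELS[parent_b])
    return CLASS_LABELS[lowest]


label_upper = social_class("college", "college")
label_middle = social_class("college", "high_school")
label_lower = social_class("high_school", "no_diploma")
```

Taking the minimum of the two attainments reproduces the three cases: both college graduates, both at least high school graduates, and one or neither a high school graduate.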


2. Instrumentation 


The Intergroup Relations Scale (IRS) developed by Chesler (6) was used
with an accompanying personal data sheet to gather the data used in this
study. The IRS is a Likert-type scale on which the subject indicates his
agreement or disagreement with each statement on a six-point scale. Chesler
had judges select items from the Levinson-Sanford Anti-Semitism Scale (12)
and from the California Ethnocentrism Scale (1) which had discriminated
well in previous studies and which were worded in such a way that a variety
of minority groups could be randomly assigned to the statement, as in the
following examples:


The ____ must be considered a bad influence on our culture and
civilization.

The ____ should make sincere efforts to rid themselves of their
conspicuous and irritating faults.

No matter how Americanized a ____ may seem to be, there is always
something basically wrong underneath, a loyalty to his own group that
is never totally changed.


Chesler reported a corrected Spearman-Brown split-half reliability coefficient
for a college sample of .91, and a Cronbach Alpha of .66. He concluded from
the alpha of .66 that “there is considerable homogeneity among responses to
the items on the IRS” (6, p. 880).

The 34-item IRS was used in this study in the following manner. First of
all, one item was randomly selected for exclusion. The racial, religious, and
behavioral minority groups were then randomly assigned to the remaining 33
items so that each group was represented by 11 prejudicial statements.
“Puerto Ricans” and “Mormons” were selected for the racial and religious
categories because they represented groups with which the subjects were
unlikely to have had either close contact or emotional ties. “Hippies” were
selected as the behavioral minority group. Unlike the “Puerto Ricans” and
“Mormons,” the Hippies were a minority group with which the subjects were
expected to have had both contact and emotional ties.


3. Procedures


During November 1970, the subjects from both schools were given the
IRS and a personal data sheet and asked to complete them in group settings,
such as classes and home rooms. With the exception of the subject’s race,
which was recorded by the examiner when the instruments were turned in,
all data were obtained through self-report and were recorded by the subjects
themselves. Identification numbers were used rather than names, and the
subjects were assured that their responses would not be associated with them by
name.

The responses were coded, scored, and analyzed to answer the following
questions:

1. Did these subjects respond to statements of racial, religious, and
behavioral prejudice in a generalized manner; i.e., are the ratings assigned to
the three minority groups significantly intercorrelated?

2. Did these subjects respond to statements of racial, religious, and
behavioral prejudice in a specific manner; i.e., are the mean ratings assigned to
the three minority groups significantly different from one another?

3. Are the ratings assigned to the three minority groups significantly related
to the sex, race, integrated/segregated educational experience, religious
affiliation, and social status of the subjects?


C. RESULTS


Before scrutinizing the data pertaining to the three research questions, it
is important to clarify the degree of prejudice actually exhibited by the
subjects. As a group, the subjects’ ratings indicated that they “disagreed
pretty much” with the prejudicial statements regarding Puerto Ricans,
“disagreed a little” with the statements regarding Mormons, and “agreed a little”
with the statements regarding Hippies. The mean scores for the various groups
tend to obscure the average ratings given to each item.

The correlation coefficients (r) for the three sets of ratings are shown
in Table 1. Each of the positive coefficients is significantly different from


TABLE 1
INTERCORRELATIONS AMONG SCORES ON THE RACIAL, RELIGIOUS, AND
BEHAVIORAL SUBSCALES (N = 142)

Subscales      Racial    Religious    Behavioral
Racial           —
Religious        …           —
Behavioral     .60***     .51***          —

*** p < .001.


zero (p < .001), thereby answering Question 1 in the affirmative. The
coefficients indicate a general response by these subjects to all three minority groups.

A comparison of the mean ratings assigned to the three minority groups
is shown in Table 2. The amount of prejudice expressed toward the
behavioral group is significantly greater than that expressed toward the religious
minority group (p < .001). However, the amount of prejudice expressed


TABLE 2
COMPARISON OF SCORES ON THE RACIAL, RELIGIOUS, AND
BEHAVIORAL SUBSCALES (N = 142)

Scores compared                                                              t       df
Racial (X = 26.15, SD = 10.57) vs. Religious (X = 29.11, SD = 12.43)       -2.16    282
Racial (X = 26.15, SD = 10.57) vs. Behavioral (X = 37.53, SD = 11.92)      -7.90    282
Religious (X = 29.11, SD = 12.43) vs. Behavioral (X = 37.53, SD = 11.92)   -5.47    282
toward the religious minority group is significantly greater than that expressed
toward the racial minority group (p < .05). These results indicate a specific,
or selective, response to the three minority groups in addition to the general
response indicated in Table 1.
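
As a rough check on how a t value of this kind can be obtained from summary statistics, the sketch below applies the usual equal-n two-sample formula to the racial and religious subscale means and standard deviations. The formula is an assumption, since the text does not state how its t values were computed; it yields df = 282 and a value close to the first comparison in Table 2, but the other reported values need not follow from it exactly.

```python
import math


def t_from_summary(mean_1, sd_1, mean_2, sd_2, n):
    """Two-sample t and df from means and SDs, with n scores in each set."""
    standard_error = math.sqrt((sd_1 ** 2 + sd_2 ** 2) / n)
    return (mean_1 - mean_2) / standard_error, 2 * n - 2


# Racial versus religious subscale scores, N = 142 (values from Table 2).
t_value, df = t_from_summary(26.15, 10.57, 29.11, 12.43, 142)
```

The negative sign simply reflects the order of subtraction: the racial subscale mean is the lower of the two.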

The subjects studied here are described by the demographic characteristics 
shown in Table 3. Twenty-one of the 22 black subjects came from School B. 
The results of the analyses indicate that the subjects who attended the 
integrated school for four years are significantly less prejudiced toward the 
racial and behavioral minority groups (p < .001). Religious affiliation is 
significantly related to responses to the behavioral minority group (p < .05). 
In this sample, sex, race, and social class appear to be unrelated to ratings 
made on the IRS to the three minority groups. 


D. Discussion 


The data in Table 1 are consistent with the significant intercorrelations 
often found when more than one minority group is rated by a given group of 
subjects. A response tendency, either idiosyncratic to the subjects or built into 
the instrumentation, may obscure the fact that differences occur in the amount 
of prejudice expressed toward the stimulus groups. As Allport (2) pointed out 
in his attempts to explain the meaning of these significant, but “imperfect” 
correlations: “In these instances we must conclude that prejudice cannot be 
explained entirely by approaching it at the level of generalized personality 
Structure and dynamics. Situational, historical, and cultural factors are also 
important” (p. 72). The factors operating in this sample produced results
indicating greater prejudice toward “Hippies” than toward “Mormons,” and 
greater prejudice toward “Mormons” than toward “Puerto Ricans.” A logi- 
cal explanation is that these subjects have assimilated current middle-class 
values which make racial prejudice unacceptable, but which, in turn, make 
the behavior of the Hippie a target for scorn. 

The comparable negativism with which these high school seniors reacted to 
the “Hippie” was unexpected. In fact, a positive response was anticipated 
from the mod dress and antiestablishment comments commonly encountered 
among high school students. It seems that the achievement-oriented middle- 
class values were strong enough in both groups to elicit more negative reac- 
tions to a minority group that publicly rejects this value system. The inner-city 
adolescents were probably attending School B because of its college-preparatory 
academic program. 

If the greater negativism expressed toward the “Hippie” was the result
of accepting many of the values held by their parents, then the rejection of
the “Mormons” to a greater degree than the “Puerto Ricans” follows
logically from the same system. In adhering to their own religious beliefs, the
subjects were expected to deny competing religions. They merely
overgeneralized. This explanation, while generally consistent with the research
literature, is inconsistent, from the standpoint of logic, with a finding by Allport
and Ross (3) that “people who are indiscriminately pro-religious are the
most prejudiced of all” (p. 432).

The results shown in Table 3 seem to demonstrate the positive impact of 
attending integrated high schools. Subjects whose educational experiences in 
high school were defined as integrated are significantly less prejudiced toward 
the racial and behavioral minority groups. However, since School B is a 
private institution, the parents of these subjects deliberately chose to send their 
children to an integrated school. These families, therefore, may be atypical in
values and attitudes in the first place. However, these data do allay the fear 
that integration will result in increased prejudice. On the contrary, the results 
support those who contend that contact with minority groups has an ame- 
liorating effect on attitudes (8, 10, 13, 23). The data in Table 3 suggest 
that the students who experienced an integrated educational setting may have 
acquired a more positive, or accepting, attitude toward other minority groups. 

The significant difference among the Catholic, Protestant, and Jewish sub- 
jects regarding the ratings given to the behavioral minority group is difficult 
to explain. One possible explanation is the proximity of many of the Catholics 
to the hippie community. Another possibility is the difference in emphasis 
assumed to exist among the three religions regarding the importance of pro- 
ductivity. And finally, the difference may have resulted from the commonality 
of antiauthoritarianism prevalent among the hippies and currently increasing 
within the Catholic church.

The absence of significant differences in IRS ratings among the three social 
class levels is inconsistent with the results obtained by Westie and Westie 
(22). However, the absence of significant differences in the ratings given by 
blacks and whites is consistent with findings by Simon (20) that middle-class
whites and blacks aspiring to middle-class status tend to “dispense with
discrimination” (p. 192). The blacks in this study often chose to attend School
B as an entry into college.
PSYCHOVECTOR ANALYSIS: A NEW DISCIPLINE
WITHIN THE BEHAVIORAL SCIENCES


Behavior Analysis, Inc. 


HUNTER B. SHIRLEY


A. INTRODUCTION 


In dealing with the evolution of systems, Rashevsky (11) offered a hypothesis
of singular promise. His hypothesis is best summarized by Pask (9, pp.
104-5).


The organism, regarded as a control system, can be mapped on to an 
image wherein all metrical properties are discarded, but all “structural” 
relations preserved. The image will be a graph of the kind we use to depict 
states, only in this case the nodal points represent biological properties such 
as “feeding” and “secretion.” Rashevsky says that the various graphs which 
have arisen by evolution can be transformed into one another—which is 
incontrovertible if we accept the regularity of the real world—and are 
derivable from a primordial graph by repeated application, representing 
stages in evolution, of a single topological transformation T, some parameters
of which are variable. For reasonable choices of T the primordial
graph is a homomorph of any later graph and it is possible to work backwards.
Now T is unknown but presumably it can be discovered by judicious
comparison between the mathematical possibilities and the biological
facts in order to obtain a “best fit.”


The notion of discovering a prototypal pattern representative of basic life
processes which could be delineated as a topological network is no mean goal,
even for a science like cybernetics which sets its level of aspiration so high.
The pursuit of this goal is not a quixotic one, however; on the contrary, it
is a very real necessity dictated to us by the present advance of knowledge.

Unless one closely examines Rashevsky’s hypothesis, however, one tends to
think it has already been partially achieved in Ashby’s Design for a Brain
(1), in Gabor’s “A universal non linear predictor and simulator which
optimizes itself by a learning process” (5), in McCulloch’s studies of neural
nets (6), or even in Pask’s “Natural history of networks” (8). Glancing
back over Pask’s summary of Rashevsky’s hypothesis is sufficient in itself,
though, to disabuse us of this hope. Obviously the kind of topology Rashevsky
has in mind is closer to Plutchik’s (10, p. 110) “structural model of the
emotions” with its primary, secondary, and tertiary dyads, or to Blum’s (2,
p. 14) “overview of a conceptual model of mental functions.”

These latter two candidates do not meet the qualifications either, however, 
since a primordial network designed to Rashevsky’s specifications would have 
to be infinitely regressible (back to the single-celled animal) and would need
to be both more fundamental and more detailed than these two cursory 
models. 

Ideally the kind of model we should have to design in order to achieve our 
goal would be one in which the essential subsystems of the animal control 
system were related in a systems analysis (as Blum and associates attempted 
to do), and then were each further related in a series of hierarchical
networks consisting of a logic and circuit design of each major subsystem (as
Plutchik attempted to do for the emotional system, unfortunately without using
cybernetics as an organizing concept).

In pursuit of the ideal posed by Rashevsky, the author spent four years
doing naturalistic field studies of inhabitants of small villages, performing
personality ratings both within and outside of mental institutions, and
working endless hours attempting to synthesize past findings in factor analysis
and trait theory, eliminating overlap, and ordering essential control signals
into appropriate subsystems. During this period of full-time research, most
of it spent in Europe and North Africa, the author corresponded and
collaborated with such eminent investigators as Hans Eysenck, Gordon Allport,
Sidney Cobb, and Juliette Favez-Boutonier. The results of these years of
research include a systems analysis and circuit design of the basic subsystems
of personality, which comprise, to the author’s best knowledge, the


responses to psychopathological responses. Among its many uses, the model
has demonstrated itself to be a highly efficient interviewing device, providing
a means of systematizing observation. It also promises to serve as an important
new clinical tool, as a diagnostic aid and a psychotherapeutic guide for
explaining to patients the identity of their characteristic response patterns and
the resulting lines of behavior likely to flow from these responses.

A more detailed report of these findings and their theoretic background has
been submitted to Genetic Psychology Monographs (14). Meanwhile this
paper shall be restricted in content to pursuing the essential purport of
Rashevsky’s ideal, that of delineating a primordial graph that might serve as
a topological net representative of primordial processes characteristic of all
animal species.


B. THE PRIMORDIAL PROCESSES


It seemed clear from the beginning that if Rashevsky’s hypothesis were 
correct, if the life processes could be reduced to a primordial graph repre- 
senting a quintessential evolutionary network, then the evolutionary line 
would descend (regressively) from the emotional system. This likelihood is 
dictated by the simplest dynamics of animal behavior, most dramatically 
illustrated on the lowest phylogenetic level. If we define emotions (as we 
shall) as directional signals which function as internal stimuli to induce the 
organism to continue or modify its line of behavior, then it is clear that the 
amoeba experiences such state transformations in retreating from a painful 
stimulus (needle prick), or approaching a pleasureful one (warm light), or 
even attacking a restraining one (pushing its way through the enclosing walls 
of an encapsulating alga in order to escape confinement). That one-celled
animals lack nervous systems does not, of course, mean that they lack means of
processing information nor that this information cannot be essentially
affective in content.

In attempting to isolate the primordial processes—the essential determinants 
from which all more complex processes ensue—it was necessary to search for 
a more basic pattern-producing set than any heretofore used. Inferring as
basic processes certain instincts or drives—the way McDougall (7), Scott
(12), Plutchik (10), and many others have done—without isolating from
the environment a causative pattern-producing set would be equivalent either
to begging the question or to claiming from the reader privileges of a priori
credence.

The most fundamental pattern-producing set in our universe appears to be 
space-time itself. Though space-time is evidently a continuum, its basic
attributes are reducible to four. If we suppose the attributes of space-time
to be not length, width, height, and time, but choose rather the determinants
of space-time as its attributes—namely, mass, distance, velocity, and
deviation (alteration of relationship)—then it is possible to determine the
primordial processes of a living organism from the environmental variables to
which it must be designed to respond.

From a relativistic point of view it is of course arguable that one or more
of these attributes need not de rigueur exist for the four dimensionality of
the universe to be maintained. For our purposes, though, it is sufficient to
note that these are the existing attributes that are central to nearly all
measurement techniques devoted to defining and quantifying space-time.
Similarly, it is to these gross attributes of space-time—not the microcosmic
ones—that living organisms, through their earliest phylogenetic states, have
been conditioned to respond. It might be said that living organisms have, as
it were, shaped their own inner variables in response to the gross variables
imposed upon them by their environment. As Ross Ashby (1, p. 54) puts it, in a
highly complex probabilistic machine such as a living organism, “A variety of
disturbances will therefore evoke a variety of matched reactions.”

These primordial responses can be isolated easily enough once we establish
the basic environmental processes that are derivable from these four basic
attributes of space-time. These latter processes can be identified by the
following cause and effect sequences.

If mass exists in any quantity, there must exist some process that enforces
such a coagulation of energy. This process could, not unreasonably, be called
that of unification. If the relationship of mass in distance is such that
distance is used as a dynamic of relationship between the units of mass, then
the process that enforces this dynamic use of distance might be termed
affinity. The causative process ensuring movement (velocity) we might identify
as mobilization. Deviation, or the tendency for mass, velocity, and distance
to vary their relationship patterns, rather than remaining constant, might be
identified simply as variety.¹

With the identification of these four processes, we have isolated primordial
dispositions of the universe to relate its parts in certain precise ways that
appear characteristic. Why these processes exist we do not know, but the
effect of these elemental processes on all life forms (on all systems, in fact)
can be calculated by determining their variables and correlating the summation
of their compound effects. The product of these two procedures will, of
course, be a topological net—a primordial one such as Rashevsky suggested
must exist as the fundamental structuring agent of all life systems.


Consider the following possible set of variables which appear inevitable 
correlates of these fundamental environmental processes: 


Dispersion ↔ Unification ↔ Compression;
Hypermobilization ↔ Mobilization ↔ Immobilization;
Hyperaffinity ↔ Affinity ↔ Hypoaffinity;
Chaos ↔ Variety ↔ Monotony.


1 For detailed interpretation of how the four attributes of space-time serve as
a pattern-producing set imposing primordial processes upon life
C. THE STATE GRAPH


When these variables act as stimulating agencies upon a living organism,
it is likely that its responses (developed in order to protect and optimize
itself) would approximate the states we find in the simple state graph shown
in Figure 1.

A brief inspection of this state graph will provide certain interesting
information to the student of life systems. First, the organic responses,
which this graph indicates to be characteristic of living systems responding
to the basic variety of stimuli existing in any typical environment, provide
the organism with a number of activities sufficient to achieve all functions
necessary to life. Second, they provide the organism with primordial processes
of sufficient variety (a cybernetic requisite of an ultrastable system) not
only to modify its behavior but even to alter its inner states in order to
adapt to the environment. Third, it is evident that, by the simple expedient
of coupling any two or more of the variables from adjacent or opposite
subsystems, the variety of possible conduct is greatly extended.


D. A CIRCUIT DIAGRAM OF THE EMOTIONAL SYSTEM


It is clear, for example, that an amoeba or a Paramecium will perform at 
least these basic behavioral acts in relating itself to its environment. Coming 
up the evolutionary scale, though, we could say with assurance that man could 
experience all possible correlations of these basic states. Positing a network 
that might be roughly iconic to man’s emotional system, we might identify 
our operant variables in accordance with Figure 2, where pride represents, 


[Figure 2: circuit diagram; surviving labels include Pathos, Affection,
Hauteur, Distraction, Curiosity, and Disdain.]

FIGURE 2
THE BASIC EMOTIONS


the measurement technique involving the observational analysis of
Note that vectors² at left and bottom represent basic anxiety responses;
those at right and top, basic aggressivity; and the center vectors represent
the normal mean balances, emotions the client should experience the majority
of the time. Partial ambivalences (attraction-anxiety, attraction-aggression)
provide the nuances necessary for directional changes. Total ambivalences (not
represented here) of anxiety-aggression couplings comprise the neurotic
ambivalences, such as resentment, envy, cynicism, etc. Listed beneath the
circuit diagram, one notes that equal force of all four aggressivities would
yield hate; all four anxieties, grief; all four attractions, love. Specific
forms of these master emotions would occur when, for example, dread dominated
consternation, yielding guilt; antagonism dominated suspicion, yielding
animosity; etc.

This circuit design of the emotional system provides the clinician a most
subtle, patterned approach to observing affective change in a client, testing
specific areas for blocks or hyperfunction, as well as furnishing for the first
time the detailed view of the emotional system necessary in order to plot
a critical path analysis of a client. As Cherry (3, p. 238) observed, “a
state-description represents a ‘strongest’ factual statement that can be made,
within a given universe of discourse; it uniquely places a set of system
points within the cells of attribute space.”

It is clear that this network of retreat-approach-attack directional signals 
comprises that part of the control system that provides the motoric drives for 
the organism. This system fulfills a vital function in the data processing 
necessary to regulating information flow, since the cognitive system (probably
a refined evolutionary product of the emotional system) cannot automatically 
act as decider without first consulting the affective tone stored in memory 
(concomitantly with the memory traces relating the matter under question) 
before it would be possible to know what line of behavior might be good for 
the organism at the present moment. 

Another system, that of hedonic tone—or the activation levels of psychic 
energy, as Duffy (4) likes to call it—serves as encoder of affect stored in 


2 In physical theory, anything that produces movement characterized by direction
and magnitude is spoken of as a vector. It seems appropriate, therefore, also to
consider all psychological variables that are involved in the direction and
magnitude of behavior as vectors—or psychovectors, if one wishes to distinguish
them from physical vectors. As a semantic tool for the behavioral scientist, the
word psychovector seems in many ways superior to the word trait. Most
personality factors that serve the [...] function of guiding and mediating
behavior, as well as those that function as drive components in initiating or
maintaining behavior, are really control [...] rather than “enduring
characteristics” or psychic entities contained by personality. For this reason,
the term psychovector analysis has been employed [...] expressive signals as
evidence of ongoing control functions.
FIGURE 3
NORMAL EMOTIONS (DIRECTIONAL FORCES, MOTIVATION)


memory. By serving as a monitor of the energy flow into the system, and feeding
back the information as to whether the flow of energy into any one of the
four basic subsystems of the emotional system is being processed through a
pleasure or pain center in the brain (which will to some degree dictate whether
the emotion being experienced is that of anxiety, enjoyment, or aggressivity),
hedonic tone summates all precise affect memory traces in binary code of one
(yes) for pleasure, zero (no) for distress. This facilitates retrieval of
affective information from memory and serves, moreover, the invaluable function
of keeping the emotional system informed as to the state of its energy
availability and the intensity (magnitude) of its states. Thus hedonic tone
plays a vital role in homeostasis.

The phenomenological identification of the discrete value states of hedonic
tone which was accomplished by the present investigator is as complete as that
of the emotional system. Other systems, such as those of impulsion (emergency
arousal), attitude, cognition, etc., are similarly treated to this precise and
exhaustive phenomenological white box design in the new discipline which
the author has chosen to call psychovector analysis.


E. PSYCHOVECTOR ANALYSIS


Little known so far and unpublished, psychovector analysis consists of a
systematic attempt to bring the whole sphere of feeling and emotion within
the confines of observational methods and measurement techniques.
Psychovector analysis constitutes a methodology for organizing and improving
scientific observation of affective phenomena as they take place in man and
animals. The high effectiveness of this methodology rests upon the cybernetic
model of the human control system developed by the author. This model
serves as the instrument that structures observation, leading the behavioral
scientist toward a systematic and rigorous set of observations which are
exhaustive in their extensiveness, while still providing insight into the
deeper mechanisms that accompany affective experience. The chief
characteristics of psychovector analysis are listed numerically as follows:

1. The model of the human control system which has evolved as the
central contribution of psychovector analysis constitutes the only model of its
kind in existence, the first model in which behavioral responses are intimately
linked to the personality variables that elicit them.

2. The psychovector model of the human control system is divided into
systems and subsystems and subsystems within subsystems—all completely
specified and ordered in an intricate hierarchy of effects which for the first
time represents personality with requisite variety.

3. Vector space relating to each major affective signal has been explored
and charted.

4. Affective signals are treated as psychovectors, and a circuit design of
each basic system is achieved in such a way that the precise relationship of
each psychovector with one another is ascertained and represented within the
circuit network.

5. These intricate circuit designs provide the scientific observer a means
of structuring his observations during an interview and of varying with
accuracy the stimulus parameters of the independent variables he uses in
measuring dependent variables.

6. The detailed state graphs of this model offer clinicians the opportunity 
of using the circuit design of each system as a checklist for ascertaining the 
health of basic response areas and diagnosing malfunction such as blocks, 
hyper- or hypoactivity in a given area. 

7. The psychovector model not only incorporates all the latest research 
in psychology, psychiatry, and psychoanalysis within the confines of its net- 
work matrices, but it does not outrage common sense and it is relatively easy 
to use despite its intricateness.

8. The psychovector model has been designed as a cybernetic model of 
personality, making it ideal for use by such other disciplines as anthropology, 
sociology, industrial training, etc. for the simulation and interpretation of 
behavior (13). 

9. Being an extremely complete model of the control system, the state 
graphs, which undergird the methodology and techniques of psychovector 
analysis, also promise to serve as both an extremely sophisticated computer 
model of personality and an ideal matrix for the development of an android 
that can recapitulate most human control functions. 

10. The potential of this model to serve as a diagnostic instrument for 
identifying affective malfunction and suggesting etiology and kind of therapy 
needed has been tested in two large psychiatric hospitals abroad and one state 
psychiatric hospital in America, as well as in private clinical practice. All 
reports indicate that this personality model may—through its tracing of
normal emotions as their effects graduate past thresholds into their abnormal
states—have uncovered some of the basic mechanisms that are instrumental
in causing mental illness.

No series of diagrams has ever served the clinician and psychotherapist in
the precise and efficient way that those of psychovector analysis promise to do.
By using current terminology as a system of adequate calculus which defines
its own symbols by their position within the network (thus acting as a
servomechanism which services its own symbols), psychovector analysis provides
the counselor and psychotherapist with his first cybernetic instrument for
reading affect and delineating personality. Furthermore, by tracing normal
emotional levels past their thresholds into abnormal responses, lines of
behavior leading to emotional malfunction have been isolated and set forth in
other state graphs (integrated subsystems which intimately correlate with
the state graph presented in this article).
F. CONCLUSION


Psychovector analysis represents a new personality theory—or perhaps it 
would be more apt to say a cybernetic model that is simply large enough to 
contain all of the old personality theories within its confines. Soon to be 
made available too as a standardized interviewing instrument, psychovector 
analysis offers large promise in the area of non-Husserlian phenomenology. 
As Rashevsky pointed out, an iconic network—a white box depicting pri- 
mordial life functions—can be designed which defines vector space with verbal 
symbols that are capable of relating value sequences and determining the order 
of hierarchical transformation series. 
AMOUNT OF BODY MOVEMENT AS AN INDICANT OF
TASK-IRRELEVANT TEST ANXIOUS RESPONSES** 


Langley Porter Neuropsychiatric Institute, University of California Medical Center 
C. R. SNYDER 


A. INTRODUCTION 


A popular theoretical position regarding the high test anxious individual is
that he responds to stressful classroom-type learning situations with behaviors
that are irrelevant to, or interfere with, task completion (1, 4). Typically,
the task-irrelevant behavior of the high test anxious S is conceptualized as an
internal cognitive process, whereby the high test anxious S is characterized by
more rumination, self-deprecation, and negative affect than the low test
anxious S (2, 5, 6, 8). There has been little research, however, examining the
possibility that the high test anxious S may manifest external physical signs
of task-irrelevant behavior. To date, only two studies have examined this
possibility, and the results differ. Mandler and Sarason (4) found a
significant positive correlation between test anxiety and amount of observed
body movement, while Snyder and Ray (9) report a small and nonsignificant
relationship between these two measures. By use of a large sample (n = 251),
the purpose of the present study was to ascertain whether the high test
anxious S exhibits more physical body movement in the test-like situation than
does the low test anxious S.


B. METHOD

1. Subjects

All Vanderbilt University students in the four Fall sections of Introductory
Psychology were given the State-Trait Anxiety Inventory—STAI (10)—as
modified by Snyder and Katahn (7) to estimate test anxiety. Under the
modified instructions, the S imagines that he is about to take a classroom
test, and responds to the STAI questionnaire items accordingly. This modified
version of the STAI is best conceptualized as a trait disposition to react with
anxiety in the test-taking situation. Snyder and Katahn (8) have demonstrated
that this trait disposition to react with test anxiety, as measured by the
modified STAI, accurately predicts ongoing state anxiety during the test-taking
situation. The modified STAI was distributed and collected by the teaching
assistants in each section in order to minimize the possibility that the Ss
would make any connection between the STAI and the subsequent experimental
task. Two hundred and fifty-one male students participated in the subsequent
experimental task as part of their course requirement.

* Received in the Editorial Office on November 5, 1971, and published immediately


2. Experimental Procedure 


Groups of six to 10 Ss were administered a verbal learning task in a
classroom in the psychology department. The complex verbal learning task,
described elsewhere (7, 8), was utilized because of its potential similarity
to the learning process in the typical classroom situation. Three trials were
given. Subjects were told that they would (a) fill out a short questionnaire
describing their feelings at the moment in the experiment, (b) study
information presented by slides on the screen, and (c) supply fill-in-the-blank
answers to questions over the information. Therefore, each trial consisted of
(a) a self-report phase in which the S was given 60 seconds in which to
complete the self-report scale; (b) a study phase in which the 16 items
covering Hullian learning theory were presented, each visible for eight
seconds; and (c) a performance phase in which the 14 questions were presented,
each visible for eight seconds, with instructions to write the answer. It must
be emphasized that the verbal learning task was employed only to establish a
stressful classroom test-like situation, and performance on the learning task
was not the focus of the present study.

The self-report scale completed at the beginning of each trial consisted of
the “negative” affective words scornful, anxious, shy, angry, ashamed, fearful,
discouraged, and irritated, and the “positive” affective words joyful and
attentive. The S could vary his response from 1 to 9, with 1 denoting “little”
and 9 denoting “strongly” the degree to which each word was descriptive of
his experience at the moment in the experiment. These affects represent the
words with the highest factor loading for each of the seven negative and
two positive affect factors of the Differential Emotion Scale—DES (3)—
developed on Vanderbilt Ss. Anxious was added to the self-report scale
because of its possible relationship to test anxiety as measured by the STAI.

Unknown to the Ss, the E rated each S’s amount of overall physical
movement in the self-report, study, and performance phases of each of the
three trials from the back of the classroom. Amount of body movement was based
upon a subjective observational estimate that the E made on the basis of
leg, trunk, arm, hand, and head movements taken together. The following
scale values were used: 0 = none or slight; 1 = moderate; 2 = strong.
Previous research (9) employed this same observational procedure for
measuring the amount of physical movement, and the interrater reliabilities on
two separate tasks were .85 and .90. One of the raters of this initial study (9)
was utilized in the present study. The sequence in which Ss were observed
was the same for the three phases of a given trial, but Ss were observed in
different sequences on trials 2 and 3. Subjects were observed for 15-second
intervals.

C. RESULTS

The STAI correlated positively and significantly (all p < .001) with
all negative ongoing self-reported affects, summed over the three trials. The
STAI exhibited the following correlations with negative affects: fearful =
.29; scornful = .23; irritated = .34; shy = .23; discouraged = .41; angry
= .26; ashamed = .17; anxious = .18. Additionally, the eight negative
affects were summed and correlated .34 (p < .001) with STAI scores. The
STAI correlated negatively and significantly (p < .01) with the two
following positive affects: attentive = -.16; joyful = -.17. The two positive
affects were summed and correlated -.21 (p < .001) with STAI scores.

The STAI did not correlate significantly with movement during the
self-report (r = .01), study (r = .05), or performance (r = -.02) phases of
the classroom-type learning situation.
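
As a rough check on the nonsignificant movement correlations, the sketch below converts each reported r into an approximate two-tailed p value by Fisher's z transformation. It assumes that all 251 Ss contributed to every correlation (the per-phase n is not stated), and the name `fisher_z_p_value` is illustrative only; this is not the procedure the author reports using.

```python
from math import atanh, sqrt
from statistics import NormalDist


def fisher_z_p_value(r: float, n: int) -> float:
    """Approximate two-tailed p value for H0: rho = 0 (Fisher's z)."""
    # Under H0, atanh(r) is roughly normal with standard error 1/sqrt(n - 3).
    z = atanh(r) * sqrt(n - 3)
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


N = 251  # assumption: every S entered every correlation

# Correlations between STAI scores and rated movement, as reported.
movement_r = {"self-report": 0.01, "study": 0.05, "performance": -0.02}

for phase, r in movement_r.items():
    print(f"{phase}: r = {r:+.2f}, approximate p = {fisher_z_p_value(r, N):.2f}")
```

With n = 251, all three movement correlations give p values well above .05 under this approximation.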


D. DISCUSSION

The fact that all ongoing negative affects correlated positively, and ongoing
positive affects correlated negatively with STAI scores indicates that the
trait of test anxiety as measured by the modified STAI accurately reflects
ongoing self-reported affects during the classroom-type learning situation.
These results are consistent with other results in our laboratory (9). The
inclusion of ongoing self-reported affects in the present study, then,
corroborates the fact that the high test anxious Ss were feeling (cognitively)
more negative and less positive affects during the test-like situation.

The STAI did not exhibit any significant relationships to amount of body
movement in either the self-report, study, or performance phases of the
classroom-type situation. These results, taken together with previous findings
in our laboratory (9), indicate that high test anxious Ss do not exhibit any
more body movement in the classroom test-like situation than do low test
anxious Ss. A probable reason for this finding is that the task-irrelevant test
anxious responses may occur primarily at the internal cognitive level. That
is, the high test anxious individual may be unable to achieve task completion
(i.e., “learn”) because of task-irrelevant ideation. There has been
considerable research supporting this position in that high test anxious Ss
engage in more ruminative and self-deprecatory thinking (2, 5), as well as
negative affective experience in a classroom-type learning situation (8).
Overall, these results (a) cast doubt upon the notion of excessive body
movement as task-irrelevant test anxiety responses, and (b) indirectly
highlight the important role that internal task-irrelevant thinking has in
test anxiety theorization.


E. SUMMARY


Test anxiety did not correlate significantly with amount of observed body
movement in a classroom-type learning situation. As measured in the current
study, amount of body movement is therefore not indicative of task-irrelevant
test anxious responses. A possible reason for this finding may be that the
nature of the task-irrelevant response occurs principally at the internal
cognitive level rather than at the overt physical level.
DIFFERENTIAL CHANGE AMONG SENSITIVITY
TRAINING PARTICIPANTS AS A FUNCTION 
OF DOGMATISM* 


University of Arkansas at Little Rock and Memphis State University 


SYLVIA A. JOURE, ROLAND L. FRYE, BARBARA MEIERHOEFER,
AND ROBERT N. VIDULICH


A. INTRODUCTION 


Many variables influence sensitivity training: e.g. trainer behavior, orga- 
nizational climate, type of group, results desired. Another influential dimension 
about which little is known is concerned with the differences between the 
participants themselves. A study (7) of “sensors” and “intuitors” suggested 
that those individuals who gained information from their senses—e.g., what 
they could feel, touch, smell, or see—profited less from sensitivity training 
than those individuals who were classified as intuitors. The latter gained in- 
formation about their environment from logic, reasoning, and abstraction, as 
well as from sensory data. The intuitors gained much more from a sensitivity 
training experience, and the authors further suggested that another type of 
change device might be more beneficial for sensors. 

Other laboratory studies relate personality dimensions to success in training.
Miles (3) found that ego strength, flexibility, and need for affiliation
correlated with success in a T-group experience. In her review of the
literature on sensitivity training laboratories, Stock (8, p. 434-435)
concluded, “These findings converge on the idea that personality factors having
to do with receptivity, involvement, lack of defensiveness, and a certain kind
of energy or openness may be important facilitators of learning.” Thus, the
literature suggests that the individual differences of T-group
participants—e.g., cognitive style or personality dimensions—contribute
significantly to the benefit obtained by the participant from the experience.

Dogmatism is another individual difference characteristic that may influence
the outcome of sensitivity training experiences. Basic differences in
personality and life style, as theorized by Rokeach (4, 5), suggest that the
high dogmatic will not be as amenable as the low dogmatic to variables
necessary for a successful T-group experience. For example, high dogmatics are
more prone to threat and poor self-concept, and less susceptible to logical
ramifications within their belief systems (9), all of which are characteristics
prohibitive of the development of an environment of psychological safety
necessary for the effective use of the T-group method (6).

* Received in the Editorial Office on November 8, 1971, and published immediately

Consequently, it was hypothesized that (a) low dogmatic and high dogmatic
Ss would be differentially influenced by a sensitivity training experience,
and (b) the changes found would be supportive of Rokeach’s theory of
dogmatism.


B. METHOD


1. Subjects


Two hundred sixty-two male Memphis State University students enrolled
in introductory psychology were administered Rokeach’s Dogmatism Scale,
Form E (4). Those males who scored one standard deviation above or one
standard deviation below the mean of the group were approached and asked
to volunteer to participate in a T-group for which extra credit was given only
if the S came for all sessions of T-group and pre- and posttesting meetings.
Of the 122 students contacted, 31 volunteered. The 31 Ss were randomly
assigned to two experimental and two control groups as follows: Experimental
High Dogmatics (EHDs), N = 7; Experimental Low Dogmatics (ELDs), N = 9;
Control High Dogmatics (CHDs), N = 8; and Control Low Dogmatics (CLDs),
N = 7.


2. Procedure 


All Ss were pretested with the Sixteen Personality Factors Test (1);
Rokeach’s Value Survey, which contained two sets of values to be ranked
(5); and the Tennessee Self-Concept Scale (2), relabeled for administration


7 p.m. until 10 p.m. on each of two consecu-
the Exp group experienced a sii . 


were administered the posttests, T 
Concept Scale, the Value Sury 
mental groups took these tests 


differences between experimental groups as a function of intelligence,
C. RESULTS


The Ss’ characteristics were analyzed to ascertain any possible age, race, 
intelligence, or educational differences that could have influenced the results. 
No significant differences in these variables were found. 

Subject selection statistics were computed on Dogmatism Scale pretest
scores. The norm on the Dogmatism Scale for the 262 males tested was 150.23
with a standard deviation of 21.41. The mean of the high dogmatics was 1.17
standard deviations above the group mean (175.38), and the mean of the
low dogmatics was 1.17 standard deviations below the group mean (125.21).
The mean for the EHDs was 175.15; for the CHDs, 175.50; for the ELDs,
123.39; and for the CLDs, 127.00. With use of t tests, no significant differences
were found between the Dogmatism Scale means of the experimental and the
counterpart control groups on either pre- or postadministration of the scale.
When an experimental group’s pre-T-group Dogmatism Scale mean was
compared with its postexperience mean by t tests, there was no statistical
significance found.
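
The selection arithmetic can be retraced in a few lines. The sketch below takes the reported mean (150.23) and standard deviation (21.41), derives the one-standard-deviation cutoffs, and expresses each reported group mean as a z score. The helper `z_score` and the dictionary layout are assumptions made for illustration; they are not part of the original analysis.

```python
# Reported Dogmatism Scale statistics for the 262 males tested.
GROUP_MEAN = 150.23
GROUP_SD = 21.41


def z_score(x: float, mean: float = GROUP_MEAN, sd: float = GROUP_SD) -> float:
    """Distance of x from the group mean in standard deviation units."""
    return (x - mean) / sd


# Selection cutoffs: one standard deviation above or below the mean.
upper_cutoff = GROUP_MEAN + GROUP_SD
lower_cutoff = GROUP_MEAN - GROUP_SD

# Means reported in the text for the pooled and the four assigned groups.
reported_means = {
    "high dogmatics": 175.38,
    "low dogmatics": 125.21,
    "EHDs": 175.15,
    "CHDs": 175.50,
    "ELDs": 123.39,
    "CLDs": 127.00,
}
z_scores = {name: z_score(m) for name, m in reported_means.items()}

print(f"cutoffs: {lower_cutoff:.2f} and {upper_cutoff:.2f}")
for name, z in z_scores.items():
    print(f"{name}: z = {z:+.2f}")
```

Under these figures the pooled high and low means fall about 1.17 standard deviations from the group mean, in line with the values quoted in the text, and every assigned group mean lies beyond the one-standard-deviation cutoff.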

Before and after comparisons of the mean rankings of the Life Goals of the 
Value Survey were analyzed by t tests. The CHD, CLD, and EHD groups showed
no change on any of these values. The ELDs changed significantly (.05 level) on 
two values: Happiness (I) and Inner Harmony (J). In both cases, the values 
were seen as more important on the posttest than on the pretest. 

The Instrumental Values of the Value Survey were similarly analyzed for 
differences. As above, there were no changes on any of the mean 
value rankings for the control groups. For the EHDs, one scale, Cheerful (D), 
became significantly more important at the .05 level. For the ELDs, the Honest 
(I) value became more important at the .001 level of probability. 

The Tennessee Self-Concept Scale was analyzed by means of a three-way 
analysis of variance with repeated measures (10): one variable was the type 
of S, experimental or control; the dichotomous level of dogmatism was the 
second dimension; and the third, the pre-postadministration of the inventory, 
was the repeated measure. The only significant effect found was the interaction 
between the level of dogmatism and pre-postadministration. A posteriori t tests 
revealed that the mean self-concept score of the ELDs decreased from pre- to 
posttesting, and the reverse occurred for the EHDs. The net result was that 
the perceived self-worth of the EHDs and the ELDs became more similar at 
postsurveying than they were at preadministration time (see Figure 1). 




D. Discussion 
There was no change between pre- and postmeasures on the Dogmatism 
Scale; however, no change was expected. Dogmatism represents a relatively 
enduring cognitive style and should not change drastically, regardless of most 
experimental manipulations. 

FIGURE 1 
PRE- AND POSTTRAINING SCORES OBTAINED FOR EXPERIMENTAL LOW DOGMATIC Ss (ELDs) 
AND EXPERIMENTAL HIGH DOGMATIC Ss (EHDs) ON THE TENNESSEE 
SELF-CONCEPT SCALE (2) 
[Vertical axis: scores on the Tennessee Self-Concept Scale. Horizontal axis: 
times of test administration (pre training, post training).] 

The ELDs changed significantly on Happiness and Inner Harmony, both of 
which became significantly more important to them. As predicted, these are 
both values that pertain to a “truer” image of the self. 

The impressive shift on the instrumental value Honest implies that the ELD 
group accepted the T-group message of “openness and honesty.” The EHDs did 
not show any change on this variable. The posttest mean showed that they 
considered it slightly less important than before. The only instrumental value 
on which the EHD group changed was Cheerful, which could be construed 
as an affirmation of the importance of their keeping up a defensive “you can’t 
hurt me” front at all times. Two of the members of the group expressed their 
opinion that the T-group only served to confirm the impression of themselves 
that they already had. In consideration of the superficial nature of the group’s 
feedback and the poorer self-concepts (at least initially) of the high dogmatics, 
this does not seem to be a very honest interpretation. 

In light of the ELDs’ receiving more open and honest feedback and changing 
on more of the measured variables, the Tennessee Self-Concept Scale findings 
suggest that the ELDs experienced more “true” change during which their 
value systems underwent some disturbance. (ELDs saw themselves in a less 
positive light.) However, it might be conjectured that sometime after the 
T-group experience, the ELDs restructured their belief systems or changed 
their behaviors, which eventually resulted in more congruent values and, 
consequently, more positive self-concepts. 

In short, the ELDs did change more than the EHDs, and most of these changes 
were in the direction of a more honest self-appraisal. However, the number 
of scales on which change did occur was small when compared with the 
number of scales on which possible changes could occur; but there was no 
reason to expect changes on all the variables measured (for example, the 
values of National Security or Clean). 

This study produced support for the hypothesis that the personality variable 
of dogmatism (high dogmatics vs. low dogmatics) contributed to the differential 
effect of the sensitivity training experience. Also, the personality and 
value measures given to the high dogmatics and the low dogmatics, both 
experimental and control, which significantly differentiated between the two 
cognitive styles, were quite congruent with Rokeach’s formulations of 
dogmatism. 


E. Summary 


The purposes of this study were to isolate dogmatism as one possible variable 
contributing to differential change as a result of a T-group experience 
and to provide empirical evidence supportive of Rokeach’s theory of dogmatism. 
The subjects were divided into four homogeneous groups: experimental 
high dogmatics (EHDs), experimental low dogmatics (ELDs), and two similar 
control groups. The experimental groups underwent 12 hours of sensitivity 
training. Experimental and control Ss were tested on survey measures before 
and after the T-group. The control groups did not significantly change on any 
of the survey instruments. The ELDs changed on more values than EHDs. A 
self-concept measure suggested that the self-image of the ELDs decreased from 
pre- to posttest administration, while the reverse occurred for the EHDs. The 
differential change hypothesis was supported. The values that changed and 
the descriptive personality measures supported Rokeach’s formulations of 
dogmatism. 
EFFECTS OF MAGNITUDE OF SUCROSE AND 
SACCHARINE ON BODY WEIGHT* 


University of Melbourne, Australia; and University of Maine at Portland 


LAWRENCE WEINSTEIN AND VINCENT M. COLUCCI 


A. INTRODUCTION 


The few studies that have examined the effects of amount of sucrose and 
saccharine on body weight have reported that exposure to a large amount of 
sucrose significantly alters body weight as compared to a smaller amount of 
sucrose (e.g., 2), while experience with a large amount of saccharine has no 
significant effect on body weight as compared to a smaller magnitude of 
saccharine (e.g., 1). That is to say, ingestion of large amounts of sucrose 
tends to lead to heavier body weights than ingestion of smaller quantities of 
sucrose, while it appears that animals who eat a large amount of saccharine 
weigh as much as Ss who eat less saccharine. 

It would be of interest to ascertain the generality of these findings. That 
is to say, will solid sucrose and saccharine lead to the same weight differences 
as have been reported for liquid sucrose and saccharine? Furthermore, will 
sucrose concentrations have an effect on weight in situations other than the 
operant conditioning chamber? The present study examined the effects of 
magnitude of solid sucrose and saccharine on body weight. 


B. METHOD 
1. Subjects 


The Ss were 80 experimentally naive, male albino rats of the Wistar strain, 
200-250 gm in weight, 80-100 days of age at the start of the experiment, 
housed in individual cages. 


2. Apparatus 


Dixie cups which served as sucrose and saccharine cups were mounted on 
the inside of each of 80 experimental cages. 
3. Procedure 

Large and small amounts of sucrose consisted of 45 mg and 20 mg Noyes 
rat pellets, respectively. Noyes pellets of the same size served as the large and 
small amounts of saccharine. 

Twenty Ss were randomly assigned to each of four conditions: large sucrose 
(LSu), small sucrose (SSu), large saccharine (LSa), and small saccharine 
(SSa). 

Upon receipt from the breeder, all Ss were given seven days of continuous 
access to Purina Laboratory blocks and water while weights were recorded 
daily. Following 20 days of adaptation to a food deprivation schedule (labora- 
tory blocks in the home cage for one hour at the end of the experimental time, 
daily), Ss were given five days of training, in the experimental cages, to eat 
the saccharine and sucrose followed by 25 experimental sessions of 40 pellets 
each. Body weight was recorded before each session. Throughout the study Ss 
had access to blocks for one hour following the experimental period, and 
continuous access to water. 


C. RESULTS 


The mean percentage of ad lib. body weight (weight before each session/ 
ad lib. weight × 100) was examined in the analysis of the results. 
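
The dependent measure defined above can be written out as a short function. This is a minimal sketch; the function name and the example weights are hypothetical and are not data from the study.

```python
def percent_ad_lib_weight(session_weight_g, ad_lib_weight_g):
    """Body weight before a session as a percentage of ad lib. body weight."""
    if ad_lib_weight_g <= 0:
        raise ValueError("ad lib. weight must be positive")
    return 100.0 * session_weight_g / ad_lib_weight_g


# Hypothetical animal: 250 g under ad lib. feeding, 200 g before a session.
example_percent = percent_ad_lib_weight(200.0, 250.0)
print(example_percent)
```

Expressing each weight relative to the animal's own ad lib. weight puts rats of different starting weights on a common scale before the group means are compared.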

From Figure 1 it appears that (a) a large amount of saccharine resulted in 
heavier weights than a small quantity of saccharine, (b) the sucrose groups 
weighed the same, and (c) animals who ingested large amounts of saccharine 
weighed more than rats who ate large amounts of sucrose. 

The mean percentage of ad lib. body weight from sessions 1-25 differed 
significantly between the four groups by an analysis of variance, F(3, 76) = 
3.46, p < .05. By Newman-Keuls multiple comparisons, the difference between 
the sucrose groups was not reliable (p > .05), while the differences 
between the saccharine conditions and between LSa and LSu were each 
statistically significant (p < .05). 
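
The degrees of freedom of the reported F ratio follow directly from the design of four groups of 20 Ss. The sketch below assumes a simple one-way between-groups analysis on one mean score per animal, which is what the reported degrees of freedom imply; the function name is illustrative.

```python
def one_way_anova_df(group_sizes):
    """Return (between-groups df, within-groups df) for a one-way design."""
    n_groups = len(group_sizes)
    n_total = sum(group_sizes)
    return n_groups - 1, n_total - n_groups


# Four conditions (LSu, SSu, LSa, SSa) with 20 rats each.
df_between, df_within = one_way_anova_df([20, 20, 20, 20])
print(df_between, df_within)
```

The result, 3 and 76, matches the F(3, 76) given in the text.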


D. Discussion 


The finding that the ingestion of a large amount of saccharine had a reliable 
effect on body weight, as compared to a smaller amount of saccharine, 
disagrees with Collier and Novell (1). Perhaps different amounts of solid 
saccharine were not neutral in their effects on subsequent food ingestion, while 
different concentrations of liquid saccharine led to equivalent amounts of food 
eaten. This would account for the different results between liquid (e.g., 1) 
and solid saccharine studies. The question is open for empirical examination. 

The demonstration of no significant body weight differences with different 
amounts of sucrose disagrees with Dunham and Kilps (2). The discrepant 
findings may be due to the different amounts and types of sucrose used as 
compared to the present investigation. Dunham and Kilps trained animals 
under a 32% and an 11.3% sucrose solution, whereas the present study 
employed 45 mg and 20 mg solid sucrose. 

The finding that a large amount of saccharine resulted in heavier rats 
than an equivalent amount of solid sucrose suggests that animals ate more as 
a consequence of experience with saccharine as compared to sucrose. The 
present results contradict the popular belief at the human level that solid 
saccharine, tablets or granular, leads to a loss in body weight as compared to 
a comparable amount and type of sucrose. 

It would appear that the constituency of sucrose and saccharine reinforce- 
ment is one variable that determines the effects the rewards have on body 
weight. 


E. SUMMARY 


The consumption of large and small amounts of solid saccharine produced 
a reliable difference in body weight, while the intake of larger and smaller 
quantities of solid sucrose resulted in no body weight difference. In addition, 
a large quantity of saccharine produced Ss who weighed more than animals 
who ate the same amount of sucrose. These results contrast with previous 
research and popular belief, and suggest that solidity (i.e., solid or liquid 
state) of reward controls the effects of magnitude of saccharine and sucrose 
on body weight. 


REFERENCES 
1. Collier, G. H., & Novell, K. Saccharin as a sugar surrogate. J. Comp. & 
Physiol. Psychol., 1967, 64. 

2. Dunham, P. J., & Kilps, B. Shifts in magnitude of reinforcement: Confounded 
factors or contrast effects? J. Exper. Psychol., 1969, 79, 373-374. 


c/o John Babchuk 
Box 308 
Willingdon, Alberta 
Canada 




Published as a separate and in The Journal of Psychology, 1972, 80, 161-165. 


NEGATIVE CONTRAST WITH HUMANS AS A 
FUNCTION OF EMOTIONALITY* 


University of Melbourne, Australia 


LAWRENCE WEINSTEIN* 


A. INTRODUCTION 


Negative contrast effects, in which the level of performance of a group of 
Ss who receive a reduction in incentive magnitude falls significantly below a 
low reward control group, are well documented (e.g., 3). A number of 
investigations have demonstrated that negative contrast effects with infrahuman 
organisms are not obtained under the influence of pharmacological agents 
which presumably attenuate, or eliminate, such aversive motivational states 
as fear or frustration (e.g., 4, 5). Indeed, many investigators maintain that 
downward contrast in animals is a result of an aversive emotional response, 
such as frustration, which elicits responses that compete with the instrumental 
response, impair behavior, and so produce negative incentive contrast effects 
(e.g., 1, 2, 7). 

Little attention has been directed to negative contrast at the human level. 
The present investigation examined the notion that human negative contrast 
effects are due to an aversive motivational response, such as frustration. If a 
reduction in incentive magnitude produces frustration responses that impair 
behavior and result in downward contrast, one might expect that individuals 
who are more sensitive to the impairing effects of frustration and more emo- 
tional would exhibit larger negative incentive contrast effects than people 
who are less emotional. Such data would be consonant with the idea that 
negative contrast with humans is caused by some aversive motivational state. 
The present experiment examined the effects of a reduction in quantity of 
reinforcement in high and low emotional individuals. 


B. METHOD 
1. Subjects 


On the basis of scores from the Rosenzweig Picture Frustration Test (6) 
30 males and 30 females were selected who were not highly sensitive to the 
effects of frustrating events and who scored below the overall norm of 67; 
30 females and 30 males were also chosen who were relatively less resistant 
to the emotional consequences of frustrating situations and who scored above 
67. According to Rosenzweig (6), individuals who score higher than the over- 
all group norm tend to be more anxious and emotional than people who score 
below the norm. The 60 male and 60 female Ss were undergraduate students 
enrolled in an introductory psychology course at the University of Maine, 
Portland. Ten high frustration males and 10 high frustration females were 
randomly assigned to each of three high frustration groups; the same number 
of low frustration males and females were randomly assigned to each of three 
low frustration groups. 


2. Apparatus 


The materials consisted of a Kodak Carousel 750 slide projector 139.7 mm 
from a 152.4 mm square piece of cardboard which served as a screen, 50.8 × 
42.6 mm slides with digits printed on (i.e., 126 × 3), and a stopwatch. 


3. Procedure 


Each S worked the same sequence of the same 20 mental multiplication 
problems. One correct or incorrect answer, or 60 seconds, whichever came first, 
was allowed for each problem, with 4 seconds between problems. 

The problems were worked in six situations, as follows: 

(a) Subjects who scored highly sensitive to frustration (high frustration Ss) 
received 5¢ after correctly or incorrectly answering the 2nd, 3rd, 4th, 6th, 
9th, 10th, 11th, 12th, 15th, 17th, and 18th problems (5-5HF). 

(b) Another group of high frustration Ss received 50¢ through problem 10 
and then experienced a reduction in incentive size to 5¢ from problem 11 to 
the end of the session (50-5HF). 

(c) Twenty high frustration Ss received no reward throughout the preshift 
phase (NHF). 

(d) Some low frustration individuals experienced 50¢ through problem 
10 and then were maintained on 5¢ (50-5LF). 

(e) Other low frustration individuals received 5¢ throughout the session on 
the previously designated trials (5-5LF). 

(f) And, finally, a group of low frustration Ss received no reward for the 
first 10 problems, the preshift phase (NLF). 
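
The reward schedules for the shifted and unshifted groups can be laid out explicitly. The following is a minimal sketch under one stated assumption: that the 50-5 groups were rewarded on the same designated problems as the 5-5 groups, which the text states only for the 5-5 conditions. The names are illustrative, and the no-reward groups are not modeled because their treatment after problem 10 is not described here.

```python
# Problems after which money was given, as listed for the 5-5 groups.
REWARDED_PROBLEMS = (2, 3, 4, 6, 9, 10, 11, 12, 15, 17, 18)
SHIFT_PROBLEM = 11   # first problem of the postshift phase
N_PROBLEMS = 20


def reward_cents(problem, preshift_cents, postshift_cents=5):
    """Reward in cents following a given problem for one schedule."""
    if problem not in REWARDED_PROBLEMS:
        return 0
    return preshift_cents if problem < SHIFT_PROBLEM else postshift_cents


def session_total(preshift_cents):
    """Total cents received over the 20 problems for one schedule."""
    return sum(reward_cents(p, preshift_cents) for p in range(1, N_PROBLEMS + 1))


total_5_5 = session_total(5)     # unshifted control schedule
total_50_5 = session_total(50)   # schedule shifted from 50 cents to 5 cents
print(total_5_5, total_50_5)
```

Under this assumption both schedules are identical from problem 11 onward, so any postshift difference in latency between a 50-5 group and its 5-5 control reflects the earlier experience with the larger reward.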

Subjects were read the following instructions: “This is an experiment in 
abstract problem solving, the ability to rapidly work problems involving 
abstract reasoning. You will be given some problems to work. Each one consists 
of a three-digit number multiplied by a one-digit number. You are, without 
pencil and paper, in your head, to multiply the numbers as quickly as you can 
and tell me your answer. You will receive money after some answers. The 
speed and accuracy with which you answer, relative to other Ss, will earn 
you more money. In other words you are being rated against other Ss.” 

The E, seated beside the S, manually administered the nickels and half 
dollars, operated the slide projector, and recorded latencies with the stopwatch. 
As little as possible was said to S between problems. 


C. RESULTS 


Latency means (the time between slide onset and the first correct or in- 
correct answer) were examined in the analysis of the results. 

From Figure 1 it would appear that the N groups each reacted more slowly 
than all other Ss, 50-5LF Ss reacted more quickly than 5-5LF, and 50-5HF 
Ss answered more quickly than 5-5HF. The mean latency from problems 1-10 
differed significantly among the six groups by an analysis of variance, F(5, 
114) = 3.35, p < .01. By Duncan’s multiple comparisons the differences 
between NLF and each low frustration group, between 50-5LF and 5-5LF, 
between 50-5HF and 5-5HF, and between NHF and each high frustration 
group were each statistically significant (p < .05). 

Figure 1 indicates that on problem 14 the 50-5HF Ss took longer to answer 
than the 5-5HF group (negative incentive contrast effects), while Ss in the 
50-5LF condition took less time to respond than individuals in the control, 
5-5LF, condition. The mean latency from problems 14-20 differed significantly 
among the four groups by analysis of variance, F(3, 76) = 2.85, p < 
.05. By Duncan’s comparisons the difference between 50-5HF and 5-5HF 
was significant (p < .05), while the difference between 50-5LF and 5-5LF 
was not statistically reliable (p > .05). 


D. Discussion 


The findings that for problems 1-10 the high frustration N group responded 
more slowly than all other high frustration Ss, and the low frustration N Ss 
reacted more slowly than Ss in other low frustration conditions, indicate that 
5¢ and 50¢ served as reinforcing events for high and low frustration Ss. 
Moreover, for high and low frustration conditions the levels of reward, 50¢ 
and 5¢, were discriminably different in that 50¢ produced a higher level of 
behavior, a shorter latency, than 5¢. 

The experiment demonstrated that a decrement in magnitude of reward 
produced negative contrast effects in individuals whose test scores indicated 
relative emotionality, while the downward contrast effect was not obtained 
in people who were, according to the frustration index, relatively unemotional. 
As already discussed, these results are compatible with the notion that negative 
incentive contrast effects with human Ss are due to some aversive motivational 
state, such as frustration. 

The present demonstration of downward contrast is in accord with most 
animal and human research that has reduced incentive quantity (e.g., 3, 8, 9). 
OF HEIM’S AH4 TEST* 

Pahlavi University, Shiraz, Iran 

A. INTRODUCTION 

Over the past 15 years or so, Persian adaptations of American and British 
tests have been widely, if not always wisely, used for selecting university 
students and for other practical purposes in Iran. In fact one semiacademic 
commercial institute has claimed to have adapted over 200 tests (1). Yet, sur- 
prisingly little has been published as to the relative performance of such tests 
in the context of Iranian culture. Besides a few exploratory studies by Western 
psychiatrists (20) only one study (17) is known to the present authors which 
has reported on the performance of a relatively well sampled group of Iranian 
senior secondary school pupils on a Persian adaptation of Heim’s AH5 test of 
high-grade intelligence. Two further studies with the same test (14, 18) 
have produced evidence supporting the psychometric soundness of the Persian 
AH5 and its general value as a research instrument in the Iranian cultural 
context. 

The present paper reports data obtained with a Persian adaptation of the 
AH4 scale (8) given to a large sample of secondary school pupils in an urban 
center in Southern Iran. 

B. METHOD 

Subjects were 3120 pupils, 1779 boys and 1341 girls, aged between 13 and 
19 years, attending grades 7 to 12 of several secondary schools in Shiraz, the 
capital of the Southern Iranian province Fars. The overwhelming majority 
of subjects were Persian speaking Iranian nationals, mostly born and brought 
up in the city of Shiraz, and of Muslim parents. 

Iranian secondary schools, although much less restricted in their admission 
policy, are similar to British grammar schools in both objectives and outlook. 
They represent the overwhelming form of middle education open to Iranian 
children, and everyone who successfully completes the six years of secondary 
education is theoretically eligible to enter a university. In practice, however, 
secondary school graduates have to sit a highly competitive entrance examina- 
tion to get a university place. 

The curriculum of secondary education falls into two equal parts. In the 
first three years, grades 7 to 9, all pupils receive the same kind of general in- 
struction in arts, sciences, and a foreign language. From the tenth grade the 
studies are divided into three main branches: literature and humanities; 
mathematics and physical sciences; and natural sciences and biology. These 
three branches correspond to various specialized courses at the university level. 
Although, theoretically, the choice of any branch depends on the pupil himself, 
only those with a minimum score in the main courses of each branch are 
permitted to enter that branch. Because of higher pay and status enjoyed by 
the engineering, medical, and scientific professions, there is a great deal of pres- 
sure, both social and parental, on the more intelligent pupils to opt for the 
mathematics and natural science branches. Moreover, the standards of teach- 
ing are better established and applied in these branches than in the literary 
branch. Although no objective study of the relative standing of the three 
branches in terms of intelligence is available, it would seem to be generally 
assumed that the most intelligent and serious students go to the mathematics 
and natural science branches, and that a sizeable proportion of those entering 
the literary branch do so not because of any particular talent or interest in arts 
and humanities, but because they were not able to follow any other outlet for 
their wish to finish secondary education and enter university. 

The selection of subjects was made in terms of schools with known intake 
areas, so that the sample could be regarded as fairly representative of the 
socioeconomic composition of the secondary school population of the city of 
Shiraz. The original research design envisaged coverage of a minimum of 
200 boys and 200 girls in each grade. Due to the fact, however, that sampling 
was based on schools and, within each school, on total classes, the inclusion of 
equal numbers of subjects in each grade level did not seem feasible or desirable. 
Moreover, because of the separate fields of specialization after the ninth grade 
and in view of the known differences among those following the three branches 
of specialization (13), the number of subjects had to be increased at grades 
10-12 so as to include a reasonably large number of each branch of specializa- 
tion. Again, because of the varying number of subjects enrolled at different 
branches, it was not possible to include an equal number of subjects in each 
of the three areas of specialization. This was particularly difficult because 
of the small number of subjects studying literature and humanities in Shiraz 
schools and the limited number of girls attending the mathematics branch. 

The test was a Persian adaptation of the AH4 scale, a group test of intelli- 
gence developed by Dr. A. W. Heim (8) of the Cambridge Psychological 
Laboratory, widely used for research and practical purposes in Britain (11). 

Essentially a speed test, the AH4 consists of two separately timed parts, 
one verbal-numerical and one perceptual, each containing 65 problems. Each 
part is preceded by several examples which are carefully explained and solved 
by the tester before the main body of the test is attempted. Subjects are given 
10 minutes to do each part of the test. The Persian adaptation has been pre- 
pared by the senior author, and a deliberate attempt has been made to keep it 
as close to the original English version as possible. 

Subjects were tested in groups of varying sizes. Care was taken to make 
sure that the subjects understood the purpose and procedure of the test and 
knew how to record their answers in the separate answer sheet distributed 
prior to the test. An attempt was also made to secure subjects’ interest and 
motivation by emphasizing the similarity between this test and other tests that 
they might have to take for academic and other selection purposes. 

Prior to the test, a personal data sheet was filled in by all subjects. This 
included information on subjects’ family background, academic achievement, 
interests, and self-concepts. The relationship between these variables and test 
results will be presented in future communications. As required by the manual, 
subjects were given 10 minutes to complete each part of the test. 


C. RESULTS 


Table 1 summarizes the means and standard deviations achieved by different 
grades separately for boys and girls. This table also gives the means and 
standard deviations of the wrong responses to each part of the test. Correlations 
between the two parts of the test and each part with the total score are set 
out in Table 2. The percentage cumulative distributions of the total score 
are given in Table 3. 

Table 4 gives the means and standard deviations of grades 10-12, classified 
in terms of the branch of study, for boys and girls. The main findings of the 
study are as follows: 


1. Age Differences 


From Table 1 it would appear that, as expected, the average test performance 
of subjects progressively increased from grade 7 (age range 13-14) to 
grade 12 (age range 17-18). Girls, on the whole, would appear to show a 
more regular and much smoother increase than boys. The difference in mean 
performance of the youngest and oldest is only slightly higher for boys (29.51) 
than for girls (28.89). The relative sizes of differences in mean test score 
between the adjacent age groups are seen to vary for the two sexes: for boys the 
largest and smallest differences are between grades 7-8 and 9-10, while for 
girls the largest and smallest differences are observed between grades 8-9 and 
9-10, respectively. Thus there would seem to be no appreciable or significant 
difference in mean intelligence scores between boys attending the eleventh and 
twelfth grades, and between boys and girls enrolled in the ninth and tenth 
grades. 

TABLE 2 
CORRELATIONS BETWEEN THE TWO PARTS OF THE AH4 AND EACH PART WITH THE 
TOTAL SCORE FOR SIX GROUPS OF IRANIAN BOYS AND GIRLS SEPARATELY 

                      Boys (grades)                   Girls (grades) 
r between         7    8    9   10   11   12       7    8    9   10   11   12 
Pt. I-Pt. II     59   62   --   --   --   --      72   51   48   70   66   37 
Pt. I-Total      74   --   --   --   --   --      81   86   61   76   91   49 
Pt. II-Total     78   58   58   81   73   75      79   87   89   84   --   -- 

Note. Decimal points omitted; -- marks an entry that is not legible. 


2. Differences Between the Two Parts of the Test 


It seems, from the sizes of its means, that the nonverbal part II of the test 
was found easier than part I by Iranian subjects. Boys scored higher on this 
part of the test at all but one grade (eleventh grade), while girls achieved 
higher means on this part of the test at grades 7, 8, and 9. It is also evident from 
Table 1 that at all grade levels subjects registered a much larger number of 
incorrect responses to part II. 

The correlations between the two parts of the test also varied from one age 
group to the other (Table 2). On the whole, the two parts would seem to be 
more closely correlated in the case of girls and, probably, the younger and less 
intelligent groups. This same trend is observable in Table 4, where girls and 
those following the less intellectually demanding literary branch produced 
higher correlations between the two parts of the test. The correlations fall 
somewhat short of those reported by Heim (8) for the original English version 
of the test (.70 to .81). Heim’s British groups also scored higher on part II 
of the test. Heim (8) has argued that this may be partly due to the familiarity 
gained with the earlier given part I of the test. 

3. Sex Differences 


Comparison of the upper and lower halves of Table 1 shows that boys 
achieved higher means than girls at grades 8, 9, 10, and 11. Girls, on the other 
hand, excelled boys at grades 7 and 12. The differences are seen in both verbal 
and perceptual parts of the test. The overall mean of the boys, 47.53, is signifi- 
cantly higher than that of girls, 41.47 (p < .01). From Table 4 it would 
appear that even among the three specialized groups of grades 10-12, girls 
scored lower than boys in all but one instance (natural science twelfth grade) 
in terms of the total score. Girls following the mathematics branch of grades 
10 and 11 and the natural science branch of grades 10 and 12 also scored 
better than their male counterparts on the first, verbal, part of the test. 


4. Differences Among the Three Fields of Specialization 


From Table 4 it would also appear that boys and girls following different 
branches of specialization scored differently on both parts of the test. For boys, 
those following the mathematics branch scored the highest at all three grades, 
followed by natural science and literary branches, respectively. This same 
trend is observed in the case of girls at grades 10 and 11. At the twelfth grade 
level, however, the natural science girls excelled their counterparts in the 
mathematics branch. On the whole, the two science branches are much more 
similar to each other and present a greater measure of divergence from the 
literary branch. The findings are well in line with both general expectation 
and findings of similar investigations employing different intelligence scales 
(6, 10, 17). 


5. Comparisons with British Norms 


Table 1 demonstrates the generally low performance of Iranian subjects on 
both parts of the test. Since each part of the test consists of 65 items, it is 
obvious that the mean scores of Iranian groups hardly reached half the number 
of items per each part of the test. To bring out the depressed level of Iranian 
subjects more clearly, a comparison with certain British groups might be of 
interest. For a group of Naval Ratings (N = 1183) employed as representative 
of the British general adult norms, total mean score is reported as 75.23 
(SD = 14.58), only 10 percent having scored 57 or below. Comparable means 
for groups of 14-year-old secondary modern (N = 565) and grammar school 
(N = 533) children are 59.51 (SD = 19.50) and 85.26 (SD = 11.21), 
respectively. Within these latter groups the bottom 30 percent scored below 
50 and 78, respectively. In comparison with these figures, the score of Iranian 
subjects would seem to be low indeed. As mentioned earlier, Iranian secondary 
schools are more in the pattern of British grammar schools, their main function 
being to prepare pupils for higher studies. In view of this and the 
seemingly easy nature of items included in the AH4 test, the differences be- 
tween the Iranian and British groups are surprisingly large. Similar large 
differences between Iranian and Western norms have been found with Persian 
adaptations of other tests (10, 16, 17). 


6. Reliability and Practice Effect 


As the AH4 is basically a speed test, its reliability cannot be estimated 
through the ordinary split-half and internal consistency methods (5). Test- 
retest reliability coefficients are available for two groups of girls attending 
seventh grade (N = 47) and tenth grade natural science branch (N = 60). 
They were retested after an interval of three months. Results are summarized 
in Tables 5 and 6. Two important points emerge from Table 5. On the one
hand, while the reliability of the total test score is sufficiently high, the temporal
stability of the individual parts is far from satisfactory. This is particularly
true in the case of the younger age group and the verbal-numerical part of the
test.


TABLE 5
TEST-RETEST DATA ON THE PERSIAN AH4 FOR TWO GROUPS OF GIRLS

                               Grade 7 (N = 47)            Grade 10 (N = 60)
Measure                     Pt. I   Pt. II   Total      Pt. I   Pt. II   Total
Test
  Mean                      11.34   15.02    26.36      24.40   25.32    49.72
  SD                         6.36    6.60    12.00       7.33    7.12    12.93
Retest
  Mean                      18.47   21.49    39.96      32.05   31.75    63.80
  SD                         6.78    7.69    13.38       7.94    7.38    13.27
Test-retest difference       7.13    6.47    13.60       7.65    6.43    14.08
Test-retest correlation       .58     .65      .72        .67     .77      .81


On the other hand, at both ages, subjects demonstrated a substantial gain
in their mean scores over the test-retest interval. The gain amounts to just
over one standard deviation unit of subjects’ respective original scores, the
relative amount of gain remaining more or less constant over both parts of the
test and from one group to the other. In comparison with the mean values of
various age groups in Table 1, the repetition of the test after a three-month
interval would seem to have resulted in a rise of over two years in mental age
units. In both groups there was a trend for the greatest amount of gain to be
achieved by those with the lowest test scores and vice versa (Table 6). This
is mainly responsible for the lower reliability of the test. A similar trend has
been reported for the Persian form of the AH5 (18).


TABLE 6
MEAN AH4 TEST SCORE FOR THREE LEVELS OF TEST-RETEST GAIN

                          Test-retest difference
Group            Negative or       Moderate        Large
and              no gain           gain            gain
test             (-11 to 0)        (1 to 8)        (9+)

Grade 10
  Part I
    Mean           27.12            25.00          23.11
    N                  8               24             28
  Part II
    Mean           29.44            26.70          22.74
    N                  9               24             27
Grade 7
  Part I
    Mean           17.40            12.06           9.64
    N                  5               17             25
  Part II
    Mean           20.43            13.52          14.68
    N                  7               21             19
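
The size of the practice effect can be checked directly from the total-score figures in Table 5. The following is only an illustrative sketch: the function name and the dictionary layout are assumptions introduced here, and the numbers are the total-score means and SDs transcribed from the table.

```python
# Total-score figures transcribed from Table 5:
# (first-test mean, first-test SD, retest mean).
# The dictionary layout and the function name are illustrative assumptions.
TABLE_5_TOTALS = {
    "Grade 7": (26.36, 12.00, 39.96),
    "Grade 10": (49.72, 12.93, 63.80),
}


def gain_in_sd_units(test_mean, test_sd, retest_mean):
    """Express the test-retest gain in units of the first-test SD."""
    return (retest_mean - test_mean) / test_sd


for group, (test_mean, test_sd, retest_mean) in TABLE_5_TOTALS.items():
    gain = gain_in_sd_units(test_mean, test_sd, retest_mean)
    print(f"{group}: gain = {gain:.2f} SD")
```

Both ratios come out slightly above 1, which agrees with the statement that the total-score gain is just over one standard deviation of the original scores.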


7. Validity 


Besides such indirect evidence of validity as differentiation between groups 
varying in age and field of specialization, some direct evidence of validity is 
also available. This consists of correlations between the AH4 scores and 
average school marks of subjects in the academic year immediately preceding 
the test. Thus, for a group of 45 eleventh grade girls, AH4 scores correlated
.35, .32, and .34 with the average mark for the previous year, and -.51, -.35,
and -.47 with past academic failure as reported by subjects themselves.
Similarly, for a group of 85 twelfth grade boys, the three AH4 scores cor- 
related .27, .15, and .30 with the average mark earned in the previous grade. 
These correlations, while too low for practical purposes, indicate that the
Persian AH4 measures something with certain relevance for academic performance
within the framework of the Iranian secondary school system.


D. DISCUSSION AND CONCLUSIONS 


The main findings of this study are well in accord with similar results obtained
with other tests given to different groups of Iranian subjects. The more
or less normal distribution of test scores, a rise in average performance with 
increasing age, and the differential outcome of the two parts of the test are 
all in line with Heim’s original findings, and lend support to the conceptual
validity of the AH4 test in its new format. The ability of the test to discriminate
between the three groups of specialization with known differences in
intelligence level (17) and certain other aspects of personality (15) provides
further evidence as to the practical validity of the Persian AH4. To these 
must be added the already presented data on reliability and validity which, 
although meager, are in the expected direction. This part of the evidence needs 
little elaboration. There are a few other points, however, which require some 
explanation. 

To start with, the surprisingly depressed scores of Iranian subjects in comparison
with British norms are well in line with similar results reported for
Persian forms of the AH5 scale (17), the Primary Mental Abilities test (6), 
the Otis test (9), and some General Aptitude Test Battery scales (16). In 
view of the easy nature of the AH4 test and its obvious dependence on infor- 
mation and skills common to formal schooling, the observed differences between 
Iranian and British norms cannot be attributed to the cultural content of the 
test. For the overwhelming majority of subjects, however, this experiment 
provided the first opportunity to take a Western type intelligence test under 
strictly controlled conditions. The use of a separate answer sheet was no doubt 
a further impediment. The size of gain in retesting after an interval of three 
months (Table 5) clearly substantiates the role played by lack of test sophisti- 
cation in lowering the performance of Iranian subjects. The time restriction 
imposed on subjects provides another possible explanation. Mehryar and 
Shapurian (18) have previously commented on the slower performance of 
Iranian subjects on tests and questionnaires, and have offered some explanation 
in terms of both the lower tempo of life in Iranian society and the traditional 
emphasis of Iranian culture on the stylistic aspects of writing and reading as 
against their speed, communicative efficiency, and precision. The speed element 
is particularly relevant for the AH4 test results because of the narrower time 
limit associated with this test. In fact a few experiments have indicated that 
subjects gain substantially under more generous time limits (15 to 20 minutes). 

There is, however, some evidence that speed and test familiarity by them- 
selves do not account for the lower scores of Iranian subjects. In the present 
study, for instance, although tenth graders (age 16 years) taking the test for 
the second time scored higher than their twelfth grade counterparts, their new 
mean, 63, still falls short of the mean of 14-year-old British grammar school 
children, 85.26. The reason for this difference should probably be sought in 
the earlier experiences of Iranian children and their relative deprivation of 
the kinds of cultural stimulation that are known to go with higher intellectual 
performance in the West (21). 
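
To make the remaining gap concrete, the retest mean of the tenth graders can be expressed as a standard score relative to each British group quoted in Section 5. This is a minimal sketch under stated assumptions: the helper name and the constant names are hypothetical, and only means and SDs reported in the text and in Table 5 are used.

```python
# British norm figures quoted in the text: (mean, SD) of AH4 total scores.
BRITISH_NORMS = {
    "Naval Ratings": (75.23, 14.58),
    "Secondary modern, age 14": (59.51, 19.50),
    "Grammar school, age 14": (85.26, 11.21),
}

# Table 5, Grade 10 girls, total score at retest.
IRANIAN_GRADE_10_RETEST_MEAN = 63.80


def standard_score(score, norm_mean, norm_sd):
    """Distance of a score from a norm group's mean, in that group's SD units."""
    return (score - norm_mean) / norm_sd


for group, (norm_mean, norm_sd) in BRITISH_NORMS.items():
    z = standard_score(IRANIAN_GRADE_10_RETEST_MEAN, norm_mean, norm_sd)
    print(f"{group}: z = {z:+.2f}")
```

On these figures the retest mean still lies close to two grammar-school standard deviations below the grammar-school mean, although it slightly exceeds the secondary modern mean.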

The superior performance of Iranian boys in comparison with Iranian girls 
is also supported by results of other studies (6, 9, 17). Bash (2, 3) has 
reported similar sex differences for his groups of illiterate villagers and tribes- 
men in Southern Iran. Even among young school children, boys have been 
found to surpass girls of their age and educational level on several of Piaget's 
conservation tests (4). Only in one study, using the Goodenough-Harris
Drawing Test (7), have girls aged between 10 and 13 been found to excel boys
(19).

The inferior performance of Iranian girls is the more surprising in view of 
the fact that girls’ schools, particularly at the secondary level, include a much 
better selected sample in terms of both socioeconomic background and academic 
performance. It is a general observation that only girls from better educated 
families and/or with apparent academic talent are encouraged to continue 
their studies, if at all, to the level covered by this study. This is due partly to 
the lingering doubts as to the desirability of formal education beyond the 
rudiments of literacy for women on the part of the undereducated lower class 


families will undertake any financial sacrifices for the sake of keeping their 


ly moderated 
and above such 
gross indicators of environmental variation as parental education and occupation,
the underlying causes should most probably be sought in the differential
expectations of and attitudes to boys and girls inherent in the traditional
culture of the Iranian family.


CLASS VERSUS RACE DISCRIMINATION 
ATTRIBUTED TO SELF 
AND OTHERS* 1


Department of Psychology, Hanover College 


ROGER L. TERRY AND JANE E. EVANS 


A. INTRODUCTION 


One of the most difficult areas in which to conduct valid research is that 
of interracial attitudes. It has long been apparent, for example, that a person’s 
questionnaire responses cannot be immediately assumed to be predictive of an 
overt, behavioral expression of his attitude (5). Earlier research indicated 
that behaviorally manifested attitudes were less racially discriminatory than 
verbally expressed attitudes, but because of greater social pressures against 
discrimination today, one might expect this relationship to be reversed; that is, 
few persons would now confess to discriminating along race lines, although 
there are numerous behavioral indications of racial discrimination. Until our 
measurement technology greatly improves, this problem will remain. 

Another difficulty in the area of interracial attitudes is that since race is an 
organismic variable, it does not lend itself to easy experimental manipulation. 
Because race is correlated with many other variables, it is difficult to ascertain 
if a “racial” attitude is, in fact, a function of race or of these correlated, yet 
often unmeasured, variables. For example, several studies, following the lead 
of Rokeach, Smith, and Evans (7), indicate that belief similarity is a more 
important determinant of interpersonal attraction than racial similarity (4). 

Without question, the variables of race and socioeconomic status are closely 
correlated in our society; yet few studies have attempted to isolate the effects 
of these two variables on attitudes. Clark and Clark’s (2) famous study 
indicated that very young (Negro) children do demonstrate racial discrimina- 
tion, while Radke and Trager (6) demonstrated further that in a doll-play 
situation (white) children attribute lower class attributes to Negroes. These 
two studies demonstrate that even young children are aware of race-class 
interrelationships (3). One attempt (1) to vary race and class orthogonally
did find the social class variable to be a more important determinant of social
stereotypes than the race variable.

* Received in the Editorial Office on November 12, 1971, and published immediately
at Provincetown, Massachusetts. Copyright by The Journal Press.

1 This study was partially funded by a grant from the Research Committee, Hanover
College, Hanover, Indiana.

The present study was designed to obtain independent measures of class 
and race discrimination and to determine if these measures vary with race. 
We were also interested in whether Negroes and whites differ in their attribu- 
tions of class and race discrimination to other persons of the same and dif- 
ferent racial group. 


B. METHOD 
1. Subjects 


The final sample consisted of 50 Negro and 50 white high school students 
from Louisville, Kentucky. Questionnaires, distributed to volunteers by study 
hall monitors, were returned by 210 students, but 58 had to be discarded 
immediately for various reasons—e.g., incomplete or inappropriate responses. 
There remained responses from 50 Negroes and 102 whites; to equalize the 
sizes of the racial groups, the responses from 50 white students were randomly 
selected. 


2. Procedure 


The questionnaires were composed of 20 descriptive statements of the form,
“____ people. . .”: e.g., “____ people are not willing to do a hard day’s
work,” and “____ people are more prejudiced than most people realize.” The
respondent gave three endorsements of each statement as it was applied to
“Upper class,” “Black,” “Lower class,” and “White” people (presented in
this order), by checking a four-point Likert scale of agreement. The endorsements
were to reflect how the respondent himself felt about the statements,
how “the majority of Blacks would answer,” and how “the majority of whites
would answer.” The order in which these endorsements were made was
randomized. Respondents thus provided 12 scores for each of the 20 statements:
three different endorsements of descriptions of four categories of people.

From these data, class and race discrimination scores were generated for
each of the endorsements. A respondent’s class discrimination score was computed
by summing the 20 absolute differences between his amounts of agreement
to descriptions of “Upper class people” and “Lower class people.” A
respondent’s race discrimination score was similarly computed by summing
the 20 absolute differences between his amounts of agreement to descriptions
of “Black people” and “White people.” These two scores could range between
zero (no difference on any statement) and 60 (maximal or three points difference
on every statement). (Note that discrimination is treated simply as a
differential response without any necessary implication of positive or negative
prejudice.)
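
The scoring rule just described can be sketched in a few lines. This is an illustration rather than the authors' own scoring procedure: the function name is hypothetical, and the coding of the four-point agreement scale as the integers 1 to 4 is an assumption made here (any equally spaced coding gives the same zero-to-60 range).

```python
def discrimination_score(ratings_a, ratings_b):
    """Sum of absolute differences between two sets of agreement ratings.

    Each argument holds one rating per statement (assumed coded 1 to 4),
    e.g. ratings for "Upper class people" and for "Lower class people".
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both rating lists must cover the same statements")
    return sum(abs(a - b) for a, b in zip(ratings_a, ratings_b))


# Hypothetical respondent who answers every statement at opposite ends
# of the scale for the two categories: the maximum score of 60.
upper_class = [1] * 20
lower_class = [4] * 20
print(discrimination_score(upper_class, lower_class))

# Identical ratings for both categories give the minimum score of 0.
print(discrimination_score([2] * 20, [2] * 20))
```

Because absolute differences are summed, the score registers how differently two categories are described and not whether the difference favors either one, in line with the note that discrimination is treated simply as a differential response.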


C. RESULTS 


Mean class discrimination scores (the larger the score, the more discrimina- 
tion) are presented in Table 1. An analysis of variance, accounting for repeated 
measures (8, p. 302 ff.), revealed a significant main effect of Endorsement, 
indicating that both Negro and white respondents attributed more class discrimination
to other persons of either race than to themselves (F = 7.42;
df = 2, 196; p < .01). There was a nonsignificant tendency, as reflected
in the Endorsement X Race interaction (F = 2.75; df = 2, 196; p < .10), 
for Negro and white respondents to attribute more class discrimination to 
Negroes than to whites. 


TABLE 1
MEAN CLASS AND RACE DISCRIMINATION SCORES

                          Endorsement
Race            Self    Same race    Other race
Class discrimination scores
  Negro        17.56      21.52        19.38
  White        17.80      19.70        21.60
Race discrimination scores
  Negro        12.42      19.94        17.34
  White        13.28      19.70        23.10


Mean race discrimination scores (Table 1) were also analyzed. Again the 
Endorsement main effect was significant, with greater race discrimination 
being attributed to other persons than to oneself (F = 30.16; df = 2, 196; 
p < .001). A significant first-order interaction between Endorsement and
Race indicated that Negro and white respondents attributed greater race
discrimination to Negroes than to whites (F = 4.48; df = 2, 196; p < .05).
This interaction was due to the fact that greater race discrimination was 
attributed to Negroes by whites than to whites by Negroes (PS 2412; 
df = 98; p < .01). 

As a measure of total discrimination, the class and race discrimination scores 
were summed and analyzed. The Endorsement main effect (F = 24.74; 
p < .001) and the Endorsement X Race interaction (F = 4.90; p < .01)
were both highly significant.

In a final analysis, race discrimination scores were subtracted from class
discrimination scores, and the differences were analyzed. Only the Endorse- 
ment main effect was significant (F = 12.07; p < .001), indicating that 
respondents of both races attributed more class (as opposed to race) dis- 
crimination to themselves than to other persons of either race. 
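
The direction of the last two analyses can be read off the cell means in Table 1, since the mean of a sum (or difference) of two scores equals the sum (or difference) of their means. The sketch below is illustrative only: the variable and function names are assumptions, and it reproduces cell means, not the significance tests reported above.

```python
# Cell means transcribed from Table 1, ordered as
# (Self, Same race, Other race) for each respondent group.
CLASS_MEANS = {"Negro": (17.56, 21.52, 19.38), "White": (17.80, 19.70, 21.60)}
RACE_MEANS = {"Negro": (12.42, 19.94, 17.34), "White": (13.28, 19.70, 23.10)}


def combine(class_means, race_means):
    """Return (totals, differences) of class and race means per group."""
    totals = {}
    differences = {}
    for group in class_means:
        pairs = list(zip(class_means[group], race_means[group]))
        totals[group] = tuple(round(c + r, 2) for c, r in pairs)
        differences[group] = tuple(round(c - r, 2) for c, r in pairs)
    return totals, differences


totals, differences = combine(CLASS_MEANS, RACE_MEANS)
for group in CLASS_MEANS:
    print(group, "total:", totals[group], "class minus race:", differences[group])
```

In both groups the class-minus-race difference is largest for the Self endorsement, which is the pattern summarized by the significant Endorsement main effect in the final analysis.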


D. DISCUSSION AND CONCLUSIONS 


These results clearly indicate that both Negro and white respondents 
believed that other persons, especially Negroes, discriminated along race and 
class lines more than they did. There are two immediate interpretations: one 
based on the concept of projection and one on pluralistic ignorance. The projection
hypothesis questions the validity of the respondents’ self-reports on
grounds that they reflect a social desirability bias or a “holier than thou” 
attitude. That is, since class discrimination and especially race discrimination 
violate the egalitarian ethics of our society, respondents may have been hesitant 
to admit, even anonymously, to discriminatory tendencies. 

The findings, on the other hand, may be an illustration of pluralistic 
ignorance, if the validity of the responses can be assumed. Thus, everyone may 
believe that all or most other persons are discriminatory, although they themselves
are not. Without an independent assessment of response validity, a
choice between these two interpretations is difficult. 

Our results also indicate that both groups of respondents, although admit- 
ting to relatively little discrimination at all, were more likely to discriminate 
along class lines than they believed other people were. This suggests that the 
variable of socioeconomic status may be a more important factor than race in 
affecting discriminatory behavior. The results are consistent with the several
studies which have shown race to be a relatively minor factor in interpersonal
attraction (i.e., 4).


E. SUMMARY 


This study was designed to isolate the variables of social class and race as 
bases of discrimination reported by persons themselves and attributed to other 


major findings were that more class and race discrimination were attributed 
to other persons, especially to Negroes, than to the self and that class was a 
than race. These results are compared 


i rev: wing that race, per se, is a relatively 
minor factor in interpersonal attraction. 


ABBREVIATED VERSIONS OF THE 
VISUAL RETENTION TEST* 


The University of Iowa 


ARTHUR L. BENTON 


A. INTRODUCTION 


This brief report describes an attempt to develop an abbreviated version of 
the copying form (Administration C) of the Visual Retention Test (1, 2) with
the aim of achieving economy of administration of the test in clinical and investigative
work. Neuropsychological evaluation of brain-damaged subjects is a
time-consuming (and costly) process which is often fatiguing to both the patient 
and the examiner. Hence any procedural modification that will reduce adminis- 
tration time without sacrificing reliability or breadth of assessment is desirable. 
The relatively high reliability of the copying form of the Visual Retention Test 
(correlation coefficients between equivalent forms being within the range of 
.85 to .90) and the clinical impression that patients with impairment in design 
copying are likely to show defects in their reproductions from the very begin- 
ning of the task suggested the possibility that this 10-design test could be 
abbreviated without loss of diagnostic value. The present study explored this 
possibility. 

B. METHOD AND RESULTS 
1. Phase I 


In this phase of the study, the test records of 100 control patients without 
history or current evidence of cerebral disease were drawn from the files. These
patients, who ranged in age from 18 to 62 years, had been given Form C, D,
or E of the Visual Retention Test as a copying task. The three forms, when
administered as a copying test, can be considered to be of equivalent difficulty 
level (3, 4). The distributions of scores of the group on the full test (10 
designs) and on abbreviated versions of it (first nine designs, first eight designs, 
etc.) were computed. 

Table 1 shows these distributions of scores and also the distribution of 
scores on the full test of the original standardization group (1, 2). Comparison
of the performances of the two groups discloses some minor differences in the
score distributions. A smaller proportion of patients in the present sample made
perfect scores as compared to the standardization sample (40 percent vs. 56 
percent), and a higher proportion made 1 to 2 errors (47 percent vs. 34 per- 
cent). However, the proportion of poor performances (4 or more errors) is 
approximately the same in the two groups (6 percent vs. 5 percent) and pro- 
vides the same indications for diagnostic interpretation. On the basis of the 
original standardization, an error score of 4 was interpreted as indicating 
borderline impairment, an error score of 5 as indicating significant impairment, 
and an error score of 6 or more as indicating severe impairment. The distribu- 
tion of scores in the present sample leads to the same interpretations: viz., 
borderline impairment reflected in performance poorer than 94-95 percent of 
control patients, significant impairment reflected in a performance poorer than 
that of 98 percent of controls, severe impairment reflected in a performance 
poorer than that of all tested controls. 


TABLE 1
VISUAL RETENTION TEST, ADMINISTRATION C: DISTRIBUTIONS OF SCORES OF THE
ORIGINAL STANDARDIZATION GROUP AND THE EXPERIMENTAL SAMPLE

           Standard-            Experimental group (N = 100)
No.        ization
errors     (N = 100)   10 designs  9 designs  8 designs  7 designs  6 designs
0             56           40         43         46         48         54
1             25           31         28         35         37         34
2              9           16         17         13         10          7
3              5            7          7          4a         4a         5d
4              3a           4a         5a         2b         1b         -c
5              1b           2b         -c         -c         -c         -c
6              1c           -c         -c         -c         -c         -c

a Borderline impairment.
b Significant impairment.
c Severe impairment.
d Borderline or significant impairment.

The score distributions for the abbreviated versions of the test show, of 
course, a smaller total number of errors and a higher proportion of perfect 
performances, since there is less opportunity to make errors. Nevertheless, the 
same statistical criteria that were utilized to define defective performance on 
the full test can be applied to performances on the abbreviated tests. Thus, 
for the eight-design version, an error score of 3 would be interpreted as indicating
borderline impairment, since it represents a performance poorer than
that of 96 percent of controls; an error score of 4 (exceeded by 98 percent of
controls) would indicate significant impairment; and an error score of 5 (exceeded
by 100 percent of controls) would indicate severe impairment. The
diagnostic categories assigned to specific scores in the several distributions are 
shown in Table 1. As will be seen, these categories were utilized in evaluating 
the diagnostic accuracy of the abbreviated versions of the test. 
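
The percentile reasoning behind these categories can be sketched as a cumulative count over the control distribution. This is an illustration under stated assumptions: the function name is hypothetical, the counts are those of the experimental group on the full 10-design test in Table 1, and the count of two controls with five errors is inferred from the 98 percent figure quoted in the text.

```python
# Number of control patients (N = 100) making 0, 1, 2, ... 6 errors on the
# full 10-design test. The entry for five errors is inferred from the text.
FULL_TEST_COUNTS = [40, 31, 16, 7, 4, 2, 0]


def percent_scoring_better(counts, error_score):
    """Percentage of controls who made fewer errors than `error_score`."""
    total = sum(counts)
    return 100 * sum(counts[:error_score]) / total


for errors in (4, 5, 6):
    better = percent_scoring_better(FULL_TEST_COUNTS, errors)
    print(f"{errors} errors: poorer than {better:.0f} percent of controls")
```

Applying the same function to the column for an abbreviated version gives the percentages on which its borderline, significant, and severe categories rest.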


2. Phase II 


In this phase of the study, the test records of 50 patients who had made 4 or 
more errors on the full test were analyzed. This group consisted of 44 patients 
with cerebral disease and the six control patients in the first group who met 
the selection criterion in terms of number of errors. 

Error scores on five abbreviated versions of the test (ranging from the first 
five designs to the first nine designs) were computed for each patient. The 
part-whole correlations between the scores on each version and on the full test 
were then determined. These product-moment correlation coefficients are 
presented in Table 2, which also shows the number of patients who would 
have been misclassified as normal on the basis of their performances on each 
abbreviated version of the test. 


TABLE 2
CORRELATION COEFFICIENTS BETWEEN ERROR SCORES ON FIVE ABBREVIATED VERSIONS
(FIVE TO NINE DESIGNS) OF THE VISUAL RETENTION TEST AND THE
FULL TEST (10 DESIGNS)

Version          r       No. misclassifications
9 designs      .987               0
8 designs      .972               0
7 designs      .943               4
6 designs      .886              14
5 designs      .829              21


Inspection of Table 2 shows that the five-design and six-design versions 
yield an excessively high number of misclassifications and, in addition, the 
correlations between them and the full test are not impressively high. The 
seven-design version correlates highly (r = .94) with the full version, but still 
yields an unacceptably high number of misclassifications (8 percent). The 
employment of the eight-design and nine-design versions results in no diagnos- 
tic misclassifications. 
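
The misclassification percentages follow directly from the counts in Table 2 and the 50 patients analyzed in Phase II. The snippet below is a minimal sketch; the names are illustrative assumptions, and the zero entries for the eight-design and nine-design versions reflect the statement that these versions produced no misclassifications.

```python
N_PATIENTS = 50  # patients with 4 or more errors on the full test (Phase II)

# Number of designs in the abbreviated version -> patients misclassified as normal.
MISCLASSIFIED = {9: 0, 8: 0, 7: 4, 6: 14, 5: 21}


def misclassification_rate(n_misclassified, n_patients=N_PATIENTS):
    """Percentage of patients who would have been classified as normal."""
    return 100 * n_misclassified / n_patients


for n_designs, count in MISCLASSIFIED.items():
    rate = misclassification_rate(count)
    print(f"{n_designs} designs: {rate:.0f} percent misclassified")
```

The seven-design version gives the 8 percent cited above, while the six-design and five-design versions misclassify more than a quarter of the patients.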


C. COMMENT 


The implications of the findings are reasonably clear. It is not possible to 
reduce the length of this copying task to any substantial degree without some 
sacrifice in the accuracy of interpretation of performance. The eight-design 
version can be safely used, but this would save not more than one minute in 
administration time. However, despite this minimal difference in administration 
time, the eight-design test might well be employed in clinical and investigative 
work, since there is the additional element of boredom and fatigue to be con- 
sidered. Copying designs is not a very interesting or gratifying task for either 
the patient who is performing well or the one who is experiencing difficulty, 
and it is worth while to make it as short as possible within the constraint of 
securing an accurate estimate of performance level. 


DIFFERENTIAL REACTIONS TO FRUSTRATIONS OF 
ADOLESCENT AND ADULT INSTITUTIONALIZED 
RETARDATES* 


Willowbrook State School, Staten Island, New York 


LOUIS SIEGEL, MANNY STERNLICHT, GABRIEL PUSTEL, 
MICHAEL DERESPINIS, AND HAROLD BRANDWEIN 


A. INTRODUCTION 


The concept that aggressive or hostile behavior represents the typical re- 
sponse to frustration is a generally accepted view, one which has its origins in 
the work of Dollard, Doob, Miller, Mowrer, and Sears (2). Rosenzweig and 
his associates (5, 6, 7) furthered the frustration-aggression hypothesis by presenting
three possible behavioral directions that an individual may take in response
to a frustrating event: (a) the individual may blame others and display
anger (i.e., extrapunitive); or (b) he may blame himself and display remorse
and guilt (i.e., intropunitive); or (c) he may see the frustrating event as an
unpreventable accident, with no one to blame (i.e., impunitive).

Several studies employing the children’s form of the Rosenzweig
Picture-Frustration Study have been performed with mentally retarded Ss for
purposes of investigating direction of aggression. Portnoy and Stacey (3) 
explored aggression among 30 Negro and 30 white institutionalized preadoles- 
cent retarded children, and they found that, among both groups, the extra- 
punitive reaction predominated consistently, followed in order by the im- 
punitive and intropunitive modes of reaction. Also, the Negro group was found 
to be more impunitive than the white group, and both groups were more im- 
punitive than the normal children in Rosenzweig’s (7) normative group. 

Angelino and Shedd (1) compared 102 mentally retarded children with 
Rosenzweig’s normative group. They ascertained that the retarded group 
responded similarly to the normative group on the children’s form of the 
Rosenzweig Picture-Frustration Study, beginning with a basically extra- 
punitive response and progressing, with increasing age, to an intropunitive 
response. The retarded group, however, reached each stage of reaction about 
two years later than did the normative group. 

The present study is an attempt to explore the direction of aggression and 
type of aggressive response among adolescent and adult institutionalized retar- 
dates, by means of the Rosenzweig Picture-Frustration Study. We hypothesized, 
on the basis of previous research, that institutionalized adolescent retardates 
would display more extrapunitiveness than similar adults, who would exhibit 
a greater degree of intropunitiveness. 


B. METHOD 


The Ss of this experiment consisted of a total of 52 institutionalized (at
Willowbrook State School) retardates, dichotomized into an adolescent group
(CAs 12-20, with a mean CA of 16.7) and an adult group (CAs 21-44, with
a mean CA of 28.3). Total IQs ranged from 52-71, with mean IQs and MAs
of 62 and 7-2 for the adolescent Ss, and 62 and 10-7 for the adults.

Each S was administered the Rosenzweig Picture-Frustration Study on an 
individual basis. Each pictorial response was then classified according to two 
main characteristics: direction of aggression, and type of aggressive reaction.
Under the former category, all scoring was trichotomized on the basis of responses
considered to be either extrapunitive, intropunitive, or impunitive,
while under the latter, responses were held to be either obsessive-dominant, ego-defensive,
or need-persistent. In determining the criteria for scoring a response,
we adhered to the guidelines outlined by Rosenzweig, Fleming, and Rosenzweig 


C. RESULTS AND DISCUSSION 


Tables 1 and 2 depict the results obtained here, in terms of t score analysis. 
Table 1 indicates that there are no significant differences between institution- 
alized adolescent and adult retardates in the manner in which they will direct 
their aggression to frustrating situations; both will react in either an extrapunitive
or impunitive direction, but not in an intropunitive fashion. However,
the manner in which they will handle a frustrating situation is essentially 
discrepant, as revealed in Table 2. 


TABLE 1 
t SCORES FOR DIRECTION OF AGGRESSION 
Direction Adolescents Adults 
Extrapunitive-intropunitive ‘iad 
Extrapunitive-impunitive ey aoe 
Intropunitive-impunitive : 6.648 ine 
* p < .05.
** p < .01.

TABLE 2
t SCORES FOR TYPE OF AGGRESSIVE REACTION

Type of aggressive reaction              Adolescents    Adults
Obsessive dominant-ego defensive            5.48*        4.02*
Obsessive dominant-need persistent           .85         4.07
Ego defensive-need persistent               4.86*         .49

* p < .01


Our adult retardates tended to focus more upon the frustrating situation 
itself, while the adolescents tended not only to focus in this manner as well, 
but also to direct their energies on attempts at finding solutions to the frustrat- 
ing problem. 

The present findings confirm those results obtained by Portnoy and Stacey 
(3) in that our retarded Ss displayed extrapunitive and impunitive ways of 
expression and aggression. However, these results are in contrast to the findings 
that, at later ages, the intropunitive approach for directing aggressive reaction 
is paramount. 

The only discrepancy between institutionalized retarded adolescents and 
adults seems to arise in their reactions to frustrating situations. Our results 
would seem to suggest that the longer a retardate is institutionalized, the more 
frustrated he becomes in trying to find a solution to a problem, with the con- 
sequence that he finally resigns himself to the notion that he is helpless and 
defenseless. This finding supports the conclusions drawn by Pustel, Sternlicht, 
and Siegel (4). They concluded that the institution is perceived of as a punish- 
ing monster, and adults, on account of routine ongoing frustrations of institu- 
tionalized living, developed a defense of perceptual indifference. This results in 
their focusing more on the frustrating situation than on an attempt to solve 
that situation, on account of their felt feelings of inadequacy and helplessness. 


THE CORRELATION BETWEEN PIAGET'S CONSERVATION 
OF QUANTITY TASKS AND THREE MEASURES OF 
INTELLIGENCE IN A SELECT GROUP OF 
CHILDREN IN IRAN* 


Pahlavi University, Shiraz, Iran 


M. A. BAT-HAEE, A. H. MEHRYAR, AND V. SABHARWAL


A. INTRODUCTION 


Although Piaget’s genetic-developmental view of intelligence and the tradi- 
tional psychometric view have developed from somewhat different orientations, 
they represent but two different approaches to the understanding and evalua- 
tion, respectively, of the same phenomena. The fundamental similarities of the 
two approaches have been increasingly recognized over the past few years (3, 
6). These similarities may be considered either from a conceptual point of 
view (6) or in terms of the more applied end results (3, 4). 

Perhaps the main conceptual commonality between the Piagetian and psycho- 
metric approaches to the study of intellectual phenomena resides in what they 
regard as the nature or essence of “intelligence.” To both schools, the most 
central feature of human intelligence is its rationality (6). While to many 
psychometricians the purest measures of intelligence—or Spearman’s g factor— 
typically involve some kind of generalization and abstraction (or the eduction
of relations and correlates, as Spearman used to call them), for Piaget, too, 
the essence of intelligence lies in an individual’s reasoning capacities. Piaget, 
however, is primarily interested in the functional or operational aspects of these 
abilities as revealed in the process of an individual’s adaptation to his environ- 
ment. 

According to Piaget (10, p. 156), the best criterion of the emergence of 
operations at the level of concrete structures is in fact the constitution of in- 
variants or the notion of conservation. Piaget’s approach to the investigation of 
these phenomena—particularly that of quantity—at this age level (seven to 
11 years) has become one of the most popular aspects of his work. Yet, sur- 
prisingly little has been done to investigate the association between operations 
underlying conservation and those of intelligence as covered by traditional 
mental tests, two exceptions being the work of Laurendeau and Pinard (7)
and Dudek et al. (4). The present study was designed to investigate the
relationship between Piagetian tests of conservation and certain standard tests
of intelligence in a select group of children in Iran. 


B. METHOD
1. Subjects 


Ss were 65 children (35 boys and 30 girls) aged between six and 11 attend- 
ing grades 1 to 5 of the Community School in Shiraz, in Southern Iran. 
Organized by a group of English speaking residents of the town, the school 
caters to children of American and European professional people working in 
Shiraz, as well as Iranians who either because of their English speaking spouses 
or their professional interests in the West favor a more Western type of education,
with English as the main language of instruction, for their children. In
the present sample the national origins of parents were as follows: both
parents Iranian, 30 (46%); both parents Western, 18 (28%); one parent
Iranian and one Western, 17 (26%). In terms of the socioeconomic background
of parents, almost all subjects come from higher professional and administrative
echelons. This is well reflected in their mean intelligence level as
revealed by Table 1 and justifies the term “select group” in the title of this
paper.

2. Procedure 
Tests used included the following: 
1. Piaget’s tests of conservation of quantity, of the type of conservation of


TABLE 1
MEANS AND SDs OF THREE MEASURES OF INTELLIGENCE AND PIAGET'S
CONSERVATION TESTS FOR GRADES 1 TO 5 (N = 65)

                                   Piaget's    WISC      WISC
Grade    N              CPM        tests       Vocab.    Arith.
1        21     Mean    16.19       .62        17.24      5.62
                SD       5.22      1.02         5.03       .92
2        [?]    Mean    18.27      0           20.36      6.73
                SD       5.20      0            7.00      1.62
3        [?]    Mean    23.07      3.20        27.33      8.07
                SD       6.30      3.26         7.28      1.87
4        [?]    Mean    30.91      4.73        37.18     10.36
                SD       2.62      2.29         3.91      1.35
5        [?]    Mean    31.28      6.00        38.57     11.43
                SD       3.21      3.00         6.24      2.07
equivalence as described by Elkind (5). For each quantity (mass, weight, and
volume) Ss were asked to predict, judge, and explain their conservation or
nonconservation responses. Since three questions in relation to each test of
conservation were asked, and each conservation response was scored as “1,” the
maximum possible score for these tests was nine; and the composite score obtained by
each S was used in the analysis.
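
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the dictionary layout, and the sample responses are hypothetical and are not part of the original procedure.

```python
# Minimal sketch of the composite conservation score (assumed data layout).
QUANTITIES = ("mass", "weight", "volume")
QUESTIONS = ("predict", "judge", "explain")


def composite_score(responses):
    """Add 1 point for every conservation response (3 quantities x 3 questions)."""
    total = 0
    for quantity in QUANTITIES:
        for question in QUESTIONS:
            if responses[quantity][question]:  # True = conservation response
                total += 1
    return total


# Hypothetical child: conserves mass fully, weight partly, volume not at all.
example = {
    "mass": {"predict": True, "judge": True, "explain": True},
    "weight": {"predict": True, "judge": True, "explain": False},
    "volume": {"predict": False, "judge": False, "explain": False},
}
print(composite_score(example))  # 5
```

A composite of 0 means no conservation response on any question, and 9 means conservation on every question for all three quantities.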

2. Raven’s Coloured Progressive Matrices (CPM) which, like its adult 
version, is regarded as one of the best measures of factor g. 

3. Vocabulary and Arithmetic scales of the Wechsler Intelligence Scale 
for Children (WISC). 

The guiding principle in the choice of the tests of intelligence was inclusion 
of representative measures of both g and the more educationally determined 
verbal and numerical dimensions of intelligence. In view of the emphasis on 
the significance of conservation of quantity for the development of numerical 
concepts in children by Piaget (9) and others (e.g., 11) it seemed interesting 
to investigate the relationship between conservation and measures of verbal, 
numerical, and spatial ability, separately. 

The tests were administered individually by a senior psychology student
trained for this purpose. The standard instructions for giving each test were 
strictly adhered to. 


C. RESULTS


Table 1 summarizes the means and SDs of the four measures for each of 
the five age groups separately. A comparison of data summarized in Table 1 
with original American and British norms for the Wechsler and CPM scales 
would seem to indicate the relatively high performance of these Iranian Ss. 
The consistent increase of mean scores from one grade to the other is also 
indicative of the basic validity of the three scales in their new context. Despite 
their superior performance on the intelligence scales, Iranian Ss in grades 1 and 
2 curiously failed on Piaget’s conservation tests. This is hard to explain both 
in view of general expectation generated by the results of cross-cultural studies 
of Piaget’s conservation tests and in view of findings by Bat-Haee and Hosseini
(2) with a much less selected group of Iranian children whose performance is 
well in accord with Piaget’s prediction. In view of the dubious nature of these 
findings—which may have arisen out of a failure on the part of the tester—the 
first two grades were omitted from the correlational analysis of the data. 
Table 2 presents the correlations between the various measures of cognitive 
functioning included in the study. 

As expected, the two subscales of the WISC were highly correlated (r =
TABLE 2
CORRELATIONS OF COLOURED PROGRESSIVE MATRICES, WISC VOCABULARY, AND
WISC ARITHMETIC WITH PIAGET’S TESTS OF CONSERVATION,
GRADES 1 TO 5 (N = 65)

Tests                                     1      2      3      4
1. Tests of conservation of quantity      -     .45    .36    .38
2. Coloured Progressive Matrices          -      -     .68    .72
3. WISC Vocabulary                        -      -      -     .75
4. WISC Arithmetic                        -      -      -      -


.75). The nonverbal CPM also correlated substantially with both Vocabulary 
and Arithmetic scales (r = .68 and r = .72, respectively), and it also pro- 
duced the highest correlation with the conservation score (r = .45). 
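
As a reminder of how a coefficient such as those in Table 2 is obtained, the following Python fragment computes a product-moment correlation from paired scores. It is a sketch only: the function name and the two score lists are hypothetical illustrations, not the study's data, and the source does not state which computing routine the authors used.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical composite conservation scores and CPM raw scores for five Ss.
conservation = [2, 4, 5, 7, 9]
cpm = [20, 28, 25, 33, 31]
print(round(pearson_r(conservation, cpm), 2))
```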


D. DISCUSSION OF RESULTS


The positive correlations between the three measures of intelligence and the 
total conservation score were high enough to support the assumption of a 
general factor running through all three tasks. The correlations in fact were 
not very different from some of those reported for the WISC performance 
scale and certain subscales (e.g., Arithmetic and Comprehension) of the same 
test. The use of individual scales—with their well documented lower reliabili- 
ties—no doubt resulted in a smaller measure of association between intelligence 
and Piagetian tasks in the present study: hence, perhaps, the discrepancy be- 
tween correlations reported by Dudek et al. (4) and those found in this study. 
It is, however, encouraging that the relationship between the CPM test and 
conservation is not appreciably different from those reported by Dudek et al. 
for the full-length performance scale of the WISC. Unlike this study, however, 
Dudek et al. reported slightly lower correlations between WISC performance 
scores and Piagetian measures. It is likely that the WISC performance scale 
and the CPM cover somewhat different aspects of nonverbal intelligence. 

In summary, the present findings would seem to support the contention that
in spite of the different orientations and origins of the traditional intelligence
tests and the currently popular Piagetian tasks, they both appear to be sampling
cognitive processes which are highly correlated and presumably rest on some
fundamental construct, like Spearman’s g factor. This conclusion has also received
support from a number of studies (e.g., 1, 8) dealing with other aspects
of Piaget’s theory.
E. SUMMARY


Piaget’s tests of conservation of quantity, Raven’s Coloured Progressive
Matrices, and WISC Vocabulary and Arithmetic scales were administered to
a select group of children in Iran in grades 1 to 5. Positive correlations running
from .36 to .45 were found between the three measures of intelligence and
Piaget’s conservation tasks. This is interpreted as an indication of a general
factor common to Piaget’s tasks and the traditional tests of intelligence used
in this study.
THE USE OF HUMAN FIGURE DRAWINGS TO ASSESS
A CULTURAL VALUE: SMILING IN THAILAND** 


Department of Psychology, College of Saint Teresa 


HARRY W. GARDINER


In an extensive study of group values reflected in human figure drawings by 
11- to 13-year-old boys in 27 cultural groups, Dennis (2) found the greatest 
differences among groups to be in frequencies of smiling faces drawn. It was 
suggested that the number of smiling faces drawn reveals the extent to which 
having a smiling face is a social goal or a cultural value. 

Thailand—often called the “Land of Smiles”—provides an excellent setting 
in which to test the validity of this hypothesis, since as one writer has remarked 
(1, p. 9) “In Thailand, it appears, almost everyone smiles almost all the 
time.” 

The present study focused upon 1043 drawings of men and women by Thai 
boys and girls, aged 9-17, living in two urban centers and two rural villages. 
Subjects were 393 boys drawing men, 147 boys drawing women, 207 girls 
drawing men, and 296 girls drawing women. 

Each drawing was scored according to specific criteria for the presence or 
absence of smiling—i.e., for a face to be scored as smiling, both corners of the 
mouth had to turn upward. When the mouth was represented by a single line, 
or when closed lips formed a single line, the rule applied to these lines; if the 
mouth was open, it applied to the upper lip.

Analysis indicated that incidence of smiling ranged from 44% among boys’ 
drawings of men to 81% among girls’ drawings of women. The overall per- 
centage was 64%—well above 24 of the 27 groups cited by Dennis. This 
figure compared with 31% in a Japanese village, 29% and 24% in two 
samples from Taiwan, 22% for a group in Kyoto, and 21% among Cambodian 
children (2). Among non-Asian groups, the proportion of smiling figures
included 37% for nonorthodox Israelis, 27% for children in Edinburgh, 20%
in Mexico City, and 5% for Chiapas Indians (2).

... among judges, a finding in accord with that previously reported by Dennis

These data, in light of the known value placed on smiling in the Thai 
culture, lend strong support to the hypothesis set forth by Dennis that children 
draw human figures with smiling faces in proportion to the degree to which 
smiling is socially approved or culturally valued. They also serve to demon- 
strate further the efficacy of this technique for cross-cultural comparisons (3). 
MASCULINITY-FEMININITY BODY RANKINGS
BY MALES AND FEMALES* 


Florida State University 


ELLINGTON DARDEN


A. INTRODUCTION 


Recent trends in fashion and dress, as well as various male and female be- 
haviors, have caused concepts of masculinity and femininity to be questioned. 
Masculinity and femininity have been expressed in several ways. Many people 
emphasize behavioral traits, while others rely on physical appearance as the 
most important factors in determining what is masculine or feminine. Several 
researchers (16) have noted that men are becoming more feminine and women 
are becoming more masculine. No doubt much of this is a result of recent role
changes between males and females, as well as the breakdown of the family 
unit. Perhaps modern trends in male-female role reversals, as well as fashion 
designs, have tended to bring traditional views of what is masculine and 
feminine closer together. On the other hand, with media emphasis on such 
varied stimuli as sex, athletics, education, and the “do your own thing” 
attitudes, men and women might have different concepts of masculinity- 
femininity. In fact, much of this thinking has caused Vroegh (21) to say that 
there is no consensus about the meaning of masculinity and femininity. 

Many studies have compared the physical aspects of maleness and female- 
ness. Draper (7) noted the basic physical characteristics of what was con- 
sidered to be masculine and feminine. These characteristics were later refined 
into a rating scale by Bayley and Bayer (1), who noted some basic char- 
acteristics of masculinity as massive shoulders, marked torso narrowing, and 
strong muscle molding; while femininity was characterized by soft narrow 
shoulders, rounded flared hips, and soft smooth muscles. Other experimenters 
(4, 8, 9, 10, 17) have developed various types of scales for measuring 
masculinity-femininity. 

Additional studies (2, 11, 12) have indicated that the strong, muscular type 
of physique is a social advantage among adolescent boys. Deno (5) found that 
ninth-grade boys most admired a muscular v-shaped upper body. Other in- 
vestigators (6, 13, 20) have shown that both male children and male college 
students show a clear preference to look like the mesomorph image as opposed
to the endomorph and ectomorph images. On the other hand, Darden (3) in
testing 145 male college athletes on a body image discrepancy scale found that
49 percent of the group were satisfied with their body images, while 37.2
percent of the group desired to look more masculine, and 13.8 percent less
masculine. Evidently, one would be on unsound ground to assume that most males
desire to look more masculine.

Aspects of femininity have also been studied. Vroegh (21) had teachers 
divide preschool children into most or least masculine or feminine categories 
according to various personality variables. It was noted that the most feminine 
girls tended to be more socially adjusted, competent, and introverted than the 
least feminine girls. In working with older children, Slee (19) reported that 
British girls were strongly influenced by a feminine image. Katcher and Levin 
(14) noted that older girls tended to identify with smaller body parts, as many 
times smallness is associated with femininity. 

Few reported studies have compared males’ and females’ rankings of 
masculinity-femininity drawings. Deno (5) and Gottesman and Caldwell (8) 
worked with several ranking tests, but made no male-female comparisons. 
Therefore, the purpose of this study was to compare the rankings of college 
men and women concerning different degrees of masculine and feminine body 
drawings. 


B. METHOD
1. Subjects


Both male (n = 101) and female (n = 133) students enrolled at Florida
State University during the Spring Quarter, 1971, were used as subjects. The
age of the males ranged from 18-34 years, with a mean of 21.4; while the
females ranged from 18-25 years, with a mean of 19.8. Subjects were chosen
from various classes (English, sociology, business, educational research, religion,
and physical education). The test was administered in groups of 20 or more,
either at the beginning or end of a class period. One male experimenter was
present with each group during the testing.


2. Testing Apparatus and Procedures


The masculinity-femininity ranking scale was devised from several basic 
drawings of Gottesman and Caldwell (8) and consisted of 12 two-dimensional 
outline drawings of the body (See Figure 1), as these drawings incorporated 
some of the essential differentiating qualities between masculine and feminine 
attributes. Masculine and feminine differentiation was achieved through 
changed dimensions in (a) width of the shoulders, (b) size of the waist, and
(c) size of the hips. Larger arm size (diameter) was emphasized in five of
the figures (drawings 1 through 5), and thigh and calf size was emphasized in
two of the figures (drawings 1 and 2). Size of the neck and head, and height
of the figures remained the same throughout the series.

FIGURE 1
MASCULINITY-FEMININITY DRAWINGS

The drawings were presented on standard 8 1/2” X 11” paper in three 
horizontal rows in the following order: 3-10-12-6, 2-9-4-7, 11-1-8-5. Each
figure had a letter from A to L, consecutively, placed beneath it, and numbers 
1-12 were placed in the left hand margin to facilitate both ranking and 
tabulating. Subjects were randomly placed in subgroups and verbally in- 
structed according to one of four statements: (a) Please rank the 12 drawings 
from most to least masculine. (b) Please rank the 12 drawings from least to 
most masculine. (c) Please rank the 12 drawings from most to least feminine. 
(d) Please rank the 12 drawings from least to most feminine. The words 
most and least were equally reversed in ranking both masculinity and fem- 
ininity in order to counterbalance for the ranking sequence. In other words, if 
all subjects were instructed to rank the figures from most to least masculine, 
the last few figures would be easiest to rank by the mere process of elimination. 


3. Design and Statistical Analysis 


The masculinity-femininity ranking scale was administered to males and 
females and later divided into two main groups according to whether the 
drawings were ranked on masculinity or femininity. Under the heading of 
Masculinity Rankings there were Male Group I and Female Group I; under 
the heading of Femininity Rankings there were Male Group II and Female 
Group II. The rankings of each of the four groups were determined by listing 
each rank of each subject under the appropriate letter (A-L) across from the 
correct number (1-12). The columns were then added, with the largest total 
score being designated as most masculine (#1) and the smallest total score 
least masculine (#12). The ranking data were analyzed with use of Kendall’s 
coefficient of concordance and Spearman’s rank correlation coefficient (18). 
The Kendall coefficient of concordance, W, was used to test agreement among 
the subjects in each group. In order to determine if significant differences
existed among the ranks of the groups, t tests were applied to the Spearman
rank correlation coefficient, $r_s$.


C. RESULTS


The total scores and rankings of each of the 12 drawings (A-L) for the 
four groups are presented in Table 1, which indicates that all groups were in 
accord on the last four ranks (least masculine or most feminine). The males 
and females under the Femininity Rankings were also in accord for all ranks. 
In fact, there was general agreement among all 12 ranks for all groups except 
TABLE 1
COMPOSITE RANKINGS AND TOTAL SCORES OF MASCULINITY-FEMININITY
DRAWINGS BY MALES AND FEMALES

           Masculinity Rankings                Femininity Rankings
           (Most to least masculine)           (Least to most feminine)
           Male Group I    Female Group I      Male Group II    Female Group II
Rank       (n = 60)        (n = 74)            (n = 41)         (n = 59)
 1         E-568           A-695               J-470            J-673
 2         J-565           H-693               E-440            E-621
 3         A-564           G-648               A-390            A-556
 4         G-533           L-622               G-381            G-526
 5         H-470           D-620               L-302            L-470
 6         D-467           E-612               D-282            D-422
 7         L-459           K-596               H-254            H-376
 8         K-359           J-534               K-233            K-287
 9         F-291           F-379               F-161            F-236
10         B-196           B-242               B-104            B-142
11         I-117           I-150               I-86             I-130
12         C-67            C-100               C-72             C-106

Note: The mean ages of the subjects within each subgroup were: Male I, 21.2
years; Female I, 19.7; Male II, 21.6; and Female II, 20.0.


Female Group I. The Kendall coefficient of concordance, W, was used to test
agreement among the rankings of the subjects in each of the groups. The W
statistic was computed and tested by the appropriate expression
[$\chi^2 = k(N - 1)W$, where k is the number of subjects doing the ranking and
N the number of drawings ranked], which is distributed as chi square with
N - 1 degrees of freedom. The results of these tests indicated agreement among
members of each of the groups at the .001 level.
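
The concordance test can be illustrated with a short Python sketch. It uses the standard no-ties form of Kendall's coefficient, $W = 12S / [k^2(N^3 - N)]$, where S is the sum of squared deviations of the column rank totals from their mean, and then the chi-square approximation $k(N - 1)W$. That the authors computed W in exactly this form is an assumption; the function name and the three sample rankings are hypothetical.

```python
def kendall_w(rankings):
    """Kendall's W for k complete rankings of N objects (no tied ranks assumed).

    rankings: list of k lists; rankings[j][i] is the rank judge j gave object i.
    Returns (W, chi_square, degrees_of_freedom).
    """
    k = len(rankings)          # number of judges (subjects doing the ranking)
    n = len(rankings[0])       # number of objects ranked (drawings)
    rank_sums = [sum(judge[i] for judge in rankings) for i in range(n)]
    mean_sum = k * (n + 1) / 2
    s = sum((total - mean_sum) ** 2 for total in rank_sums)
    w = 12 * s / (k ** 2 * (n ** 3 - n))
    chi_square = k * (n - 1) * w
    return w, chi_square, n - 1


# Hypothetical example: three judges ranking four drawings.
sample = [
    [1, 2, 3, 4],
    [2, 1, 3, 4],
    [1, 3, 2, 4],
]
w, chi_square, df = kendall_w(sample)
print(round(w, 3), round(chi_square, 2), df)
```

W runs from 0 (no agreement) to 1 (identical rankings from every judge); the chi-square value is then referred to a table with N - 1 degrees of freedom.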

In order to determine if significant differences in ordering existed among 
the four male-female groups, the data were further analyzed with use of 
Spearman’s rank correlation coefficient, $r_s$. This coefficient can be tested by the
formula $t = r_s \sqrt{(N - 2)/(1 - r_s^2)}$, which is distributed as Student’s t
with N - 2 degrees of freedom. Table 2 shows the results of these analyses.
No significant differences existed between the rankings of Male Group II and 
Female Group II, Male Group I and Male Group II, and Male Group I and 


TABLE 2
COMPARISONS BETWEEN GROUPS ON MASCULINITY-FEMININITY RANKINGS

Group comparison                       df     r_s      t
Male Group I-Female Group I            10     .75     3.87*
Male Group I-Female Group II           10     .05      .02
Male Group I-Male Group II             10     .05      .02
Male Group II-Female Group II          10    1.00     0
Female Group I-Female Group II         10     .66     3.56*
Female Group I-Male Group II           10     .66     3.56*

* Significant at .01 level.
Female Group II. However, significant differences existed at the .01 level
between the rankings of Male Group I and Female Group I, Female Group I 
and Female Group II, and Female Group I and Male Group II. Therefore, 
the rankings of Female Group I were found to be significantly different from 
the other three groups. 
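
For readers who wish to retrace this kind of comparison, the sketch below computes Spearman's rank correlation from two sets of ranks with the usual no-ties formula, $r_s = 1 - 6\sum d^2 / [N(N^2 - 1)]$, and converts it to a t value with N - 2 degrees of freedom. The function names and the two rank lists are hypothetical illustrations; they are not the rankings reported in Table 1, and no attempt is made here to reproduce the entries of Table 2.

```python
import math


def spearman_rs(ranks_a, ranks_b):
    """Spearman rank correlation for two rankings of the same N objects (no ties)."""
    n = len(ranks_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))


def t_for_rs(rs, n):
    """t statistic for r_s with n - 2 degrees of freedom.

    The ratio is undefined when |r_s| = 1, so an infinite value with the
    sign of r_s is returned in that case.
    """
    if abs(rs) >= 1:
        return math.copysign(math.inf, rs)
    return rs * math.sqrt((n - 2) / (1 - rs ** 2))


# Hypothetical ranks given to six drawings by two groups.
group_1 = [1, 2, 3, 4, 5, 6]
group_2 = [2, 1, 4, 3, 6, 5]
rs = spearman_rs(group_1, group_2)
print(round(rs, 3), round(t_for_rs(rs, len(group_1)), 2))
```

A large positive t here indicates that the two rankings are associated more strongly than chance would suggest.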


D. Discussion 


The results of this study indicated some interesting findings. Males and 
females were in perfect agreement concerning the rankings of femininity. 
However, when the same drawings were ranked according to masculinity, 
significant differences existed. Seemingly from the figures and concepts used, 
least masculine and most feminine, and most masculine and least feminine 
should have been synonymous. This was found to be true with males, but not 
true with females. 

The discrepancy between males and females concerning masculine body proportions
merits some thought-provoking speculations. Lipman (15) notes that
the early American value system that stressed the attribute of physical prowess 
was directly anchored and functionally related to the frontier and rural society. 
He further says: “That people still perpetuate the old rural ideal of manliness, 
represents one of the most serious cultural lags of our time—certainly, one 
with direct implications for the education of young men in our society” 
(p. 218). Perhaps women in general have been able to bridge this cultural lag
more than men, especially since many male children as well as adults still 
idealize and emulate the “big strong athlete” (11). In the same light, Ameri- 
can culture does not positively reinforce female participation in athletics, at 
least not to the same extent as male participation. Certainly physical prowess 
is not a prerequisite for success in most vocations as it once was for men. But
do men generally realize this? Lipman (15) would say, “No!” 

Rosenberg and Sutton-Smith (17) reported in 1959 that girls showed more 
of a tendency for active, outdoor, competitive and team games than was the case 
in earlier investigations. This seems to conflict with the results of this study 
and the thoughts of Lipman (15). Seemingly, if women were becoming more 
active in sports, there would be more agreement between men and women on 
concepts of masculinity. But have women really become more active in sports
within the last decade? Male and female roles within our society are rapidly 
changing. Much of this change is evident in the unpredictable fashion trends. 
Perhaps women are better able than men to adjust to these changing roles.
Therefore, many men may overcompensate by reverting to the values of the
traditional rural society of their fathers.
E. SUMMARY


The purpose of this study was to compare rankings of college men and 
women concerning different degrees of masculine and feminine body drawings. 
Male (n = 101) and female (n = 133) students were asked to rank 12
similar, but different, whole body outline drawings, by using their concepts of 
masculinity or femininity. Between and within group comparisons indicated 
that no differences existed between the groups on the ranking of femininity. 
However, in ranking masculinity concepts, females differed significantly from 
males. 
CHARACTERISTICS OF STUDENT
SUMMER EMPLOYMENT*?* 


University of Lethbridge, Canada 


GORDON W. RUSSELL AND RODGER D. CONNER


A. INTRODUCTION 


The vast array of job preference studies reported in the formal literature 
has typically focused on jobs that are normally held for extended periods of 
time. One important emphasis of such studies has been the investigation of 
factors underlying job satisfaction, liking, or preferences. While it is important 
to establish the dimensionality underlying career choices, the area of short term 
employment preferences has been virtually disregarded. Literally, hordes of 
high school, college, and university students annually invade the North American
labor market in search of jobs lasting anywhere from one day to the
entire summer. For many, summer earnings enable them to finance all or a 
portion of their education or, conversely, allow them to incur a lighter burden 
of debt. Other implied, though undocumented, motives include the opportunity 
to travel, being of service to others, making social contacts, experience, etc. 
Thus, students are commonly presumed to seek summer jobs on the basis of a 
fairly lengthy list of possible job attributes. In an earlier review of 150 career 
preference studies, Herzberg et al. (3) summarized the following 10 factors 
in order of their frequency of occurrence: 1) intrinsic aspects of job, 2) 
supervision, 3) security, 4) company and management, 5) working conditions, 
6) wages, 7) opportunity for advancement, 8) social aspects of job, 9) 
communication, and 10) benefits. Except in that rare instance in which the
student plans a career with his summer employer, many of these considerations 
are apt to be irrelevant to a preference for a summer job. It therefore seemed 
reasonable to predict a factor structure that differed markedly from that 
typical of career preferences. 

Similarly, the practices of students in approaching the job market and, 
equally important, the attitudes and practices of employers with respect to 
summer employment are sparsely documented in the formal literature.
Unique features of summer employment are its short duration and the general
absence of expectations that the experience is a stepping stone to a career with
the employing firm. Perhaps this lack of a visible career commitment is partly
responsible for the bad press occasionally given student workers. Commentary
to the effect that they are poor and unreliable workers and show little interest
in the job is echoed in the local media each spring. The further possibility that
students are seen as intruders in an already oversupplied labor market undoubtedly
serves to intensify attitudes further. Against this background, the
present study attempted to assess the attitudes of those in the best position to
evaluate students, the employers. Thus, in addition to an attempt to identify
the nature of the dimensions of summer job preferences, a preliminary examination
of the domain of student-employer attitudes and practices vis-à-vis each
other was undertaken.


B. PROCEDURE


A set of 16 job descriptions was generated, representing the range and 
of summer jobs available to students in Southern Alberta. The descriptions 
detailed the nature of the job and in each case stated the salary and any appli- 
cable benefits, such as free room and board. As a control for experience, it was 
made clear in the case of the more technical jobs that previous experience was
not required, as on-the-job training would be provided. A nine-point scale ranging
from “not preferred” to “strongly preferred” accompanied each job
description. The scales were completed by 120 male (28 percent mail return)
and 141 female (38 percent) first and second year Community College students.
The ratings were intercorrelated and subjected to factor analysis by the
method of principal components followed by Varimax rotation.

A set of attitudinal questions was mailed separately, as part of a broader
survey of hiring practices, to a sample of employers drawn from the diverse
types of industry indigenous to Southern Alberta, with 557 questionnaires
being returned (36 percent).

In the same survey, student samples were drawn from the local high schools
(N = 788 males, N = 752 females); the Lethbridge Community College
(N = 119 males, N = 101 females); and the University of Lethbridge (N =
333 males, N = 244 females). The student questionnaires were completed as
part of registration in the case of the latter two institutions and in the
classroom in the former instance.
C. RESULTS


In the case of both male and female Community College students, a six- 
factor solution was indicated on the basis of interpretability, although Factors 
IV to VI in both samples are of minimal importance and possibly unstable. 
The mean rated male and female preferences for each job description and the 
rotated factor loadings are presented in Tables 1 and 2. 


1. Male Sample 


Factor I—Consumer Contact (accounts for 40.1 percent of total variance). 
The high factor loadings on Department Store Clerk (.75), Service Station 
Attendant (.67), Motel Desk Clerk (.54), and Elevator Operator (.42) 
reflect a dominant preference for summer jobs that offer opportunities to inter- 


TABLE 1
ROTATED FACTOR LOADINGS—MALES

Job                          Mean     I      II     III    IV     V      VI
Motel Desk Clerk             5.6     .54   -.10   -.18    .20    .40    .49
Customs Officer              5.3     .25    .13    .16   -.12    .61    .18
Research Assistant           5.0    -.18    .10    .09    .08    .75   -.11
Elevator Operator            4.7     .42    .03    .38    .17    .50    .05
Tour Guide                   4.6     .23    .59    .08   -.07    .48    .16
Clothing Model               [?]     .03    .29    .06    .00    [?]    .82
Meat Wrapper                 3.7    -.13    [?]    .29    .60   -.05    .52
Playground Worker            3.6     .26    .69    .13    .08    .24    .03
Library Assistant            3.5     .26    .36   -.29    .59    .24    .01
Flagman                      3.4     .23    .18    .71   -.10    [?]    .21
Department Store Clerk       3.4     .75    .23    .12   -.03   -.08    .14
Service Station Attendant    3.3     .67   -.02    .24    .22    .14   -.24
Fruit Picker                 2.9     .07    .19    .79    .12    .14   -.06
Poultry Processor            2.8     .05   -.05   -.01    .87    .04    .03
Volunteer Social Worker      2.7    -.13    .81    .22    .05   -.06    [?]
Sugar Beet Worker            1.8     .19    .32    .30    .52    .22   -.08
TABLE 2
ROTATED FACTOR LOADINGS—FEMALES

Job                          Mean     I      II     III    IV     V      VI
Motel Desk Clerk             [?]    -.03    .40    .34    [?]    .07    .26
Customs Officer              5.4     .09    .12    .20   -.03    .83    .02
Tour Guide                   5.3     .18   -.34    .33    .35    .48    .16
Elevator Operator            4.3     [?]    [?]    .81    .08    .20   -.05
Clothing Model               4.8    -.23    .21    .33    .78   -.10   -.09
Research Assistant           4.7     .18    .10   -.18    .19    .43    .49
Library Assistant            4.3    -.03    .10    .20    .04   -.02    .81
Playground Worker            4.3     .30   -.05   -.10    .73    .12    .16
Volunteer Social Worker      4.1     [?]   -.28   -.16    .40    .05    .33
Department Store Clerk       4.0     .03    .04    [?]    .03    .02    .35
Flag Girl                    3.7     .72    .04    .05    .06    .25    .13
Service Station Attendant    3.3     .60    .39    .30    .03    .01   -.10
Meat Wrapper                 3.2     .15    .81    .09   -.07   -.05    .04
Fruit Picker                 2.9     .59   -.06    .41    .12    [?]    .13
Poultry Processor            2.2     .16    .67   -.02    .30    .18    .09
Sugar Beet Worker            1.6     .74    .17    .06    .01    .05    [?]


act with the consumer public. Many of the students in this sample anticipate 
careers within the business community where social skills are at a premium. 
Such summer jobs afford an ideal chance to test and develop skills in prepara- 
tion for future business careers, 

Factor II—Altruism (22.1 percent). The fact of extremely low wages being
coupled with social good in the roles of Volunteer Social Worker (.81) and
Playground Worker (.69) highlights a strong altruistic theme underlying
summer job preferences.

Factor III—Outdoors (3.4 percent). The high factor loadings on Fruit
Picker (.79), Flagman (.71), Elevator Operator in a resort area (.38), and
Sugar Beet Worker (.30) clearly portray a preference for jobs that take the
students out-of-doors and out of the city.


Factor IV—Abdication of Responsibility (1.4 percent). An interpretation
recognizing a preference for jobs that do not require responsibility for or to
other workers or the public, but simply involve doing one’s job, is suggested 
by the factor loadings on Poultry Processor (.87), Meat Wrapper (.60), 
Library Assistant (.59), and Sugar Beet Worker (.52). 

Factor V—Responsibility (1.4 percent). The jobs on which the highest 
factor loadings occur share a common element of responsibility expected of the 
employee. A lesser degree of power is also implied by the loadings on Research 
Assistant (.75), Customs Officer (.61), Elevator Operator (.50), Tour Guide 
(.48), and Motel Desk Clerk (.40). 

Factor VI—Dollars (1.1 percent). While this factor is marginal in impor- 
tance, the loadings on Clothing Model (.82), Meat Wrapper (.52), and 
Motel Desk Clerk (.49), suggest that a theme of earning potential pervades 
the ordering of these jobs. The correlation between the ordering of jobs on 
this factor and the calculated wages derivable from each job description was 
significant (Pearson r = .665, p < .01).
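The correlation reported above can be illustrated with a minimal sketch of the Pearson product-moment formula. The three loadings are those given in the text for Factor VI; the wage figures are hypothetical placeholders, since the job descriptions and their calculated wages are not reproduced here, and the function name `pearson_r` is an assumption made for illustration. The sketch is not expected to reproduce the reported coefficient.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson product-moment correlation of two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least two values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Factor VI loadings reported in the text for three jobs (male sample).
loadings = {"Clothing Model": 0.82, "Meat Wrapper": 0.52, "Motel Desk Clerk": 0.49}
# HYPOTHETICAL summer wages in dollars, used only to make the sketch runnable.
wages = {"Clothing Model": 600.0, "Meat Wrapper": 450.0, "Motel Desk Clerk": 420.0}

jobs = sorted(loadings)
r = pearson_r([loadings[j] for j in jobs], [wages[j] for j in jobs])
print(f"r = {r:.3f}")
```

In the study the correlation would be computed over the full set of job descriptions rather than the three jobs used here.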


2. Female Sample 


Factor I—Outdoors (32.6 percent). The loadings on Sugar Beet Worker
(.74), Flag Girl (.72), Service Station Attendant (.60), Fruit Picker (.59), 
and Volunteer Social Worker (.47) clearly describe a theme of preference for 
the outdoors. This dimension is essentially the same as that described under 
Factor III of the male sample. 

Factor II—Dollars (14.6 percent). Loadings were Meat Wrapper (.81),
Poultry Processor (.67), and Motel Desk Clerk (.40). The correlation between
this dimension and the earnings associated with each job is again
significant (Pearson r = .698, p < .01).

Factor III—Social (3.1 percent). This uniquely female dimension reflects a
preference for summer jobs that afford optimal opportunities for meeting
eligible males. Thus, Elevator Operator in a “large resort hotel” (.81),
Department Store Clerk (.47), and Fruit Picker in the nearby vacationland
of southern British Columbia (.41) all hold promise of an active social life.

Factor IV—Self-Fulfillment (2.6 percent). This factor is similar to Factor
II, Altruism, of the male sample. The inclusion of Clothing Model (.73), in
addition to Playground Worker (.73) and Volunteer Social Worker (.40),
suggests an expanded preference for such jobs as will allow the realization of
a more personal kind of success, as well as those that provide a focus for
altruistic drive.

Factor V—Responsibility (1.0 percent). Loadings were Customs Officer 
(.83), Tour Guide (.48), Research Assistant (.43), Flag Girl (.25). 


Factor VI—Education (1.0 percent). The high factor loadings on Library 
Assistant (.81) and Research Assistant (.49), and the lesser loadings on De- 
partment Store Clerk (.35) and Volunteer Social Worker (.33) suggest a 
preference for jobs in which a learning environment is maintained. 


3. Comparisons 


The employer responses to the attitudinal items are presented in Appendix 
A, and the student responses in Appendix B. Males reported expending a 
significantly greater effort in looking for jobs both where the sex comparison 
involved all students and where the comparison was restricted to those who 
had registered for work with the student employment division of the Canadian 
government’s Manpower Centre (C.M.C.). While the overall chi-square
analyses by level of education were significant for both C.M.C. registrants and 
all students, subanalyses indicated that high school students reported expending 
significantly less effort than either the college or university students, regardless 
of whether or not they had registered. No differences were found between the 
college and university students in either analysis. Finally, a highly significant 
difference (χ² = 362.6, df = 4, p < .0001) with respect to reported effort
was found between those registered with C.M.C. and those not registered.
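The chi-square comparisons above are tests of independence on a contingency table, with degrees of freedom equal to (rows - 1) × (columns - 1). A minimal sketch follows, assuming five effort categories crossed with registration status, which gives the df = 4 reported. The cell counts are hypothetical: only the group sizes (712 registered, 1480 not registered) come from Appendix B, so the statistic printed will not equal the published value. The name `chi_square` is chosen here for illustration.

```python
def chi_square(table):
    """Return (statistic, df) for a chi-square test of independence.

    `table` is a list of rows of observed counts; all rows have equal length.
    """
    n_rows = len(table)
    n_cols = len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(n_rows)) for j in range(n_cols)]
    grand_total = sum(row_totals)
    if 0 in row_totals or 0 in col_totals:
        raise ValueError("every row and column needs at least one observation")
    statistic = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (table[i][j] - expected) ** 2 / expected
    df = (n_rows - 1) * (n_cols - 1)
    return statistic, df


# Columns: registered with C.M.C., not registered.
# Rows: no effort, slight, average, strong, very strong effort.
# HYPOTHETICAL cell counts; only the column totals (712 and 1480) are from Appendix B.
observed = [
    [20, 478],
    [60, 341],
    [200, 423],
    [250, 182],
    [182, 56],
]

stat, df = chi_square(observed)
print(f"chi-square = {stat:.1f}, df = {df}")
```

The same function applies to the sex and level-of-education comparisons by changing the columns of the table.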

Parental influence, while reported as “strong” or “very strong” by only 14 
percent of the entire sample, was significantly greater in the case of females. 
When influence was analyzed by level of education, the overall differences were 
significant. Subanalysis, however, revealed no differences between college and
university students, whereas both groups reported significantly less influence
than high school students.


D. DISCUSSION


The exploratory nature of the study would seemingly justify an initial
overdetermination of the factors. What is particularly noteworthy about the factor
structure is the relegation of the Dollar dimension to a position of marginal 
importance in the male sample. Whereas the Herzberg et al. (3) review shows 
wages ranked sixth of 10 in priority among career factors, its importance is 
virtually negligible in the case of summer employment. Thus, any notion that 
money dictates summer job preferences for males is without support in the 
present study. However, a fairly generous system of Federal and Provincial
grants and loans may have minimized the importance of summer earnings as
a requisite for continuation in the ensuing academic year. For females, the
potential income to be realized from a job assumes considerable importance in 
dictating preferences, second only to an Outdoors dimension. The higher
relative position of the Dollar dimension for females is probably more a reflec-
tion of the desperate financial plight of the girls than of any need to amass 
wealth. This point is given considerable support when one notes that the range 
of average summer earnings for women was 95¢/hour to $1.44/hour,
whereas that for men was $1.63/hour to $2.25/hour across the three
types of institutions. Similarly, the average hours of summer employment
ranged from 80 to 265 hours for girls and from 148 to 465 hours for males.

Opportunity for contact with consumers was of paramount importance
for Community College males who may be more career oriented than their 
counterparts in the other two institutions. With this exception the results were 
surprising in their emphasis upon ideals. The direction of preferences is clearly 
away from the practical and towards personal and social ideals. Altruism and
the outdoors constitute major motives for both sexes, although females further
prize jobs providing for social intercourse.

Seemingly, the summer job market is unique as a life experience, affording 
the student an opportunity to test his skills in a risk-free situation and to sample 
jobs on the basis of his or her “ideals” rather than the austere criterion of 
practicality. Kenneth Keniston (4) has portrayed the nature of work in con- 
temporary North American society as follows: “Work therefore assumes a 
new significance in technological society. It requires a dissociation of feeling, a 
subordination of passion, impulse, fantasy, and idealism before cognitive prob- 
lems and tasks. As breadwinners, most Americans neither find nor even seek 
‘fulfillment’ in their jobs” (p. 267). On the assumption that this observation 
is accurate and the situation deplorable, the summer job experience assumes 
considerable importance in the light of the present results. If, as Keniston 
asserts: “Work in our society is not intended to satisfy most of our deeper 
feelings and needs” (p. 268), then a very special function is performed by 
summer job experiences in providing a vehicle for the expression of altruistic 
motives and the satisfaction of needs not met by most career options. 

The present study has direct implications for the increasing number of 
government sponsored student summer work programs. If governments wish to 
develop programs of value to students, rather than projects designed merely 
to keep students off the streets, then those that stress the restoration of our 
environment and the upgrading of disadvantaged groups are well suited to 
this purpose. The motives of altruism and a preference for the outdoors would 
find full expression in programs of reforestation, wild life conservation, 
pollution cleanups, volunteer social work, expansion of recreational facilities,
etc.

Suggestions to the effect that students are generally not held in high regard
by employers are not supported in the present study. As indicated in Appendix
A, over 80 percent of the employers indicated satisfaction, with 47 percent
reporting being “very satisfied” with the performance of their student
employees. Where a degree of dissatisfaction was expressed, the bases of
dissatisfaction were rather equally spread over a number of sources. Perhaps more
importantly, there was no single source of dissatisfaction uniquely related to
students. Indeed they are sources one would expect to find attributed to
unsatisfactory workers of any stripe.

A partial explanation for the wide discrepancy noted above between average 
male and female summer earnings may be found in the employers’ response 
to item 3b of Appendix A. A surprising 39 percent of the employers indicated 
that they would not hire girls, even though they were qualified and could 
satisfactorily perform the job. While some reasons for refusal may be some- 
what legitimate (e.g., provision of duplicate shower facilities), such a high 
percentage figure undoubtedly reflects a traditional view of women in which 
women either cannot or should not perform on a par with men. 

As a methodological note, it should not be assumed that all students seek 
a summer job nor that they seek employment for the entire vacation period.
Student item 2 of Appendix B indicates that no more than 79 percent of the
students were willing to consider a job in any one month (college students in
July). Similarly, fewer than two-thirds of the university students were
available for employment in the peak months of July and August. Also, one cannot
assume that where students are seeking work, all try with equal or maximum 
vigor to find employment. The generally low level of enthusiasm for job 
hunting depicted in item 3 of Appendix B may be partially attributed to the 
lack of success experienced in previous years in that many students simply 
despair of finding work.

The use of registration as one criterion of motivation to find work is 
supported by the highly significant chi square between degree of reported effort
and whether or not a student registered with C.M.C. A more stringent
search criterion might additionally require one or more return visits to
the employment centre by the student.

The foregoing report can only be described as preliminary. While some
survey-type research has recently been conducted on this topic (1, 2), such
studies seldom appear in the social science literature, instead tending to
appear as internal reports within government or institutions. In view of the
professional literature available on careers and the increasing attention
given student summer employment programs by government, it seems to be an
area that merits considerably more investigation by social scientists.
APPENDIX A
EMPLOYER QUESTIONNAIRE RESPONSES


1. In general, how satisfied were you with the work of your student
employees this summer? very dissatisfied 3.3%; somewhat dissatisfied 6.1%;
neither satisfied nor dissatisfied 9.9%; somewhat satisfied 33.7%; very satisfied
47%.

2. If you indicated some dissatisfaction with the work of some of your 
student employees, kindly check off the reason(s) for your dissatisfaction: 
laziness 22%; poor attitude 22%; other 19%; poor appearance 15%; poor
job ability 12%; absenteeism 5%; lateness 5%. There were 41 instances
(7.4%) of dissatisfaction cited. 

3. (a) Of those summer jobs filled by male students this summer, how 
many of them could have been handled as well by female students? Responses 
not available. (b) If you had such a job(s) and a qualified female applied next
year, would you be willing to hire her? yes 61%; no 39%.


APPENDIX B 
STUDENT QUESTIONNAIRE RESPONSES 


1. To what extent were you influenced by your parents (or spouse, if 
applicable) in your choice of the kind of summer job you sought (N = 2193)?
not influenced 51.4%; slightly influenced 17.9%; somewhat influenced 16.6%;
strongly influenced 7.9%; very strongly influenced 6.2%.

Results of chi-square analyses of these data are as follows: male vs. female, 
χ² = 18.31, df = 4, p < .01; high school vs. college vs. university, χ² = 51.9,
df = 8, p < .001.

2. Place a check mark in each month during which you were available and 
ready to work. Table A shows percentage of students (N = 2253) who 
checked off each of the five spring and summer months. 

3. How hard did you actually try to find a job(s) for the summer (N =
2192)? made no effort 22.7%; made a slight effort 18.3%; made an average
effort 28.4%; made a strong effort 19.7%; made a very strong effort 10.9%.


TABLE A
PERCENTAGE OF STUDENTS AVAILABLE AND READY TO WORK
DURING VACATION MONTHS

                              Percentages
Students         April     May    June    July    August

High school         18      20      71     [?]        73
College             27      47      62      79        76
University          29      56      63      66        66
Chi-square analyses of data on registered Ss (N = 712) showed the following
results: male vs. female, χ² = 19.9, df = 4, p < .001; high school vs.
college vs. university, χ² = 27.5, df = 6, p < .001.

Chi-square analyses of data on nonregistered Ss (N = 1480) showed the 
following results: male vs. female, χ² = 33.6, df = 4, p < .001; high school
vs. college vs. university, χ² = 124, df = 6, p < .001.
A COMPARISON OF MECHANICALLY AND GROUP
INDUCED REINFORCEMENT OF 
VERBAL BEHAVIOR* 


University of Missouri at Kansas City 


MORTON GOLDMAN, NELLE WESTERGARD, AND JOHN G. KRETCHMANN


A. INTRODUCTION 


Since the discovery that verbal responses can be conditioned, two approaches 
to the delivery of reinforcement have become prominent (3, 4). One approach 
utilizes reinforcement of a very mechanical nature, where emphasis is on the 
reinforcement contingencies rather than on content or quality of verbalizations. 
Mechanical reinforcement is constant and consistent, but rigid and lacking 
in flexibility. The Bavelas et al. (1) study in which the talking of a
nontalkative person was increased by use of red and green lights is an illustration
of the mechanical reinforcement approach.

The other approach has utilized people as dispensers of reinforcement in a 
social setting. Persons serving in this capacity can be more flexible in their 
delivery and capitalize on ongoing social interaction. For example, a human 
reinforcer can draw out verbal behavior, instead of having to wait for it to 
occur before delivering reinforcement; he can make use of gestural cues and 
other methods of nonverbal communication, thus strengthening his reinforce- 
ment delivery; and he can guide the subject into discussion areas that the 
subject is knowledgeable about. Cieutat (2) with the help of a confederate 
increased the talking of two known nonparticipators in a graduate seminar 
discussion by looking at these students whenever they spoke and showing such 
signs of approval as nodding. 

The current experiment compared the effectiveness of (a) mechanically 
induced reinforcement similar to that used by Bavelas, (b) group induced 
reinforcement similar to that used by Cieutat, and (c) a combination of the 
two, in increasing the verbal output of a nontalkative person in a group dis- 
cussion situation. In a second experiment, the effectiveness of a skilled group 
induced reinforcement treatment was compared with that of the mechanically 
and group induced reinforcements. The format of both experiments was the
same. The performance of a nontalkative person was measured in three successive
discussion sessions: during the first session this person was chosen and
designated the target person (TP); he was then reinforced during the second
session; and the reinforcement was removed for the third session. 

The two experiments presented in this paper had three main purposes. The 
first was to find out if the different methods of reinforcement varied in their 
effectiveness in increasing talking. The second was to find out if the different
methods of reinforcement varied in their effectiveness in maintaining talking 
behavior once the reinforcement was removed. The third was to discover if 
the TP’s perception of himself and the other group members’ perceptions of 
him would be differentially affected by the different methods of reinforcement. 


B. METHOD—EXPERIMENT I


1. Subjects 


Both male and female students from introductory psychology classes vol- 
unteered for this experiment to fulfill an experimental participation require- 
ment in those classes. A total of 238 subjects were used; 30 subjects were 
selected as TPs and randomly distributed over the three experimental treat- 
ments, each treatment containing 10 TPs. The remaining subjects were used 
to help carry out the experiments as explained below. 


2. Apparatus 


Four wooden boxes, each with a red and a green light, were placed before 
four chairs around a discussion table in such a way that persons seated in each 
of the chairs could view their own lights only. The lights were controlled by
switches located behind an observation mirror in an adjoining room. A Lehigh
Valley event recorder with six channels was used to record the frequency and 
duration of each person’s talking during discussion sessions. 


3. Procedure


Since the study involved a series of discussion sessions, suitable topics that 
could generate 10 or 15 minutes of conversation had to be found. Seven pos- 
sible topics were pretested in group discussion situations, and three topics, 
generating about equal amounts of discussion, were chosen. One dealt with 
the problems faced by the head of a company doing a substantial amount of 
business in a black area of a large city. (A black citizens committee is threat- 
ening a boycott if he doesn’t rehire an employee fired by a supervisor who they 
say is very prejudiced, but a bloc of white employees say they will leave en 
masse if the man is rehired.) The second topic posed the question of whether 
steps can be taken by students at the high school level when a new teacher is
extremely poor, and, if so, what steps. The third topic concerned a school 
superintendent faced with circumstantial evidence that a high school teacher 
is a homosexual. 

The experimental period consisted of three discussion sessions, the first 10 
minutes long and the last two 15 minutes each in length. Each of the three 
discussion sessions was composed of different members, with the exception of 
the TP who was, of course, present in all three discussions.¹

Seven subjects were scheduled to arrive on the hour, four for use in the 
first session and three for use in the second session; three more were scheduled 
to come on the half-hour for use in the third session.² Of the subjects that
arrived on the hour, three or four (depending upon the number available)
were seated around the table in the discussion room, and the remainder were
taken to another room and asked to “wait a second.”

a. First session. The group seated at the table took part in the first
discussion session. They were given instructions to discuss the human relations
problem, on the completion of which they filled out the sociometric rating
scale, ranking every group member, themselves included, on amount of
participation, quality of ideas, effectiveness of guiding the discussion, and general
leadership ability. The person who talked the least in this session was chosen as the
target person (TP).

While the first session was in progress, those subjects waiting in the other
room who were to be used in the mechanically induced treatment were told
that there were only facilities at present for running one group at a time and
that, therefore, they would have to wait about 10 minutes. Those subjects
who were to be used in the group induced reinforcement treatment were asked
to be experimental confederates to help in getting a shy person, the TP, to
increase his talking. They were given instructions in the use of social
reinforcement for this purpose, such as appearing highly interested in anything the
TP had to say, nodding in approval, directing questions to him, and the like.
A short practice rehearsal period then followed.

1 Different experimental confederates were used for each discussion session for two
reasons. Experimental confederates used in the second session of the group induced
reinforcement and combination reinforcement were informed of the primary purpose
of the study and, therefore, could not be used in the final discussion session. To hold
this condition constant over all treatments, different subjects, with the exception of the
TPs, were used in all three discussion sessions for all three treatments. In addition,
any influence that increased familiarity with other participants might have on the TPs’
talking was controlled.

2 The discussion sessions were designed to consist of four subjects each. Unfortunately,
great difficulty was encountered in getting the correct number of subjects to
appear at the correct time. The mechanically induced treatment had 22 discussion
sessions with three subjects and eight discussion sessions with four subjects; the group
induced treatment had 20 discussion sessions with three subjects and 10 discussion
sessions with four subjects; and the combination treatment had 20 sessions with three,
and 10 sessions with four subjects. No significant differences were noted between the
scores of TPs in sessions with three and of TPs in sessions with four subjects.

The experimental confederates to be used in the combination treatment 
were also told, in addition to the above instructions, that the TP would be 
seeing a panel of lights as another form of feedback. 

b. Second session. When the subjects in the first session had finished the
sociometric ratings, the TP was asked to remain, and the others were taken
to another room and dismissed. The waiting subjects were then brought in and
seated at the table with the TP for the second discussion session. One of the
three reinforcement treatments was introduced during this second session.

For the mechanically induced reinforcement treatment, the subjects were
again told that they would be discussing a human relations problem. They 
were also informed that they would be receiving feedback to let them know 
how well they were doing. The following instructions were given: 

You see in front of you the box with the red and green lights. These 
are going to be used as feedback lights to tell you personally how well 
you are doing. We have learned from prior studies that certain kinds of 
statements are usually helpful in getting a solution to a problem, whereas 
other kinds of statements are actually hindrances. When you receive a 
green light, that means you are doing something helpful, you are on the 
right path. But if you get a red light, it means that what you are doing is 
likely to hinder rather than help the group reach a conclusion. It is con- 
ceivable that even remaining silent when you might have been clarifying 
a point that had been made earlier is a hindering type of behavior. This 
would rate a red light, indicating you should have said something at that 
point. On the other hand, silence might be good when talking would serve 
to confuse a good point that had already been made. Since it is often im- 
possible for us to determine the effect that a single statement or thought 
will have on the group discussion, a feedback light might be referring 
to the cumulative effect of two or three successive contributions to the dis- 
cussion. These lights are meant as a personal help, so please do not try 
to see what color lights someone else is getting and don’t interrupt the 
ongoing discussion to talk about the lights.


During the treatment, the TP received a red light when he was silent for
too long and a green light when he talked. The others received green lights
for being silent and red lights for talking. They would, however, receive a
green light for asking the TP a question or for responding approvingly to a
statement made by the TP. At the conclusion of the mechanical treatment, the
same sociometric test used in the first session was administered, with the
addition of a question on the helpfulness of the lights.
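The light contingencies of the mechanically induced treatment can be summarized as a small decision rule. The sketch below is an illustration only: the function name `feedback_light`, its parameters, and the 30-second silence limit are assumptions, since the text says only that the TP received a red light when silent "for too long," and in the experiment the lights were switched by hand from the observation room.

```python
from typing import Optional


def feedback_light(
    is_target: bool,
    talking: bool,
    addressing_target: bool = False,
    silent_seconds: float = 0.0,
    silence_limit: float = 30.0,  # HYPOTHETICAL threshold; not given in the text
) -> Optional[str]:
    """Return "green", "red", or None (no light) for one participant."""
    if is_target:
        if talking:
            return "green"  # the TP is reinforced for talking
        # The TP gets a red light only after a prolonged silence.
        return "red" if silent_seconds > silence_limit else None
    if talking:
        # Other members are discouraged from talking unless they are
        # questioning the TP or responding approvingly to him.
        return "green" if addressing_target else "red"
    return "green"  # other members are reinforced for silence


print(feedback_light(is_target=True, talking=True))
print(feedback_light(is_target=False, talking=True))
print(feedback_light(is_target=False, talking=True, addressing_target=True))
```

In the combination treatment only the TP's branch of this rule would apply, because the other members were confederates and received no lights.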
For the combination treatment, the same instructions were given as for the
mechanically induced treatment. However, the other group members were also 
experimental confederates. The TP thought everyone at the table was receiving 
feedback lights, when, in actuality, he was the only one receiving them. He re- 
ceived a green light for talking and a red light for keeping silent. At the end of 
the session, the same sociometric tests used in the mechanically induced rein- 
forcement treatment were administered. 

For the group induced reinforcement the same instructions were given as in 
the combination treatment, omitting the instructions for the lights, which were 
inoperative. On the conclusion of these sessions the sociometric test was ad- 
ministered. 

c. Third session. After the collection of these sociometric ratings, the 
procedure was once again the same for all three treatments. The other 
participants left, and new subjects who had signed up for the half-hour were 
brought in. The third discussion session was begun with the same instructions 
as were used for the first session. At the conclusion of the session sociometric 
ratings were again taken, and the TP was asked to indicate which of the three 
sessions he had liked the best, which second best, and which the least. The 
subjects were then dismissed. 

Thus, for each TP a measure was taken of duration of talking in a pre- 
treatment (first) session, a treatment (second) session during which rein- 
forcement was administered in one of three ways, and a posttreatment (third) 
session once more without reinforcement. 


C. RESULTS—EXPERIMENT I


Data were collected for talking duration, for subjects’ rating of the TP, 
and for the TP’s ratings of himself. 


1. Talking Duration 


During the discussions an experimenter recorded the frequency and duration 
of the talking of each group member on an event recorder. The total duration 
of talking by each person for any one session could then be determined by 
adding the total number of inches of talking recorded for that person in that 
session. The totals for each person in a given session were then added to get 
a grand total of talking by the group. The units of measure used were left 
in inches, since there was no need to convert them into minutes. The total 
time that the group spent talking was then divided into each individual sub- 
ject’s total to get his percentage of contribution to the total talking time. 

Some discussion sessions had three participants, and some had four. If each 
person had been contributing equally in the groups of three, each would have
talked 33 percent of the total talking time. If each person had contributed
equally in the groups of four, each would have talked 25 percent of the total.
Therefore, to equate the scores of persons in groups of three with those in
groups of four, 33 percent or 25 percent, respectively, were subtracted from
the percent score that each person had. A resulting plus score indicated that
a participant was talking more than his equal share, and a minus score
indicated that he was talking less than his equal share.
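The scoring just described can be sketched in a few lines: each person's recorded inches are converted to a percentage of the group total, and the equal share for the group size is subtracted. The inch values and participant labels below are hypothetical, and the function names are chosen for illustration. The sketch subtracts the exact equal share (100 divided by group size) rather than the rounded 33 percent used in the text.

```python
def talking_shares(inches):
    """Convert each person's recorded inches of talking to a percent of the group total."""
    total = sum(inches.values())
    if total <= 0:
        raise ValueError("the group total must be positive")
    return {name: 100.0 * value / total for name, value in inches.items()}


def deviation_from_equal_share(shares):
    """Subtract the equal share (100 / group size) from each person's percent score."""
    equal_share = 100.0 / len(shares)
    return {name: share - equal_share for name, share in shares.items()}


# HYPOTHETICAL event-recorder totals, in inches, for a three-person session.
inches = {"TP": 6.0, "member_a": 18.0, "member_b": 24.0}

shares = talking_shares(inches)
deviations = deviation_from_equal_share(shares)
for name in inches:
    print(f"{name}: {shares[name]:.1f}% of talking, {deviations[name]:+.1f} vs. equal share")
```

A minus score, as for the TP here, marks a participant who talked less than an equal share; because the scores are relative to group size, three-person and four-person sessions can be pooled.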

Three scores were so obtained for each TP: a pretreatment score, a treat- 
ment score, and a posttreatment score. Each of these measures was a percentage 
score above or below the TP’s equal share of talking as determined by group 
size. The score for each subject in the presession was then subtracted from 
his score for the treatment session. This treatment session minus presession 
difference score indicated how many percentage points difference in talking 
there were in the treatment as compared to the presession. This measure was 
indicative of the effectiveness of the treatment in increasing talking. Another 
difference score, postsession minus presession, was computed for each subject. 
This measure indicated a difference in percentage points of talking between 
the pre- and postsessions. It was a measure of the change in duration of talking 
between pre- and postsessions. 
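As a worked illustration of the two difference scores, the sketch below takes one TP's three equal-share scores and returns treatment minus pre and post minus pre. The input values are hypothetical and the function name is an assumption.

```python
def difference_scores(pre, treatment, post):
    """Return the two difference scores computed for each target person."""
    return {
        "treatment_minus_pre": treatment - pre,  # gain while reinforcement is applied
        "post_minus_pre": post - pre,  # change remaining after reinforcement is removed
    }


# HYPOTHETICAL scores, in percentage points above or below an equal share.
scores = difference_scores(pre=-20.0, treatment=12.0, post=-4.0)
print(scores)
```

For this hypothetical TP, talking rose 32 percentage points under treatment, and 16 points of that gain remained in the postsession.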

Thus two sets of difference scores, treatment minus pre, and post minus 
pre, were obtained for each of the 10 subjects in each of the three treatments, 
for a total of 30 TPs. The mean difference scores for the subjects in each 
treatment were computed and are presented in the first three rows of Table 
1. The fourth row of Table 1 will be considered later. 

The first column in Table 1 shows the mean treatment minus pre scores for
subjects in the mechanically induced, the group induced, and the combination
treatments. The second column of Table 1 shows the mean post minus pre
scores for subjects in the mechanically induced, group induced, and combination
treatments. A one-way analysis of variance over the treatment minus
pre means across the three treatments yielded an F = .580, df = 2/27, which
was nonsignificant. Similarly, a one-way analysis of variance over the post
minus pre means across the three treatments yielded an F = .049, df = 2/27,
which was also nonsignificant. Thus differences found among the means of
the three treatments were all well within chance variations.


TABLE 1
MEAN DIFFERENCE SCORE FOR PERCENT OF TIME TALKED BY TARGET PERSON

                             Treatment       Post
Treatment*                   minus pre     minus pre

Experiment I
  Mechanically induced          31.7          15.5
  Group induced                 28.8          16.6
  Combination                   34.6          14.7
Experiment II
  Improved group                40.9          12.5

* n = 10 for each treatment group.
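For readers who wish to see how such an F ratio is formed, a minimal sketch of a one-way analysis of variance follows. The individual difference scores are not reported, so the data below are hypothetical; the function name is an assumption. With three treatments of 10 TPs each, the degrees of freedom are 2 between groups and 27 within groups.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of groups of scores."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    # Between-groups sum of squares: group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-groups sum of squares: scores around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between = k - 1
    df_within = n_total - k
    if ss_within == 0:
        raise ValueError("F is undefined when there is no within-group variation")
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# HYPOTHETICAL treatment-minus-pre scores for three small groups.
example = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
f_ratio, df_b, df_w = one_way_anova(example)
print(f"F = {f_ratio:.3f}, df = {df_b}/{df_w}")
```

An F well below 1, as reported for both analyses, means the treatment means differ less than would be expected from the variation among TPs within a treatment.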


2. Sociometric Measures 


At the conclusion of each discussion session the participants ranked each 
other and themselves on four items having to do with amount of participation, 
quality of ideas, effectiveness in guiding the discussion, and general leadership 
ability. Thus each TP received from each of his fellow group members in 
every session a ranking of from 1 to 3 or from 1 to 4, depending on group 
size, on each of the four items. In all cases 1 was the highest ranking and 3 
or 4 the lowest. To equate rankings from groups of three and groups of four, 
the rankings of groups of four were all divided by 1.33. Since Bavelas et al. 
(1) found that all of these items were highly correlated, the rankings by the 
other group members of the TP were averaged for each item, and then the 
mean rankings for each item were again averaged to give the TP one mean- 
ranking-from-others score for each session. The TP’s self-rankings on each 
item were also averaged to give the TP one mean-self-ranking score for each 
session. The first three rows in the left half of Table 2 give the mean-ranking- 
from-others in each of the three sessions for the mechanically induced, group 
induced, and combination treatments. The first three rows of the right half 
of Table 2 give the mean-self-ranking in each of the three sessions for the 
mechanically induced, group induced, and combination treatments. The fourth 
row of Table 2 will be considered later. 
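
The scoring procedure just described (equating ranks across group sizes, averaging within each item, then averaging across items) might be sketched as follows. The function names and the sample ranks are hypothetical; only the divisor of 1.33 comes from the text.

```python
from statistics import mean


def equate_rank(rank, group_size):
    """Divide ranks from four-person groups by 1.33, as the text describes.

    Ranks from three-person groups are returned unchanged.
    """
    return rank / 1.33 if group_size == 4 else float(rank)


def mean_ranking_from_others(ranks_by_item, group_size):
    """Average the others' ranks within each item, then across the items."""
    item_means = [
        mean(equate_rank(r, group_size) for r in ranks)
        for ranks in ranks_by_item
    ]
    return mean(item_means)


# Hypothetical ranks given to one TP by two fellow members on the four items.
ranks = [[2, 3], [3, 3], [2, 2], [3, 2]]
print(mean_ranking_from_others(ranks, group_size=3))
```

Under this rule the lowest rank in a four-person group (4) maps to about 3.01, close to the lowest rank of 3 in a three-person group, while the top rank of 1 maps to about 0.75.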

TABLE 2 
TARGET PERSONS’ RANKINGS ON SOCIOMETRIC TESTS 
(1 = HIGHEST RANKING; 3 = LOWEST RANKING) 

                          Mean-ranking-from-others     Mean-self-ranking 
                                  Session                   Session 
Treatment*                   1       2       3         1       2       3 
Experiment I 
  Mechanically induced     2.44    1.65    1.96      2.73    1.56    2.14 
  Group induced            2.54    2.12    1.79      2.53    2.02    1.94 
  Combination              2.26    1.65    1.95      2.56    1.63    1.84 
Experiment II 
  Improved group           2.69    1.21    2.12      2.52    1.63    1.93 

* n = 10 for each treatment group. 


Two main questions were of concern regarding the mean-rankings-from-others 
and mean-self-rankings. The first asked if the rankings from self and 
others differed significantly from session to session within the same treatment. 
To analyze the mean-rankings-from-others, t tests for independent means were 
carried out between the mean scores for sessions one and two, sessions one 
and three, and sessions two and three for each treatment. To analyze the 
mean-self-rankings, t tests for correlated means were used for the above 
comparisons. It was found that in all three treatments both the 
mean-rankings-from-others and mean-self-rankings improved significantly from 
the pretreatment to the treatment session. In four out of six comparisons 
there was a significant increase from the pretreatment to the posttreatment 
session. However, there was no significant difference between the treatment 
and posttreatment sessions. These results paralleled those of Bavelas et al. 
(1) who found that ratings of the TP were significantly higher for the 
reinforced discussion than for the prereinforced sessions. 

The second and more interesting question asked whether rankings for the 
same session differed significantly from treatment to treatment. To analyze 
whether the mean-rankings-from-others or the mean-self-rankings were 
differentially affected by the treatment in session one, in session two, or in 
session three, an analysis of variance across treatments for 
mean-rankings-from-others and mean-self-ranking in each session was run. None 
of the Fs obtained was significant. 

Both TPs and nonTPs were asked at the end of the second session in the 
mechanically induced treatment whether they felt the lights helped or did not 
help the discussion. Five of the TPs responded that the lights were helpful, 
and five stated that the lights did not help; on the other hand, three nonTPs 
indicated that the lights were helpful, while 17 stated that the lights did not 
help. A 2 X 2 chi square analysis for these results yielded a chi square of 
4.176 (df = 1, p < .05). The results indicate that a significant percentage of 
nonTPs felt that the lights did not help the discussion, whereas the TPs were 
evenly divided over whether the lights helped or not. 
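
As a check on the arithmetic, the reported chi square can be recomputed from the four counts given above. The sketch below uses the standard shortcut formula for a 2 X 2 table without a continuity correction; that the authors used the uncorrected statistic is an assumption, although it does reproduce the reported value. The function name is illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi square for the 2 X 2 table [[a, b], [c, d]], uncorrected."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Rows: TPs, nonTPs. Columns: lights helped, lights did not help.
chi2 = chi_square_2x2(5, 5, 3, 17)
print(round(chi2, 3))  # 4.176

# The critical value for df = 1 at the .05 level is 3.841.
print(chi2 > 3.841)  # True
```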


D. Discussion—Experiment I 


During the running of Experiment I, it was noticed that the experimental 
confederates varied greatly in their ability to increase the TPs’ talking. 
These confederates consisted of introductory psychology students. They were 
given only a few minutes of instruction and a little guided practice before 
beginning the treatment discussion sessions. Different TPs had different 
experimental confederates. Some confederates were extremely skillful at 
getting the TP to talk, others only moderately so; and some, seeming to forget 
their purpose, were extremely poor and spent most of the time talking 
themselves. Therefore, it was felt that having skillful experimental 
confederates for all TPs would more uniformly increase the talking of the TPs. 
A second experiment was carried out where all the TPs had the same two 
experimental confederates, persons who had previously shown their talent for 
delivering social reinforcement and getting the TP to increase his talking 
behavior. 


E. Method—Experiment II 

The procedure in Experiment II was an exact replication of the group 
induced reinforcement treatment, with the exception that the same confederates 
were used with every TP. A total of 54 subjects, 10 of these TPs, were 
recruited from the same population as in Experiment I. 

Two experimental confederates, one male and one female, who had shown 
themselves to be especially adept at getting a TP to talk in Experiment I were 
asked and consented to serve as dispensers of reinforcement for 10 additional 
TPs. They received additional practice and instructions based on the experi- 
menters’ observations of 20 teams of confederates at work. 

Four subjects were scheduled to arrive on the hour. The two confederates, 
who also arrived on the hour with the other subjects, acted as if they too had 
signed up for that time and were ignorant of what the experiment was to be 
about. 

On the pretext of dividing everyone into two discussion groups, the experi- 
menter took the confederates into the back and seated the remaining subjects 
around the discussion table. From this point on, the procedure was exactly 
the same as that for the group induced reinforcement. A TP was picked from 
the first discussion session. The confederates reinforced the TP for talking 
during the second session, and new subjects were again brought in for the 
final discussion. Sociometric ratings and talking behavior measures were 
obtained for all subjects after each session. 


F. Results—Experiment II 

1. Talking Duration 


As in Experiment I, two different scores were computed for each TP: a 
treatment minus prescore indicating how many percentage points difference in 
talking there were between the treatment and presessions, and a post minus 
prescore indicating how many percentage points difference in talking there 
were between the post- and presessions. The mean difference scores for the 
TPs in the improved group induced treatment are shown in the last row of 
Table 1. 

Experiment I had already shown that the group induced reinforcement 
treatment was not significantly better or worse than the mechanically induced 
treatment in increasing or maintaining talking behavior. The main question of 
Experiment II was whether a skilled group induced treatment would be more 
effective than the group induced or the mechanically induced treatments. 
Therefore, one-tailed tests were used. 

The test for differences between the mean treatment minus prescores for 
the mechanically induced and the improved group induced treatments yielded 
a t of 1.732 (df = 18, p < .05 one-tailed). The t between the mean treatment 
minus prescores for the group induced and the improved group induced 
treatments was 2.865 (df = 18, p < .01 one-tailed). The ts between the 
mean post minus prescores for the mechanically and improved group treatments, 
and the mean post minus prescores for the group and improved group 
induced treatments, were not significant. Thus, a significant difference was 
found between the improved group treatment and the mechanically and 
group induced treatments in increasing talking behavior, but not in maintaining 
it. 
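
For readers who wish to see how t values of this kind arise, the sketch below computes an independent-means t from summary statistics. It uses the means in Table 1 together with the standard deviations reported in the Discussion of Experiment II (14.7, 10.58, and 8.54). It assumes those are sample standard deviations with n = 10 per group and that a pooled-variance t was used; neither is stated in the text, and the function name is illustrative.

```python
from math import sqrt


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Independent-means t with pooled variance; returns (t, df)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, df


# Improved group versus mechanically induced (treatment minus pre).
t_mech, df_mech = pooled_t(40.9, 8.54, 10, 31.7, 14.7, 10)
# Improved group versus group induced (treatment minus pre).
t_group, df_group = pooled_t(40.9, 8.54, 10, 28.8, 10.58, 10)
print(round(t_mech, 2), df_mech)    # about 1.71 with df = 18
print(round(t_group, 2), df_group)  # about 2.81 with df = 18
```

Under these assumptions the results come out near, but not identical to, the reported values of 1.732 and 2.865. The small gaps may reflect rounding of the published means and standard deviations or a slightly different variance estimate.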

Comparisons were not made between the skilled group and the combination 
treatments because these two are not truly comparable. A more appropriate 
comparison would be between the combination treatment and a skilled 
combination treatment using both lights and the same skilled confederates for 
all TPs. 


2. Sociometric Measures 


The mean-ranking-from-others of the TP in each of the three sessions for 
the skilled group induced treatment is shown in the last row of the left half 
of Table 2. The last row in the right half of Table 2 gives the 
mean-self-ranking of the TP in each of the three sessions for the skilled 
group induced treatment. 

Again, t tests indicated that mean-rankings-from-others and 
mean-self-rankings differed significantly from session to session. And again 
the more interesting question was whether rankings for the same session 
differed significantly between treatments. In a further analysis t tests for 
independent means were carried out between mechanically and skilled group 
induced reinforcements, and between group induced and skilled group induced 
reinforcements for each session, on both the mean-rankings-from-others and 
mean-self-rankings. The t between the mechanically induced and skilled group 
induced treatments for session two, mean-ranking-from-others, was 2.938 
(df = 18, p < .01 one-tailed). The t between the group induced and skilled 
group induced treatments for session two, mean-ranking-from-others, was 4.945 
(df = 18, p < .01 one-tailed). All of the other t comparisons were 
nonsignificant. In summary the sociometric ratings of the TP in the skilled 
group treatment differed significantly from the mechanically and group induced 
treatments only in session two, and only in the mean-ranking-from-others. 


G. Discussion—Experiment II 


Experiment II was a comparison of the effectiveness of the mechanically and 
group induced reinforcement treatments with that of a skilled group induced 
treatment. It was found that the skilled group induced treatment was signifi- 
cantly more effective than either the mechanically or group induced treatments 
in increasing talking behavior, but not in maintaining talking when the 
reinforcement was removed. Also on the sociometric measure 
(mean-ranking-from-others), the TPs in the skilled group induced treatment had 
higher ratings than those in either the mechanically or group induced 
treatments. Therefore, although it was not shown that randomly picked 
confederates are more effective than mechanical reinforcement, it was shown 
that highly skilled confederates are more effective than either the random 
confederates or the mechanical treatment. It might also have been shown that 
very poor experimental confederates were less effective than the mechanical 
treatment or the random confederates. 

There was less within treatment variability on the talking behavior for the 
skilled group treatment minus prescores with a standard deviation of 8.54, as 
compared with 14.7 for the mechanically and 10.58 for the group induced 
treatments. However, the standard deviations were not significantly different 
and indicated only a trend towards decreasing variability. It was also observed 
that a TP’s original score was no indication of how much his talking would 
increase in the treatment session. 

Generally, the TP’s ranking of himself rose in the treatment session and fell 
again in the postsession, but not to its original level. The others’ rankings 
of the TP paralleled his own, rising in the treatment session and falling 
again in the postsession. The session to session changes in ratings were 
significant in all four treatments. 

One problem of research in this area is that there are so many variables to 
consider: the length of the discussion session, for example, the topics 
discussed, and the two-way nature of the problem. That is, if the TP is going 
to talk more, the other participants are going to have to allow him more 
“talking space” and be polite enough not to interrupt or cut him off. Bavelas 
et al. (1) found that reinforcing the TP alone for talking was not enough; the 
talking of the other group members had to be suppressed with red lights. A 
“big talker” who monopolizes the conversation, or a very aggressive or even 
rude talker, in the postsession could (and did) keep the TP and everyone else 
present from having a chance to speak. On the other hand, two rather quiet, 
submissive people would allow the TP, fresh from his treatment session, to 
dominate. The experimental design attempted to randomize such events over 
all three treatments. Perhaps a more refined study would include control of 
the talkativeness of all members of all groups, not just TPs. 

The relationship between changes in self-image and perception by others, 
and the maintenance of change in talking, might also be investigated. The TP’s 
knowledge of what was going on in the reinforcement treatment could be varied 
in a number of ways, and then his self-rankings for these various states of 
knowledge could be compared, as well as his maintenance of talking in the 
postsession. As an example, the effects of telling the TP between treatment and 
postsessions that he had improved a great deal could be compared with telling 
the TP that he had talked more but only because of the experimental “stooges.” 
The effects to be compared would be reflected in his talking behavior and his 
self-rankings for the postsession. 


H. Summary 


Experiment I compared the effectiveness of mechanically induced, group 
induced, and a combination of mechanically and group induced reinforcements 
in increasing and maintaining the talking behavior of nontalkative target 
persons. No significant differences were found between the three reinforcement 
treatments in increasing or maintaining talking. 

Experiment II compared the effectiveness of a skilled group induced 
reinforcement treatment with that of the original group induced reinforcement 
and that of the mechanically induced. The improved group induced reinforcement 
was significantly more effective in increasing talking behavior than the 
mechanically or group induced treatments, but not in maintaining it. 
Sociometric ratings taken after each discussion in both experiments indicated 
that the more the target persons talked, the higher the ratings they received 
from others and themselves. 


POSITIVE CONTRAST WITH SACCHARINE AND 
REPEATED INCREMENTS IN REWARD 
MAGNITUDE IN THE RAT* 


University of Melbourne, Australia 


LAWRENCE WEINSTEIN¹ 


A. Introduction 


Positive incentive contrast effects are said to occur when an increase in 
incentive magnitude results in a level of behavior that is significantly greater 
than the performance of a control group that has been continually exposed to 
the first, smaller, incentive magnitude. While a number of recent investigations 
with animals have obtained positive contrast effects consequent to an incre- 
ment in quantity of reward (e.g., 2, 4, 5, 6), little is known concerning what 
causes the phenomenon. The present studies examined the notion that positive 
contrast is explained on the basis of an appetitive motivational emotional 
response. 

According to this view an increase in amount of reward results in an 
appetitive emotional response, such as happiness or joy, which produces internal 
cues that energize the instrumental response, thus resulting in an increase in 
performance (positive contrast effects). 

If upward contrast effects are caused by affective factors, one would expect 
that a neutral stimulus, which is initially paired with an increase in incentive 
magnitude that produces positive contrast, would subsequently result in a 
reliably higher level of behavior than a cue that is initially associated with the 
control group’s higher reward magnitude. Such results would be in accord with 
the notion that an appropriate increase in incentive magnitude in animals pro- 
duces some form of appetitive emotional response which accounts for positive 
contrast effects. Experiment 1 examined the effects on the runway behavior of 
rats of an auditory stimulus associated with an increase in reinforcement mag- 
nitude, as compared to an auditory cue paired with the control group’s larger 
quantity of reward. 

According to such an explanation of positive incentive contrast effects, Ss 
who experience an increase in amount of reward have emotional cues 
conditioned to the ongoing response. If this is the case, a second increase in 
incentive magnitude should produce familiar emotional cues; that is to say, Ss 
should now be less sensitive to the facilitating effects of the emotional cues 
than Ss who do not experience a prior increase in amount of reinforcement. A 
second increase in reward magnitude should result in little, if any, positive 
contrast effects, as compared to Ss who experience this second increase in 
reward quantity subsequent to no variation in magnitude of reinforcement. If 
positive contrast effects are not found in Ss who experience an increase in 
incentive size previous to the reward increase experienced in the test 
situation, and are found in animals who are exposed to no change in quantity 
of reinforcement before the test situation, then these results would be 
consonant with the idea that upward contrast effects in animals are due to 
some form of appetitive motivational state. Experiment 2 examined the effects 
of an increase in incentive magnitude subsequent to an increase in quantity of 
reinforcement, as compared to Ss who experienced the increase in reward 
magnitude after no upshift in amount of reinforcement. 


B. Experiment 1 

1. Method 


a. Subjects. The Ss, 80 male, experimentally naive, albino rats, 92-11 
days old at the start of the experiment, were randomly assigned to each of four 
groups. 

b. Apparatus. The operant conditioning box had plywood sides; 
Plexiglass top, back, and door; and a grid floor. It measured 305.8 x 254 x 
254 mm. The operant box is more fully described in the Weinstein article (8). 

The auditory signal, a 1500-Hz, 69-db SPL, 3-second tone, produced by an 
audio oscillator, was presented by a hand switch through a 76.2-mm, 8-ohm 
speaker mounted 25.4 mm above the key. 

Another apparatus was a straight alley runway. The floor of the runway was 
made of plywood 9.5 mm thick. Walls were 20.3 cm high and painted 
gray; each of three sections was covered with a hinged Plexiglass lid. The 
30.5-cm start box and the 122-cm alley, 12 cm wide, were separated by a 
guillotine Plexiglass door. A Plexiglass guillotine door separated the alley and 
the goal box, the latter being 20.3 cm wide and 30.5 cm from front to back. A 
76.2-mm, 8-ohm speaker was mounted on the far end of the goal box wall 
2.54 cm above the floor. 

When E pulled a counterweight, the start box opened and started a clock 
(Hunter, Model 1520). When S entered the goal box, the leading edge of 
the floor was depressed 2.5 mm, stopping the clock and measuring running time. 

c. Procedure. Saccharine concentration treatments of .01% and 1.2% by 
weight were used with two shift and two unshifted groups. One group 
experienced 1.2% saccharine throughout the 21 sessions of the study (1.2-1.2); 
another group received 1.2% through session 14 followed by 1.2% plus the 
3-second tone for the remainder of the study (1.2-1.2T); other Ss experienced 
.01% through session 14 followed by 1.2% for the remainder of the study 
(.01-1.2); and, finally, some rats received .01% through session 14 followed 
by 1.2% plus the tone for the remainder of the study (.01-1.2T). 

Following several days of adaptation to a 23-hour food and water depriva- 
tion schedule, Ss were trained on successive days to press the bar for con- 
tinuous reinforcement; with each depression the bar operated the magazine. 
Each session lasted for 40 reinforcements (dipper presentations). Throughout 
the pre- and postshift phases of the study all Ss were maintained on 23-hour 
food and water deprivation. After his daily, preshift, operant chamber session, 
each .01-1.2T and 1.2-1.2T S received 20 minutes of free exploration in the 
runway followed by access to food and water in the home cage for one hour. 
Other animals received food and water for one hour following their experi- 
mental periods. Body weights were recorded daily. When S was placed in the 
start box at the beginning of each 20-minute free exploration period, his run- 
ning time to enter the goal box was recorded. 

On session 15, E manually activated the 3-second tone once per dipper 
presentation when S’s tongue initially touched the dipper. The E was seated 
in the darkened room beside the operant box. No runway experience was given 
in the postshift. 

Immediately after session 21, within 10 minutes, the speaker in the goal 
box was connected, and each tone S was given 20 trials in the straight alley. 
Animals were tested in the runway under the same deprivation conditions as 
in the operant box. Each trial was begun by placing S in the start box and 
opening the door 15 seconds later, regardless of S’s orientation. When S’s two 
front feet were in the goal box, E manually activated the 3-second postshift 
tone. After the tone offset, Ss were allowed to remain in any part of the 
runway for 5 seconds; they were then picked up and placed in the start box. No 
S failed to traverse the runway and enter the goal box. The E, seated behind 
the goal box, operated the hand switch for the tone and recorded running 
times. 


2. Results 


Body weight data (not presented here) indicated no significant difference 
between the four groups during the pre- or postshift. 

The average number of bar presses made per minute was examined in the 
analysis of the operant box data. From Figure 1 it is clear that from sessions 
1-14, 1.2% saccharine produced a higher rate of bar pressing than .01%, the 
shift groups responded at about the same level, and the control groups 
performed at about an equal level. The mean number of bar presses per minute 
from sessions 1-14 differed reliably among the four groups by analysis of 
variance, F (3, 76) = 3.04, p < .05. By Duncan’s comparisons the difference 
between 1.2-1.2T and 1.2-1.2, and between .01-1.2 and .01-1.2T, 
was each not significant (p > .05), while the difference between 1.2-1.2 and 
.01-1.2, and between 1.2-1.2T and .01-1.2T, was each reliable (p < .05). 

Figure 1 indicates that on session 15 the shift groups suddenly increased 
their response rate to an equal level above their respective control groups 
(equal positive contrast effects). The mean number of bar presses from 
sessions 15-21 differed significantly among the four groups by analysis of 
variance, F (3, 76) = 3.21, p < .05. By Duncan’s comparisons the difference 
between 1.2-1.2 and .01-1.2, and between .01-1.2T and 1.2-1.2T, was each 
significant (p < .05), while the difference between the two shift groups and 
the difference between the two control conditions each failed to be reliable 
(p > .05). 

FIGURE 1 

FIGURE 2 

Mean running speed for the 20 trials was the response measure examined 
in the analysis of the runway data. For the 14 free exploration trials, the mean 
running speed was analyzed by an analysis of variance which revealed no 
significant difference between .01-1.2T and 1.2-1.2T, F (1, 38) = .62, p > 
.05. In other words, the groups ran at the same speed before the introduction 
of the tone. 

Figure 2 clearly indicates that Ss who received the tone paired with an 
increase in incentive concentration took less time to traverse the runway than 
their control group. Analysis of variance revealed a significant difference be- 
tween the two tone groups, F (1, 38) = 10.23, p < .01. 


C. Experiment 2 


Experiment 1 indicated that a neutral stimulus paired with an increase in the 
concentration of a saccharine solution subsequently produced a higher level of 
responding than a stimulus initially associated with the more concentrated 
solution. These data concur with the notion that positive contrast effects in 
infrahuman organisms are caused by some appetitive emotional motivational 
state. Experiment 2 further examined this emotionality hypothesis by attempting 
to determine the effects of an increase in the concentration of a saccharine 
solution subsequent to an upshift in saccharine concentration, as compared to 
Ss who experience an increase in concentration after no prior upshift. 


1. Method 


The Ss were 45 male, albino, experimentally naive rats. The details of age, 
housing, operant chamber apparatus, and deprivation were the same as in 
Experiment 1 except that the speaker was removed from the chamber. Fifteen 
Ss were randomly assigned to each of three groups. One group received a 
.01% saccharine solution for 11 sessions, followed by an increase to 1.1% 
from sessions 12-19, followed by another upshift to 1.25% from sessions 20-30 
(.01-1.1-1.25); another group of Ss experienced 1.1% through session 19, 
followed by an increase to 1.25% saccharine on session 20 and through the 
remainder of the experiment (1.1-1.1-1.25); finally, the third group received 
1.25% saccharine throughout the 30 sessions (1.25-1.25-1.25). 
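
The three concentration schedules can be laid out as a small lookup table, which makes the timing of each upshift easy to verify. This is only an illustrative sketch: the names `SCHEDULES` and `concentration` are hypothetical, and the session boundaries are those given in the paragraph above.

```python
# Each schedule lists (first session, last session, percent saccharine).
SCHEDULES = {
    ".01-1.1-1.25": [(1, 11, 0.01), (12, 19, 1.1), (20, 30, 1.25)],
    "1.1-1.1-1.25": [(1, 19, 1.1), (20, 30, 1.25)],
    "1.25-1.25-1.25": [(1, 30, 1.25)],
}


def concentration(group, session):
    """Return the saccharine concentration (percent) for a group and session."""
    for first, last, percent in SCHEDULES[group]:
        if first <= session <= last:
            return percent
    raise ValueError(f"session {session} is outside the 30-session design")


# On session 20 all three groups receive the same 1.25% solution.
print([concentration(group, 20) for group in SCHEDULES])
```

From session 20 onward the groups differ only in their history of upshifts, which is the comparison of interest in this experiment.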


2. Results 


The same operant box measure of Experiment 1 was examined in Experiment 
2. From Figure 3 it is clear that in the preshift, 1.25% saccharine resulted in 
a higher level of bar pressing than any other concentration, and 1.1% produced 
a higher level of performance than .01% saccharine. The mean number of 
bar presses per minute from sessions 1-11 differed significantly among the three 
groups by an analysis of variance, F (2, 42) = 3.51, p < .05. By Duncan’s 
multiple comparisons the difference between .01-1.1-1.25 and each other group, 
and between 1.1-1.1-1.25 and 1.25-1.25-1.25, was each significant (p < .05). 

FIGURE 3 
MEAN BAR PRESSES PER MINUTE, EXPERIMENT 2 (ARROWS DENOTE POSTSHIFTS) 


Figure 3 indicates that on session 12, the .01-1.1-1.25 Ss performed at a 
higher level than the 1.1-1.1-1.25 group (positive incentive contrast effects). 
The mean number of bar presses from sessions 12-19 differed significantly 
between 1.1-1.1-1.25 and .01-1.1-1.25 by analysis of variance, F (1, 28) = 
5.63, p < .05. 

Further examination of Figure 3 reveals that on session 20, the 1.1-1.1-1.25 
Ss suddenly increased their response rate to a level above 1.25-1.25-1.25 (posi- 
tive contrast), while the .01-1.1-1.25 Ss increased their rate of bar pressing to 
a level below the 1.25-1.25-1.25 group. The mean number of bar presses from 
sessions 20-30 was significantly different among the three groups by an analysis 
of variance, F (2, 42) = 5.18, p < .01. By Duncan’s comparisons the 
difference between 1.1-1.1-1.25 and 1.25-1.25-1.25 was reliable (p < .05), while 
the difference between .01-1.1-1.25 and 1.25-1.25-1.25 failed to reach 
significance (p > .05). 


D. Discussion 


Experiment 1 indicated that rats who experienced a tone paired with an 
increase in incentive magnitude ran faster to obtain the tone than animals who 
experienced the tone paired with the higher reward magnitude. This result, as 
already discussed, is consistent with the notion that positive incentive contrast 
in animals is due to an appetitive motivational state. 

In Experiment 1 animals ran faster to the tone when it was previously 
associated with a presumably appetitive emotional state, as compared to when 
the signal was initially paired with no emotional state. To the extent that the 
tone elicited in the runway the emotion produced by an increment in reward 
magnitude in the operant box, the runway situation involved conditioned 
appetitive motivation. 

Since the tone on trial 1 occurred before the running response on trial 2, 
and the tone on trial 2 occurred before the response on trial 3, etc., then the 
increment in running speed (for the .01-1.2T Ss as compared to the 1.2-1.2T 
group) was quite probably a direct reflection of the motivational properties of 
the tone, 

In Experiment 1 the finding that from sessions 15-21, 1.2-1.2T Ss performed 
at the same level as 1.2-1.2 Ss indicates that the tone had no effect on bar 
pressing. 

The present positive contrast effects with saccharine differ from previous 
research with saccharine, which failed to obtain the effect (e.g., 3). A 
likely reason for this disparity is that previous researchers used saccharine 
concentrations that quite possibly did not produce as large a difference in 
preshift performance as in the present study. Dilollo and Beez (1) and Weinstein 
(7) have demonstrated that larger preshift differences in performance are more 
likely to result in contrast effects than smaller preshift behavioral differences. 

Experiment 2 demonstrated that positive incentive contrast effects, obtained 
with an increase in concentration of saccharine solution subsequent to no 
increment in concentration, are not found in animals who experience the 
increase in concentration subsequent to an increment. As discussed previously, 
these data support an appetitive motivational account of positive incentive 
contrast effects in animals. 


E. Summary 


Experiment 1 demonstrated that animals ran faster in a straight alley runway 
to a tone previously paired with positive incentive contrast effects produced 
in an operant conditioning chamber than did rats in the control group, 
which ran to a tone initially paired with a higher amount of reward. In 
Experiment 2 upward contrast effects in rats which resulted from a single 
increment in the concentration of a saccharine solution did not occur following 
a second increment in saccharine concentration. These data are in accord with 
the notion that positive incentive contrast effects in infrahuman organisms are 
due to some form of appetitive motivational state. 


THE APPLICATION OF TWO MNEMONIC TECHNIQUES 
FOLLOWING ROTE MEMORIZATION OF 
A FREE-RECALL TASK* ¹ 


University of Richmond 


STEPHEN M. RUST AND KENNETH A. BLICK² 


A. Introduction 


There has been a recent development in the study of verbal learning 
concerning the types of mnemonic techniques that college students employ in 
learning a free-recall task. In one survey of mnemonic techniques, Boltwood 
and Blick (4) were able to determine the types of devices that Ss had selected 
to aid them in learning a list of 19 words. It was found that 38% of the Ss 
used the first-letter (FL) technique, 31% the clustering (C) technique, and 
22% the descriptive-story (DS) technique. The FL technique refers to any 
manipulation of the first letters of the words designed to facilitate learning. 
The C technique concerns the clustering of words into meaningful conceptual 
categories, like semantic or phonetic similarity; and the DS technique 
designates the use of the words to form a story or stories. These were the 
three most popular mnemonic techniques reported in the survey. 

In another mnemonic survey conducted by Blick and Waite (2), the Ss 
were asked to report any memory aids that they might employ in learning an 
imaginary list of words. The results indicated that 34% of the Ss reported the 
FL technique, 16% the C technique, and 6% the DS technique, the DS 
technique being the least frequently reported memory aid. 

Following their survey of mnemonic techniques, Boltwood and Blick (4)
performed an experiment to determine which of the three primary mnemonic
techniques (FL, C, and DS) would prove most effective in the retention of
verbal material over relatively long time intervals. Prior to the learning of a
list of 19 unrelated nouns, each of three groups of Ss received instructions in
their assigned mnemonic technique, while the control group was given instructions
in the simple-repetition (SR) technique. The SR technique is equivalent
to no-mnemonic instructions in that the S is told to memorize the words by
concentrating on each word in succession and reading the list of words over
and over. There was no difference in immediate recall between the groups
with or without mnemonics. At the one-week interval the DS and C techniques
were shown to be significantly more effective as mnemonics, and at the eight-week
interval only the DS technique proved to be effective in aiding recall.
Boltwood and Blick (4) hypothesized that the DS technique was effective
because it resulted in a meaningful syntactical arrangement of the words that
could be efficiently employed by Ss previously naive to its use. They proposed
that the C technique failed to maintain superior retention at the 8-week
interval due to the fact that the 19 nouns were originally chosen in order to
reduce their conceptual relatedness. Perhaps the very popular FL mnemonic
technique failed to aid in recall because its application involved a very difficult
anagrams task. That the FL technique can facilitate free recall has been
demonstrated by Pash and Blick (9), who found that Ss who employed the
FL technique were superior in their retention of a list of nine words over a
48-hour period in comparison to a control group. In addition, their Ss in the
mnemonic condition were told that the first letters of the nine words could
be arranged to spell the word “education,” thereby removing the factor of
an anagrams task.

The primary purpose of the present experiment was to determine which
mnemonic technique, the DS or FL, was more effective when the anagrams
factor was not a part of the application of the FL technique. To this end,
the FL group was told the phrase that the first letters of all the words would
spell, and the DS group was given a descriptive story that was composed of
vivid and active mental images that appealed to the senses while still maintaining
consistency and meaning. A secondary purpose of the experiment was
to examine the effectiveness of the two mnemonic devices when they were
introduced immediately after all Ss had rote memorized the list but before
the free-recall retention tests.

* Received in the Editorial Office, and published immediately at
Provincetown, Massachusetts. Copyright by The Journal Press.
¹ This research was supported in part by a faculty research grant from the
University of Richmond.
² Requests for reprints should be sent to the junior author.


B. Method
1. Subjects 


The Ss were students from three similar general psychology classes at the
University of Richmond. It was a requirement of the course that they participate
in the experiment, which was conducted during regular class time. Each
of the experimental groups was assigned one of the two mnemonic techniques,
FL or DS, while the control group served in the no-mnemonic, simple-repetition
(SR) condition. The scores from 49 Ss were discarded because some
Ss failed to follow the initial instructions for rote memorization and others
failed to attend all three phases of the experiment. The remaining total was
66 Ss of which 23 were in the FL group, 26 in the DS group, and 17 in the
SR group.


2. Verbal Material 


The list of words consisted of 14 English nouns, and each noun had a low 
frequency of occurrence (15 times or less per million words) according to the 
Thorndike and Lorge (10) word frequency tables. In order to minimize 
conceptual relatedness among the words, each word was taken from a different 
one of the Battig and Montague (1) conceptual categories. Each of the 14 
words was checked to insure that it occurred in only one of the 14 categories 
used. The words were: abdomen, cholera, ensign, homicide, indigo, melons, 
mortar, radishes, rafters, roach, shark, slacks, tabernacle, and yacht. 


3. Procedure 


The experiment was divided into three retention tests with the second and 
third tests occurring one week and eight weeks after the immediate test. At 
the immediate retention test the Ss received a five-page booklet. The instruc- 
tions on the first page explained to the Ss that they would have seven minutes 
in which to memorize the list of words on the second page by concentrating 
on each word in succession and reading the list over and over, and then they 
would have two minutes to write down the words in any order. The instructions
further stated that the Ss would be asked to recall the words at intervals
of one and eight weeks. The second page contained one of five random
orderings of the 14 words, capitalized and centered on the page. The five 
orders were unsystematically distributed throughout each group. The third 
page was a blank page for the Ss’ written recall of the words, followed by the 
fourth page on which the Ss could report whether or not they had followed 
the initial instructions concerning rote memorization. 

On the final page of the booklet, the FL class was informed that the words 
could be arranged in such an order that the first letters of the words would 
spell out “merry christmas” and were presented the words in this order, 
capitalized with the first letter of each word underlined. The DS class was 
furnished with a story incorporating each of the 14 words, which were 
capitalized and underlined where they occurred. The story was constructed 
to be as consistent and meaningful as possible, while appealing to the senses 
of the S through vivid and active images. The story reads as follows: “Momentarily
safe, high up in the rafters of the sacred tabernacle and completely
exhausted from the violent effects of cholera, he quickly falls asleep. The
gnawing hunger within his abdomen invades his mind and he begins to dream
of gardens running over with ripe juicy melons and fleshy radishes. His dream
is interrupted by a roach crawling up his weather-beaten slacks. After disposing
of the pest, he imagines himself on his own yacht taking a pleasure cruise.
This fantasy leads directly into a dream of the war when he was an ensign.
He envisions the menacing fin of a killer shark as it cuts through the indigo 
sea, while his ears are deafened by the explosion of mortar from a nearby 
shore. Suddenly he awakes and realizes it has all been a dream. He knows 
there is no rest for one who has committed homicide and that he must be on 
the run again.” 

The SR class merely received the 14 words in a different order from the 
original presentation and were told that concentrating on each word in succes- 
sion and reading the list of words over and over would aid them in reten- 
tion. The final page for all three groups also contained further instructions 
informing the Ss that it would jeopardize the entire experiment if they did not 
use the technique designed to aid them in the retention of the list of words. 
One week and eight weeks later the Ss in all three classes were given two 
minutes to recall the words in any order. 

Exposing an intact class to the same memory technique was done in order
to reduce the possibility that an S might be exposed to a technique other than
the one which he originally received. If the three techniques had been randomly
distributed within each class, the Ss might have discussed the different techniques
among themselves and therefore confounded the effect of each technique.


C. Results


The mean number of words correctly recalled by all Ss at the immediate,
one-week, and eight-week retention intervals, respectively, was 13.71, 11.2% 
and 9.12 for the SR group; 13.58, 10.69, and 8.46 for the DS group; and 
13.70, 10.87, and 8.35 for the FL group. 

An analysis of variance was performed at each retention interval in order to
determine if there was a significant difference in recall among the three groups.
Those Ss were excluded who had failed to attend all three experimental
sessions or who had not followed instructions. There were no significant
effects, since all comparisons produced F < 1.
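
The F ratio behind such an analysis can be illustrated with a short sketch. The recall scores below are hypothetical placeholders, since the individual scores are not reported, and the function name `one_way_f` is an illustrative assumption; only the one-way layout with three groups follows the text.

```python
def one_way_f(groups):
    """Return (F, df_between, df_within) for a one-way analysis of variance."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Between-groups sum of squares: group means around the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-groups sum of squares: scores around their own group mean.
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Hypothetical recall scores (words recalled out of 14); not the study's data.
sr_scores = [14, 13, 14, 12, 14]
ds_scores = [13, 14, 14, 13, 12]
fl_scores = [14, 14, 12, 13, 14]

f_ratio, df_b, df_w = one_way_f([sr_scores, ds_scores, fl_scores])
print(f"F({df_b}, {df_w}) = {f_ratio:.3f}")
```

An F ratio below 1 means that the variation among group means is smaller than the variation among Ss within groups, which is why no further significance test is needed in that case.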


D. Discussion 


Neither the DS nor the FL mnemonic proved more effective than the SR
condition at any retention interval. In order to account for the negligible effect
of these mnemonic techniques, which have previously been shown to have a
significant effect on the retention of a list of words, a review of the pertinent
literature is necessary. Previous studies (5, 8) investigating the effects of
mnemonics on the retention of verbal material have shown that although
mnemonics profoundly influence the rate of original learning, they have little
or no effect upon retention. Because in the present experiment the
DS and FL techniques were not imposed upon the Ss until after the original
learning of the list of words had taken place, one would not expect to find a
significant difference in retention between the two mnemonic groups and the
control group.

An experiment conducted by Tulving and Osler (11) has shown that (a) 
a mnemonic retrieval cue should be paired with the corresponding words to be 
remembered at the time of input or original learning if it is to be effective in 
facilitating recall, and (b) changing the retrieval cue paired with the word 
between input and output will interfere with retention of the word. It is 
therefore possible that in the present experiment any subjective cues formed by 
the Ss at input in association with the list of words to be remembered interfered 
with the E-imposed cues prior to output, or vice versa.

Similar findings were reported by Hudson (6) in which the Ss were in- 
structed, either before or after presentation of a word list, that the words 
could be clustered into categories and given names of these categories. It was 
found that the group receiving this information after the presentation of the
word list recalled fewer words than either the group receiving the information
before the presentation of the list of words or the group that received no
information. On the basis of these results, Hudson (6) hypothesized that the
Ss receiving the information after the word list presentation either did not
or could not employ it in organizing their recall. In the present experiment,
the S’s original subjective organization of the words at the time of the initial
learning by rote memorization may not have been compatible with the mnemonic
organization of the words as defined by E presented directly after the
original learning; therefore many of the Ss did not or could not use the E-imposed
mnemonics. Apparently, application of the DS and FL mnemonic techniques
immediately following rote memorization was interfered with by the S’s
previously established subjective inter- and intraword associations.

Another question that must be considered in relation to the present experiment
is the effect upon retention of mnemonics that are constructed by the E
(as the FL and DS techniques were in the present study) as opposed to
mnemonics that are made up by the Ss. The results of the research in this
area are somewhat conflicting. In a paired-associates learning experiment
reported by Lieberman, Walters, and Cox (7), in which one group of Ss
was given E-constructed mnemonics and the other group of Ss was instructed
to make up their own mnemonics, no significant difference in recall was found
between the two groups. On the other hand, Bobrow and Bower (3) found
that Ss who generated their own linking sentence for a pair of nouns displayed
superior recall to Ss who were presented with an equivalent linking sentence
that had been composed by E. It can be concluded that in the present experiment
the presentation of E-constructed FL and DS mnemonics may have had
a more detrimental effect upon retention than if the Ss had been allowed to
make up their own FL and DS mnemonics; but there is no evidence in the
present study to suggest that the E-constructed mnemonics are less effective in
aiding retention than no mnemonic at all. Therefore the E-made mnemonics
in the present study cannot be said to have had any negative effect upon retention
beyond the possibility that they interfered with the inter- and intraword
associations that S established at the time of original learning.


E. Summary 


Following rote memorization of a free-recall task involving 14 nouns, two
E-constructed mnemonic techniques (first-letter and descriptive-story) were
presented separately to two experimental groups, while a control group received
instructions in a no-mnemonic, simple-repetition technique. There were no
significant differences in recall among the three conditions at an immediate,
one-week, or eight-week interval. It was concluded that if mnemonic techniques
are to have a facilitating effect it is imperative that they be presented
at the time of original learning of the material.


A. Introduction


The galvanic skin response (GSR) is one of the most sensitive measures of 
activation. One of the objectionable characteristics of the DC coupled GSR
recording technique is the requirement that the operator maintain a steady 
basal resistance by adjusting a voltage balancing circuit. 

This tedious balancing problem has been a concern of psychophysiologists
for many years. Ellson, Davis, Saltzman, and Burke (1) and Grings (2) have
proposed the use of servomotor mechanisms to rebalance the bridge circuit
automatically. Stewart (9) made use of a circuit that employed an AC coupled 
amplifier that amplified only the sudden change in GSR and blocked the 
gradual changes. 

This investigation is concerned with the effectiveness of a modification of the 
circuit specified by Stewart for those experiments that require that only the 
number and size of the GSR be monitored and that are not concerned with
absolute measurement of basal resistance or conductance changes (e.g., lie
detection, conditioning, etc.). The effectiveness in a lie detection situation of
the DC coupled circuit was compared to an AC coupled technique in which
only the sudden change in GSR (AC component) was recorded and the
gradual change (DC component) in base line was ignored. In addition, comparisons
were made between the traditional lie detection procedures: relevant-irrelevant,
peak of tension, and association.


B. Method
1. Subjects 


Seventy-six paid college-age student volunteers, recruited through the use 
of posted announcements and classroom contacts, served as subjects. All Ss
were naive as to the purpose of the research. The task was presented as a 
challenge to the S’s ability to “fool the lie detector.” 


2. Apparatus 


A Grass Model 7 polygraph was used to record all responses. Two Model
7P1 Grass preamplifiers, with high stability (3 µV/hour drift), adequate sensitivity
(10 µV/cm), and low noise characteristics (less than 3 µV) were used as
the input stages for recording GSRs. An input switch allowed the units to
function as either an AC coupled preamplifier (with a time constant of 8
seconds) or a DC coupled preamplifier (utilizing a voltage balancing network
with a 50 microampere constant current source). The standard driver units
(Model 7DAB) were used to deflect the recording pens over the eight-inch
chart paper moving at 2.5 mm/sec.
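
The effect of AC coupling with an 8-second time constant can be sketched as a first-order high-pass filter applied to a sampled signal. This is a minimal digital analogue under stated assumptions, not a model of the Grass preamplifier itself: the sampling interval of 0.1 second, the function name `ac_couple`, and the test signals are all hypothetical.

```python
def ac_couple(samples, dt, tau=8.0):
    """First-order high-pass filter, a discrete analogue of an RC-coupled input.

    samples: signal values taken every `dt` seconds
    tau:     time constant in seconds (8 s in the apparatus described above)
    """
    if not samples:
        return []
    alpha = tau / (tau + dt)
    output = [0.0]  # the filter is assumed to start at rest
    for i in range(1, len(samples)):
        # Pass the change between successive samples, let the old level decay.
        output.append(alpha * (output[-1] + samples[i] - samples[i - 1]))
    return output


DT = 0.1  # hypothetical sampling interval in seconds

# A steady basal level produces no deflection at all.
steady = ac_couple([5.0] * 50, DT)

# A sudden change passes almost unattenuated, then decays toward zero.
step = ac_couple([0.0] * 10 + [1.0] * 1000, DT)
print(f"deflection at onset: {step[10]:.3f}, after 100 s: {step[-1]:.6f}")
```

The sketch shows why no rebalancing is needed: a change in level, however large, is forgotten within a few time constants, while a sudden response is passed almost in full.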

Simultaneous monitoring of the S’s GSR response, under both the AC and 
DC coupled recording techniques, was achieved by the use of one pair of 
electrodes. All recordings were made while the S was seated in a chair which
had one-inch circular electrodes embedded in armrests designed specifically 
for recording GSR responses. Rubber straps were used to keep the palms in 
firm contact with the electrodes. The electrodes were cleaned between subjects 
and coated with EKG Sol Electrode Paste. 


3. Lie Detection Procedures 


Criminal lie detection procedures, by their nature, imply that the suspect
has or may have knowledge that the examiner is required to detect. Experi- 
mental lie detection research attempts, under controlled conditions, to mimic 
criminal lie detection procedures. The design used in this investigation required 
the subject to select at random a stimulus card from a deck of assorted stimulus 
cards, the contents of which were known to the examiner. Raters were later 
required to identify, on the basis of the recording obtained by the examiner, 
the stimulus item that had been selected by the subject. 

a. Peak of tension (PT). In this procedure the numbers from 18 to 31
were presented in ascending order, once on each of two trials. The critical
numbers fell between 20 and 29 inclusive; however, these numbers were
bracketed so that evaluations of low and high stimuli could be made. Although
Ss had been informed that blank cards had been included in the deck, in
reality none were included. Therefore, the raters knew that every S had
selected a numbered card.

The guilty person paradigm (3) which requires S to convince E that he has
drawn a blank card (as opposed to one containing a number) was used. Under 
this procedure, the S’s instructions were “If you draw a blank card, you must 
appear innocent—as in fact you are. If you draw a number, you must still 
appear as though you had drawn a blank card—i.e., innocent. If you seem to 
have drawn a numbered card whether you did or not, you fail on the experiment.”
This paradigm, which requires S to appear innocent, reflects real life
more closely than the usual guilty information paradigm under which S may
employ the strategy of creating false positives. 

b. Association (A). The 10 stimulus words chosen for use, under the 
association procedure, were selected from the most frequent responses listed by 
Palermo and Jenkins (6). Each word was the most common response elicited 
from at least 60% of their college sample. The stimulus words that led to 
these most frequent responses were read aloud to the S, who was required to 
respond with a free discrete association to each word. He was restricted, however,
in that he was told not to respond with the word he had chosen. It was
expected that blocking a natural response and substituting a secondary
response would result in a measurable physiological reaction. The stimuli and
the most frequent responses used were as follows: boy-girl, dog-cat, here-there, 
king-queen, lamp-light, long-short, man-woman, scissors-cut, slow-fast, and 
table-chair. 

c. Relevant-irrelevant (RI). All Ss were tested under two RI conditions: 
relevant-irrelevant words (RIW) and relevant-irrelevant numbers (RIN). 
Under the RIN condition the guilty person paradigm was used (Ss were 
instructed to appear innocent—i.e., as if they had drawn a blank card). The 
numbers used under the peak of tension procedure served as stimuli. Under
the RIW conditions, the guilty information paradigm was used (S was in- 
structed to respond “No” to all stimuli, including the word he had chosen, 
when questioned about the words that were contained in the deck). The 
response words specified under the association procedure were used as stimuli. 
Under both the RIW and the RIN conditions the 10 stimuli were presented 
randomly three times, for a total of 30 stimulus presentations. 
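
A presentation schedule of this kind can be sketched as follows. The word list is the set of response words given under the association procedure; the decision to shuffle the 10 stimuli separately within each of three passes is an assumption, since the text says only that they were presented randomly three times, and the function name and seed are hypothetical.

```python
import random
from collections import Counter

# Response words from the association procedure, used as RIW stimuli.
RIW_STIMULI = ["girl", "cat", "there", "queen", "light",
               "short", "woman", "cut", "fast", "chair"]


def presentation_sequence(stimuli, repetitions=3, seed=None):
    """Return a random sequence in which every stimulus occurs `repetitions` times.

    Assumption: each pass through the list is shuffled independently.
    """
    rng = random.Random(seed)
    sequence = []
    for _ in range(repetitions):
        block = list(stimuli)
        rng.shuffle(block)
        sequence.extend(block)
    return sequence


sequence = presentation_sequence(RIW_STIMULI, seed=1)
counts = Counter(sequence)
print(len(sequence), "presentations;", counts["queen"], "of the critical word 'queen'")
```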


4. Experimental Procedure


All Ss were exposed to all conditions in one of two orders. A number, based
on the sequence in which the Ss reported, was assigned to each S. Each odd
numbered S chose a critical number and was tested as follows: peak of tension
(PT1), relevant-irrelevant numbers (RIN), and peak of tension (PT2).
He then chose a critical word and was tested as follows: association (A1),
relevant-irrelevant words (RIW), and association (A2). The even numbered
subjects followed basically the same procedure except they chose the critical
word first and then the critical number. In each case AC coupled and DC
coupled recordings were obtained simultaneously.

A total of 12 different recordings were obtained from each subject (PT1,
RIN, PT2, A1, RIW, A2 recorded simultaneously under the AC coupled
and DC coupled recording techniques). These 12 recordings were then coded
by use of the double-blind technique, separated, and randomly mixed prior to
distribution to the raters. Four raters, all of whom had previous experience
with evaluating DC coupled GSR, individually scored the 912 records. Each
rater was required to identify, on the basis of the GSR recording, the critical
item selected by the subject. In addition, the raters estimated the confidence
they placed in their decision on a scale ranging from zero (no confidence, a
guess) to 4 (positive).
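
The counterbalancing scheme and the resulting number of records can be sketched briefly. The order of conditions within each half follows the text; the names `order_for_subject` and `total_records` are hypothetical.

```python
NUMBER_BLOCK = ["PT1", "RIN", "PT2"]  # tested after choosing a critical number
WORD_BLOCK = ["A1", "RIW", "A2"]      # tested after choosing a critical word
COUPLINGS = ("AC", "DC")              # both recorded simultaneously


def order_for_subject(subject_number):
    """Odd numbered Ss start with the number block, even numbered Ss with the word block."""
    if subject_number % 2 == 1:
        return NUMBER_BLOCK + WORD_BLOCK
    return WORD_BLOCK + NUMBER_BLOCK


def total_records(n_subjects):
    """Each S yields one record per condition under each coupling technique."""
    return n_subjects * len(NUMBER_BLOCK + WORD_BLOCK) * len(COUPLINGS)


print(order_for_subject(1))
print(order_for_subject(2))
print(total_records(76))
```

With 76 Ss, 6 conditions, and 2 coupling techniques the count is 912, the number of records scored by each rater.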


C. Results and Discussion


1. Comparative Effectiveness of AC and DC Coupling

As can be seen in Table 1, ... the DC coupled conditions ... of the binomial
test indicate ... greater than chance, the norm ... the data. The PT2 conditions
... rater. Only one of the 24 comparisons ... indicating no real difference
between ... the two procedures.


TABLE 1
NUMBER OF CORRECT IDENTIFICATIONS OF THE CRITICAL ITEM BY INDIVIDUAL
EXAMINERS UNDER EACH DETECTION AND RECORDING CONDITION

Rater and                         Conditions
circuit coupling     PT1   RIN   PT2    A1   RIW    A2   Total
Rater A
  DC                  31    52    16    42    63    36     240
  AC                  28    51    14    37    61    33     224
Rater B
  DC                  31    44    15    41    56    30     217
  AC                  24    52    17    36    57    34     220
Rater C
  DC                  28    49    15    40    59    37     228
  AC                  27    50    15    40    62    34     228
Rater D
  DC                  31    50    17    45    58    30     231
  AC                  28    52    14    38    58    32     222
Total
  DC                 121   195    63   168   236   133     916
  AC                 107   205    60   151   238   133     894


Thus, it may be concluded that for all the procedures used in this study,
the AC and DC coupling recording techniques were equally effective. How- 
ever, the two techniques are not equivalent in terms of the demands placed on 
the examiner. When the DC circuit is used, changes in basal resistance can 
result in “off-the-recording-chart” pen excursions, demanding constant re- 
balancing of the voltage balancing network by the examiner. However, when 
the AC coupled circuit is used, the examiner is freed of the necessity of making 
this adjustment. 
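
The comparison of a detection count against chance, for which this section refers to the binomial test, can be sketched as an exact upper-tail probability. The chance level of 1 in 10 is an assumption based on there being 10 possible critical items in each procedure; the count of 16 is the Rater A, DC, PT2 entry of Table 1, and the function name is hypothetical.

```python
from math import comb


def binomial_upper_tail(successes, trials, p_chance):
    """Exact probability of at least `successes` correct detections by chance."""
    return sum(
        comb(trials, k) * p_chance ** k * (1 - p_chance) ** (trials - k)
        for k in range(successes, trials + 1)
    )


N_SUBJECTS = 76    # one record per S in each condition
P_CHANCE = 1 / 10  # assumed: one critical item among 10 candidates

detections = 16    # Rater A, DC coupling, PT2 (Table 1)
p_value = binomial_upper_tail(detections, N_SUBJECTS, P_CHANCE)
print(f"P(at least {detections} of {N_SUBJECTS} by chance) = {p_value:.4f}")
```

Under this assumption about 7.6 correct identifications would be expected by chance in any one cell of the table, so the same function can be applied to every entry.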


2. Comparisons Among Lie Detection Procedures 


a. Peak of tension. The peak of tension procedure resulted in the lowest 
number of correct detections. Since each rater’s decision was based on a single 
observation of the 10 stimuli, the low accuracy is consistent with Rouke’s (7) 
conclusion that the simultaneous examination of several PT records is necessary 
for more accurate detection. Furthermore, under the first peak of tension trial 
(PT1) there were significantly more correct detections (p < .05) than under 
the second peak of tension trial (PT2). The decreased efficacy of the PT2 
condition may be attributed to the intervention of the RIN condition between 
the PT1 and PT2 trials. This intervention may have made Ss more aware
of the requirements of and the necessary responses to the task.

Another interesting finding lay in the relationship between the PT procedures
and the AC and DC coupled recording technique. It has generally
been held (4, 5) that the gradual change in resistance provides the examiner
with information that facilitates the decision process. However, in this in- 
vestigation, the number of detections under the PT AC condition was not 
significantly different from the number of detections under the PT DC condi- 
tion. Therefore, the rater does not seem to need the information provided by 
the gradual change in resistance. 

b. Association. Unlike the peak of tension procedures, there was no significant
difference between the first association trial (A1) and the second association
trial (A2). In a further comparison of association with peak of tension,
it was found that while there was no significant difference between PT1 and
A1, the A2 condition resulted in significantly (p < .05) more detections than
the PT2 condition. It would seem that those activating characteristics of the 
association procedure are not significantly decreased by a second exposure, while 
the activating characteristics of the peak of tension procedure are significantly 
decreased by a second exposure. 


It should be noted that the association procedure need not be limited to lie 
detection procedures. It could serve as a convenient means of physiologically 
activating an S in a variety of experimental conditions. 

c. Relevant-irrelevant. The relevant-irrelevant procedures resulted in the
highest number of correct detections. This could be expected, since these
decisions were based on three presentations of the critical stimulus, while the
peak of tension and the association procedures were based on only one. However,
significantly more correct decisions (p < .01) were made under the
relevant-irrelevant word procedure (RIW) than under the relevant-irrelevant
number procedure (RIN). This difference could not be attributed to the
different instructions used to define the subject’s task (guilty information
paradigm under the RIW conditions and guilty person paradigm under the
RIN conditions), for Gustafson and Orne (3) did not obtain a significant
difference in the number of detections attained under the RI conditions when
the same stimuli were used under each paradigm. Thus it might be concluded
that the use of words as opposed to numbers provided the raters with more
cues on which to base their decisions. This conclusion supports the previously
stated concept that when words are used as stimuli, the subject becomes more
active, interested, and responsive, and therefore his critical item is more easily
detected. Words, because of their continual use, may have more affective
meaning to the subject than numbers. However, this affectivity may also limit
the usefulness of words in testing situations, since the stimulus words used
may have idiosyncratic associations.


3. Interrater Reliability


Kendall’s coefficient of concordance (W) was used to determine interrater 
reliability. The coefficient of concordance values determined were .98 for AC
and DC coupling combined, .97 for DC coupling, and 1.00 for AC coupling.
Since all values were significant beyond the .01 level, it was concluded that the
number of correct detections obtained under the detection procedures was not 
idiosyncratic to a particular rater, but rather was a function of the char- 
acteristics common to the recordings. 
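
The coefficient of concordance can be illustrated from the counts in Table 1. In this sketch each rater's six detection counts are ranked and W is computed across the four raters. This reading of how W was obtained is an assumption, although it gives the reported .97 and 1.00. The Rater B, DC, RIN entry is taken as 44, the value implied by the row and column totals of the table. No correction for ties is applied, and the function names are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 (smallest), giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def kendalls_w(rows):
    """Kendall's W for m raters (rows) over n conditions (columns), no tie correction."""
    m, n = len(rows), len(rows[0])
    rank_sums = [0.0] * n
    for row in rows:
        for col, rank in enumerate(average_ranks(row)):
            rank_sums[col] += rank
    mean_sum = m * (n + 1) / 2
    s = sum((total - mean_sum) ** 2 for total in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Columns: PT1, RIN, PT2, A1, RIW, A2; rows: Raters A to D (Table 1).
DC_COUNTS = [[31, 52, 16, 42, 63, 36],
             [31, 44, 15, 41, 56, 30],
             [28, 49, 15, 40, 59, 37],
             [31, 50, 17, 45, 58, 30]]
AC_COUNTS = [[28, 51, 14, 37, 61, 33],
             [24, 52, 17, 36, 57, 34],
             [27, 50, 15, 40, 62, 34],
             [28, 52, 14, 38, 58, 32]]

w_dc = kendalls_w(DC_COUNTS)
w_ac = kendalls_w(AC_COUNTS)
print(f"W (DC) = {w_dc:.2f}, W (AC) = {w_ac:.2f}")
```

Under AC coupling all four raters order the six conditions identically, which is what a coefficient of 1.00 expresses.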


4. Rater Confidence 


In order to determine if the confidence rating given to a decision was related 
to the validity of the decision, the mean confidence rating for correct decisions 
was compared to the mean confidence rating for incorrect decisions by means 
of a standard test. The mean confidence rating for the correct decisions was 
significantly higher (p < .01) than the mean confidence rating for the in- 
correct decisions for all four examiners. 

These confidence ratings can be taken as an estimate of the ease of evaluating
the records, since a rater would tend to place higher confidence in a decision he 
found easy to make. A comparison of the AC and DC conditions indicated that 
there was no significant difference in the mean confidence ratings. This finding 
tends to support the conclusion that the raters found no difference in difficulty 
in evaluating the two types of recording. 


D. Summary


The galvanic skin response (GSR) consists of a slow (DC) component and
a fast (AC) component. The DC component, for some investigators, may be
superfluous and undesirable. This investigation, by simultaneously monitoring
GSR with a constant current voltage balancing network (which requires rebalancing)
and an AC coupled circuit (which does not require monitoring),
determined the relative effectiveness of the two recording techniques when
used under lie detection conditions. Seventy-six subjects were tested under
three procedures: association, relevant-irrelevant, and peak of tension. All subjects
attempted to conceal from the examiner the word and the number they had
selected. Recordings were examined by four independent raters. All but one
of the 12 procedure-technique combinations yielded a greater than chance 
number of detections. It was concluded that, at least in the area of lie detec- 
tion, the DC coupled voltage balancing network and the AC coupled recording 
techniques are equally efficient; however, the AC coupled recording procedure
does not require rebalancing.


Edwin E. Ghiselli and William B. Ghiselli


For some six decades or more psychologists have utilized and examined 
rating methods, and these methods have come to be fully accepted as ways for 
describing stimulus objects. Yet the very process of rating has not really been 
clearly conceptualized, nor has the procedure of rating itself been well defined. 
Guilford (5) takes the position that rating procedures are one form of the 
psychophysical methods and logically belong under the heading of the method 
of successive intervals. A strict psychophysicist, however, might well hesitate to
agree, especially when he views certain of the relatively newer and more
elaborate rating methods, such as the forced-choice procedure, and when he
sees that the stimulus object is a human being and might even be the rater
himself. Whether or not one believes that ratings should be classified among
the psychophysical methods, quite obviously the process of rating involves the
assessment of a stimulus object, human or otherwise, by a person. A person,
then, is the means by which an assessment of the stimulus object is obtained. 

Throughout the years there has been a good deal of discussion and evalua- 
tion of the various sorts of rating procedures, the characteristics of the rater, 
the social circumstances under which ratings are made, and the psychological 
property or trait being rated. While pertinent empirical evidence is by no 
means conspicuous in amount, the general conclusion about these matters has 


been to stress objectivity. Favored are those rating procedures that spell out in
detail the quality to be rated, raters who are impartial and look for tangible
evidence in the stimulus individual of what is being rated, and traits that are
given in terms of the behavior of the stimulus person rather than as some
abstruse quality.

Those who would instruct us that ratings should be as objective as possible 
are, in fact, arguing that it is the qualities of the stimulus individual and not 
of the response of the rater to that individual that are of significance. Indeed, 
they conceive of the rater as an instrument that notes and reports the properties 
of the stimulus individual, but does not add anything of itself to that report. 
They hold that unfortunately raters are “fallible,” and through various means
seek to develop him into a “standard” person. For them, in the best possible of
all circumstances the rater would be a passive thing like a thermometer, an 
instrument that responds in the same consistent fashion whenever it is presented 
with a given circumstance. The only difference is that the rater is taken to be a 
multiple of meters with which it is possible to describe the stimulus person 
along a whole set of different dimensions. 

The concern with the properties of the stimulus object, rather than with 
the properties of the mental response to that object, was termed the stimulus 
error by Titchener (7), and he took it as fundamental to the very foundation 
of psychology. It was called the psychologist’s fallacy by James (4), and with 
more charity the stimulus attitude by Boring (2). The stimulus error brings 
into contrast kundgabe (information, information about the properties of the 
stimulus object) and beschreibung (description, description of the properties 
of the mental response to that stimulus object). In the last part of the nine- 
teenth century and the first part of the twentieth century, the value, and, 
indeed, the validity, of this distinction was a matter of basic importance for 
theoreticians (3). The Gestalt psychologists declared the distinction to be
false, and the behaviorists that it was irrelevant. The tremendous impact these 
two movements had upon psychology distracted attention from the differentia- 
tion between kundgabe and beschreibung. Nevertheless, the matter continued 
to be disturbing to theoreticians, so that as late as the 1930’s Tolman (8),
who embraced both Gestalt psychology and behaviorism, felt he had to deal
with it and did so at some length.


Both kundgabe and beschreibung involve a report by an individual, the
observer, who senses and perceives the stimulus object. In one case he is
concerned with reporting the nature of the properties of the stimulus object,
and in the other with reporting the nature of the properties of his own response
to it. That the two may be quite different is perhaps best illustrated by
Titchener’s classic case of the determination of the two-point limen. When the
two points of the instrument are quite close together they may “feel” like
only one point. Nevertheless, because there may be a pattern of sensation
somewhat different, being rather stretched out, than the sensation that is
obtained when only one pick is touching the skin, the observer has good
grounds for guessing that his skin is being touched by two picks. To use
Tolman’s neologism, if the observer is kundgabe-ing he will report “two,”
and if he is beschreibung-ing he will report “one.” But, as Titchener pointed
out, generally we are careless and “unscientific,” and as a consequence our
reports are neither clearly kundgaben nor beschreibungen, but rather are a
confused mixture of the two.

While it is something of a jump to say that the process of rating can be
conceived of in terms of kundgabe and beschreibung, perhaps it is not too great 
a one. For certainly on the one hand we could think of the rater as giving 
information about the properties that actually exist in the stimulus person, 
and on the other as describing the nature of his thoughts about him. As pointed 
out earlier, the emphasis in rating has been entirely on the former. Almost 
universally the position is taken that ratings should be objective, giving as 
accurate a picture as possible of the properties of the stimulus person. The 
rater’s views about those properties are said to be entirely irrelevant and 
would, indeed, introduce error. Kundgabe, then, is prescribed, and beschreibung 
is proscribed. When beschreibung rather than kundgabe is reported, it could 
in this framework be termed the rater’s error. 

Yet it seems unlikely that ratings can be considered to be kundgabe, and 
certainly not kundgabe in its pure form. That is, it is not possible to attain 
objectivity in its pure form. Furthermore, as will be seen, it would seem that 
the very procedures that are employed in obtaining ratings force the rater into 
the rater’s error, to beschreibung. In addition, certainly the case is particularly 
strong that qualities for which tangible manifestations are but partial signs 
must be taken to be beschreibungen, for they have no given set of objective 
stimulus qualities. 

The point of view of kundgabe takes the mental response of an individual 
to a stimulus object as being independent of the nature of that individual. 
He is, essentially, merely a registering device. The point of view of beschreibung 
is just the opposite, regarding the mental response as being necessarily de- 
pendent upon the individual. In effect, the point of view of kundgabe conceives 
of the individual as being a simple stimulus-response mechanism, a notion long 
ago given up in psychology. It is difficult to conceive of any circumstance 
wherein the individual does not participate in, and contribute to, the
stimulus-response sequence. Clearly, then, it is impossible to have a situation wherein
there is pure kundgabe, a direct mirroring of the properties of the stimulus 
object through the responses of the individual. Consequently, it is apparent 
that ratings cannot be perfectly objective, objective in the sense that they 
reproduce without distortion the actual properties of the stimulus person. 

However, one seldom, if ever, sees claims to the effect that ratings are
perfectly objective. Nevertheless, it is argued that under proper circumstances
it is possible to have ratings closely approximate objectivity. In effect it is
admitted that it is impossible for ratings to be true kundgaben because the
individual always inserts something of himself into the ratings. But it is said
that with proper circumstances, such as training raters, carefully defining
terms, and utilizing behavior-oriented rating methods, it is possible to attain a
high degree of objectivity in ratings.

It would be argued that if ratings were beschreibungen and therefore were 
characterized by the rater’s error, then the rating assigned to an individual 
would be a distortion of the true amount of the property he possesses or
manifests. Hence, the rating is not objective, but rather it is biased. The
distortion is ascribed to the fact that the rater is an imperfect “meter,” and
the ratings he assigns are biased by irrelevant considerations. But what are
the criteria of bias, or lack of objectivity in ratings? Doubtlessly, it would
be said that evidence of bias is furnished by low correlations among the
ratings assigned by different raters to the same group of persons, nonnormal
frequency distributions of the ratings assigned to a group of individuals on a
particular property, and a low correlation between ratings on a particular
property and the true amounts of the property the individuals actually possess
or manifest. 

The correlation between the ratings assigned by different raters is not a 
measure of objectivity but of reliability of measurement. Suppose, for example, 
that both a projective test and a test comprised of multiple choice items were 
available for measuring a particular trait. If the reliability coefficient of the 
projective test were found to be higher than that of the multiple choice test, 
it would not be concluded that the projective test is the more objective.
Rather, it would be concluded that in spite of its lack of objectivity, it measures 
with less error. 

To say that nonnormal distributions of ratings constitute evidence that the
ratings are biased presumes that it has been established that the correct and
true distribution of the property being rated is indeed normal. Now while
normal frequency of traits is both a pleasant and useful assumption, it is
nothing more or less than an assumption. Furthermore, if it does have any
validity, it simply means that it holds for the population, but not for each
and every sample drawn from it. In fact, ratings are almost certainly not
obtained on a representative sample of the population. The ratings will be of
groups, such as those children who are accepted [...], those workers who
succeeded in being hired [...], [...] admitted into a particular educational
specialty. Thus, when the shape of the distribution of a set of ratings
is nonnormal, it cannot be said that the ratings are necessarily biased. As a
matter of fact, in view of the foregoing argument, a good case could be made
for saying that when ratings are normally distributed, that is positive evidence
of bias.

Since to be objective is to have the precise, tangible, or manifest properties 
of a material object, the only proper index of the objectivity of ratings is the 
extent to which they reflect the properties of the stimulus person. Objectivity, 
then, is indicated by the degree to which ratings on some property are cor- 
related with the true amounts of the property the individuals actually possess 
or manifest. This is, of course, the criterion validity of the ratings, and, to be
exact, the concurrent criterion validity. 

The criterion is always a problem in validity, even in the case of concurrent 
validity. This is quite obviously the case, for if the criterion measure were 
available, the substitute would not be used. The substitute is never as good 
as the original, and the question always is how poor a representation of the 
qualities measured by the criterion is the substitute. Certainly in practice 
criterion validity, even in the case of concurrent validity, is at best limited. A 
validity coefficient of the order of .80 for any kind of device used to measure 
psychological qualities almost certainly would be a maximum, and very likely 
the figure would be quite substantially lower in any particular instance. So 
under the very best of circumstances only a very modest degree of objectivity 
could be claimed for ratings. This means, then, that necessarily there must be 
some beschreibung in ratings, and if there is any beschreibung at all there can- 
not be pure kundgabe. 

There are all manner of psychological properties that can be used to describe 
people, and many of these properties are not by any means completely
independent of the measuring device. Consider, for example, the trait of
cooperativeness. Without giving the matter much thought, one is likely to say that
the person who offers help is cooperative, and the one who does not is
uncooperative. But the frequency with which a person offers help actually has
little to do with cooperativeness. All raters may agree that one person is
cooperative even though he does not often offer help because they feel confident
that they can count on him if they do need aid. They may also agree that
another person is uncooperative because he does not let them alone, but
continuously interrupts them proffering assistance. Cooperativeness, then,
cannot be measured in terms of a person’s behavior, but only in terms of the
perceptions of him on the part of those with whom he comes in contact. There
is a whole variety of other traits of this same sort. Obviously, many of the
social and personality traits, such as friendliness, fall in this category. But so
do a number of abilities of one kind or another, such as musical talent, the
assessment of which is entirely on the basis of the judgment of the members
of a society.

So, there are properties that are descriptive of the person but that nevertheless
can only be measured in terms of the impressions other individuals have
about him. The reports of these individuals about their impressions, their
ratings, clearly are beschreibungen. If the ratings assigned to a group of
individuals happen to be substantially related to measurements obtained by
other means on qualities that nominally are the same or similar, this is very
interesting, but it hardly can be taken as evidence of the objectivity of the
ratings, for the descriptions obtained by means of the rating procedure are
the impressions others have about those stimulus individuals. Such positive
correlations might be taken to indicate some general quality that underlies
a series of measurements obtained by different means, and perhaps they would
suggest something about what has determined the impressions of the raters
that they report. Coefficients of correlation between ratings of such properties
and other measurements obtained by some different means are just indices of
relationship and are not validity coefficients.

Descriptions of such properties as these have value only when there is
agreement among different raters who evaluate those properties in the stimulus
individuals. If this were not the case, they would be completely idiosyncratic
to the various raters. As a consequence, a given description would have little
association with the stimulus person and so would have no value as pertaining
to a characteristic of him. So the reliability of ratings, the degree of correlation
among them as assigned by different raters, is useful not as an index of the
objectivity of the ratings, but as an index of the generality of the particular
impression the stimulus individual makes on others.

Ratings are the reports made by the rater about the properties of the
stimulus persons. Rating procedures provide the means for the rater to make
his reports, and in doing so they prescribe the ways in which the rater can
make his report. There are great differences among the various rating methods
in the specific ways the rater is required to make his report. The simplest
rating scales merely specify a trait, such as intelligence or friendliness, and
then require the rater to indicate where on a numerical scale, perhaps consisting
of the numbers 1 to 5, he believes each stimulus person falls. Sometimes the
standards on the rating scale consist of graded adjectives, such as “little” or
“superior,” or, as is commonly recommended, graded samples of the property,
such as the following: “Is seldom able to cope with even elementary problems,”
or, “Is rather an outgoing person.” Even the more sophisticated rating
procedures, such as the weighted check list, provide samples of the property
which are graded. In a weighted check list the rater is required to indicate
which of 30 or 40 statements like the following are characteristic of the
stimulus person: “Usually he is careful and accurate in his work,” “He is
quick in developing new ideas and ways of attacking problems.”

As is illustrated in the foregoing examples, the ways in which the rater 
is obliged to make his reports are not in terms of the absolute properties as
they exist in the stimulus person, as perhaps manifest in his behavior, but
rather in terms of the value the rater puts on them. The report may be given
in quite abstract terms, such as the numbers on a numerical rating scale, or
it may be given in terms of the interpretations the rater may make, such as
“often,” “little,” and “superior.” But they seldom, if ever, are given directly
in terms of the frequencies or degrees in which the property is in fact manifest
by the stimulus person. A person does not “often” make mistakes in numerical
computation; rather when faced with arithmetic problems, he errs in solving
a certain proportion of them. A person does not manifest “little” initiative;
rather he never takes action on his own but only when prodded by someone
else. A person is not “superior” in selling insurance; rather he sells a particular
dollar volume of insurance. 

Typically, then, the amounts or degrees of the properties being assessed
are not given by the rater in terms of the frequencies or degrees of the property
as a nonhuman “meter” would give them. Indeed, the rating procedures
employed almost always forbid him from doing so. Consequently, if ratings
are kundgaben, then many, if not most or even all, of the procedures used in
obtaining them are quite shoddy devices for obtaining measurements of human
properties, for they are not measuring those properties in terms of the
frequencies or degrees along the dimensions on which they vary. The surveyor
who says that a piece of land is small, rather than saying it is a 25-foot by
40-foot rectangle, is not measuring the size of the land but is reporting his
impression of it. A banker who says that an account is fairly substantial,
rather than saying it contains $50,000, is not measuring the size of the account
but is reporting his impression of it. Neither the surveyor nor the banker is
giving kundgabe, information, but quite the contrary they are giving
beschreibung, a description of their reactions to the stimulus object. So similarly a
rater who checks a stimulus person as being 4 on a 5-point scale of decisiveness,
or as being superior but not excellent in the trait, or on a scale of graded
samples of the property or in a weighted check list selects the statement, “Does
not take long in making a decision,” rather than saying that on the average
he takes 15 minutes in arriving at decisions, is not measuring the decisiveness
of the stimulus person but is giving his impression of him. The fact that the
rating procedure severely limits the kinds of reports the rater can make does
not make it less beschreibung and more kundgabe; it merely controls the ways
that he can use to describe his impressions.

By the very fact that ratings involve human beings as instruments, it would
appear doubtful that ratings can be considered to be kundgaben, or at least
very good kundgaben. Moreover, many of the properties that are rated have
existence only as impressions the rater has of the stimulus person, and therefore
reports of them clearly cannot be kundgaben. Furthermore, it would
appear that the very nature of the procedures used to obtain the reports of
raters defeats the possibility of ratings being kundgaben. It would therefore
follow that ratings should be taken as beschreibungen, descriptions of the 
rater’s reactions to the stimulus person. That is, the report the rater makes 
about a stimulus person is not a faithful reflection of the qualities that person 
possesses or manifests, but rather is a report of his impression of that person, 
a description of his mental reaction to him. This reaction, of course, is condi- 
tioned by his social and cultural background. 

As was admitted earlier, the validity of applying the notion of a distinction 
between kundgabe and beschreibung to the process of rating may well be 
doubtful. Very likely it is not good to take a concept that has a special meaning,
a concept hallowed in history of thought by the sweat of those who sought to
make sense of it and with the blood of those who contested hotly about it.
Nevertheless, casting ratings in terms of kundgabe or beschreibung,
stretching the notion from its original connotation with respect to the report
of the content of consciousness, is of considerable aid in understanding ratings.
It leads to the position that ratings must be considered to be reports by the
rater of his impressions or perceptions of the stimulus person, his opinions or
judgments about him, and not reports of the objective, tangible properties of
the stimulus person. In making his rating the rater receives certain information
about the stimulus person and to this adds other data, such as the circumstance
under which the person operates, social and personal values, standards
of performance, and the like. They are all melded together and result in an
assessment of the individual which then is reported in the form of a rating.

The point just made is akin to the point Bingham (1) sought to make in
his classic article on halo in ratings. Indeed, the dichotomy drawn here between
what is loosely termed kundgabe and beschreibung also has a certain
similarity to that drawn by Stevens (6) about judgments. Stevens differentiates
between the seeking of information about the characteristics of the
stimulus object, and the seeking of understanding about the way in which the
responding individual operates.

To sum up, if the foregoing arguments prove to be persuasive, and ratings
are accepted as beschreibungen in the generous use made of the term here,
then the objectivity and validity of ratings become considerably clarified. Quite
obviously ratings cannot be considered to be objective, and any description of
them as such is quite incorrect. This is not to say that the focus of attention in
ratings cannot be on behavior rather than on some abstract trait. But it is to 
be expected that when ratings of behavior are obtained, they must be taken 
to be descriptions raters give about their impressions, opinions, or judgments 
about the behavior of a stimulus individual, and not quantified descriptions of 
the behavior itself. All kinds of factors enter into and influence ratings, and 
ratings would be considered to be biased when some of these factors are un- 
acceptable social values. Consequently, it is important to understand the sorts of 
factors that influence and are related to the ratings an individual makes, so that 
those that run counter to current mores can be ascertained. The validity of rat- 
ings is not to be judged by the correlation with other independent measures of 
the property being assessed. Validity of ratings would have to be defined as the
extent to which the rater’s reports accurately reflect the impression, opinion, 
or judgment he has formed of the stimulus person. To obtain ratings that have
a high degree of validity, it would be necessary to insure that those circumstances
exist wherein the rater is prepared to report his impressions and is able
to. Therefore such matters as the training of raters to report their impressions
properly, and the removal of social pressure on them so that they are willing
to report their true impressions, become of prime importance.


GROUP RISK TAKING AS A FUNCTION OF THREE TYPES
OF MONETARY INCENTIVE*


Memphis State University


Michael Lupfer, Mark Jones, and Cecilia Quin


A. Introduction


The assumption that real and hypothetical monetary incentives differentially
influence risk-taking behavior continues to be an assumption lacking consistent 
verification. Kogan and Wallach (4) have proposed that real monetary incentives
yield greater conservatism than hypothetical monetary incentives. They
suggest that real money induces heightened concern about potential gains and, 
even more, about potential losses, leading to more conservative decisions. Ex- 
perimental evidence supporting the Kogan-Wallach hypothesis, however, has 
been inconsistent. Among studies demonstrating the expected difference (1, 2, 
7, 9, 10), either one or both of two conditions prevailed: (a) the two types of 
incentive were presented as a within-Ss variable—i.e., each S received both
types of incentive; and/or (b) the task was a probability learning problem
where Ss were asked to predict on each trial which of two uncertain outcomes 
was most likely to occur. Among the studies failing to demonstrate a difference 
(3, 8), either one or both of two conditions prevailed: (c) the two types of 
monetary incentive were presented as a between-Ss variable—i.e., each S
received only one type of incentive; and/or (d) Ss were presented the
Myers-Sadler task (6) where they chose between a certain and an uncertain outcome.

In none of these studies was an attempt made to assess whether real and 
hypothetical monetary incentives differentially influenced Ss’ concern for potential
gains and losses. Locke (5) has presented evidence supporting his thesis
that monetary incentives influence task performance only to the extent that
incentives modify the performer’s motivational intentions; if incentives produce
no motivational change, task performance is unaffected. Studies reporting no
difference in risk taking as a function of real and hypothetical monetary incentives
may have failed to induce a change in Ss’ concern for gains and losses.

There is a further reason for assessing such mediating factors as motivational 
states when investigating the effects of incentives on risk taking. A person's 
desire to maximize gains, his desire to minimize losses, his skill in performing 
the task at hand—all presumed to influence risk taking—may be affected by 
monetary conditions, but they are not likely to be solely determined by monetary 
conditions. To the extent that these mediating factors correlate with risk- 
taking differences in a manner not totally determined by monetary influences, 
they contribute to error variance if inadequately controlled. Failure to remove 
their influence from error variance may have obscured the differential effects 
of real and hypothetical incentives when presented as a between-Ss variable, a 
condition that did not prevail when the two types of incentive were presented 
as a within-Ss variable. 

Studies using the Myers-Sadler task may have failed to obtain expected 
results because too few trials were run. Whereas the payoff structure of the 
probability learning task contains only two outcomes, either +X¢ or −X¢,
which are quickly learned, the payoff structure of the Myers-Sadler task is much
more complex. Typically the uncertain outcome contains at least 10 different
possible payoffs. Thus players require more time to discover the parameters of
the payoff structure, which delays their learning to discriminate risky and
conservative choices. Among studies using the Myers-Sadler task, the maximum
number of trials was 100, whereas several hundred trials were ordinarily run in
studies using the probability learning task. Therefore the failure to find a
difference between real and hypothetical monetary incentives among studies
employing the Myers-Sadler task may have been due to the insufficient number of
trials.

Notwithstanding the inconsistent findings, there has been considerable
similarity among all of these studies. All focused on individual rather than
group risk taking. All relied on gambling situations requiring little skill on
the part of participants. In all cases, participants gambled with money provided
by E rather than their own money.

The present study was conceived to test the generality of the Kogan-Wallach
hypothesis. These questions were raised: Do real monetary incentives lead to
more conservative choices than hypothetical incentives among groups, as well as
individuals? Does the hypothesis apply when payoffs are influenced by the skill
of group members, as well as their propensity for risk and the vagaries of
chance? Is the hypothesis valid when members’ own money is at stake, even
more than when E’s money is at stake? Is the hypothesis valid only in those
instances in which incentive differences effect a change in the players’ motives,
as suggested by Locke's work? Affirmative answers to these questions were 
expected. 

The effects of three types of monetary incentive on group risk taking were 
examined: (a) imaginary or hypothetical money (H$), (b) real money provided
by E with the provision that profits be kept and losses not be reimbursed
(O$), and (c) real money provided by the persons themselves with the possibility
of realizing profits or losses (P$). The most risky bets were expected to
occur when H$ was in effect, the most conservative when P$ was in effect.


B. Method
1. Item Selection for the Jeopardy Game 


The Jeopardy quiz game, a modified version of which provided the vehicle 
for group betting, is played by providing participants with a series of categories 
(e.g., British Empire, television, United States geography), each category
containing several answers graded in difficulty. A player’s task is to give the
correct question for each answer. For example, the moderator might say, “The
category is ‘Famous Pairs.’ The answer is ‘The last Tsar and Tsarina of 
Russia.’ What is the question?” If the player says, “Who were Nicholas and 
Alexandra?,” he wins the amount of money attached to that item; if his 
response is incorrect, he loses that amount of money. 

For the study, 19 categories were used, and a pool of 325 items, or approximately
17 items per category, was assembled. In order to assess the difficulty
level of each item, the pool of items was administered to 78 undergraduate
psychology students. The proportions of correct (p) and incorrect (q) responses
to each item were determined. Any item yielding significantly different
p values for males and females was deleted. Since three-member groups were to
be used in the study, the difficulty level of each item was calculated according
to the formula 1 − q³, a formula giving the probability that at least one
member of any randomly selected three-member group would respond to the
item correctly. A total of 279 items was retained for use in the study.
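
The group-level difficulty index can be illustrated with a short sketch. It
assumes, as the formula 1 − q³ itself does, that the three members respond
independently. The function name and the sample proportions are hypothetical
and are not taken from the study's item pool.

```python
def group_difficulty(p: float, group_size: int = 3) -> float:
    """Probability that at least one member of a randomly formed group
    answers an item correctly, given the individual proportion correct p.

    Assumes members respond independently (an assumption of this sketch).
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a proportion between 0 and 1")
    q = 1.0 - p                    # proportion of incorrect responses
    return 1.0 - q ** group_size   # 1 - q^3 for three-member groups


# Hypothetical individual proportions correct, for illustration only.
for p in (0.10, 0.50, 0.90):
    print(f"p = {p:.2f} -> group value = {group_difficulty(p):.3f}")
```

An item that half of the individuals answer correctly (p = .50) thus has a
group value of .875, which shows how much easier an item becomes when any one
of three members may supply the answer.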


2. Selection of Subjects and Formation of Groups 


A 30-item Jeopardy test was constructed, consisting of items of varying
levels of difficulty and representing all 19 categories. The test was administered
to two groups of volunteers drawn from introductory psychology classes at
Memphis State University. One group of volunteers was to play under the O$
and H$ conditions, and the other group was to play under the P$ and H$
conditions. (For subsequent reference, the H$ condition that occurred in
conjunction with O$ is referred to as Ho$. The H$ condition that occurred in
conjunction with P$ is referred to as Hp$.) Each volunteer in the P$-Hp$
group was informed in advance that he would be required to stake $2 of his own
money in order to participate. The 95 volunteers in the O$-Ho$ group
averaged 11.3 items correct with an SD of 5.3; the 96 volunteers in the P$-Hp$
group averaged 10.5 items correct with an SD of 4.8. Only at the end
of the study were Ss informed of their scores on the test.

From each group of volunteers, 24 three-member groups were formed
according to two criteria: All members of any group (a) were of the same sex
and (b) obtained approximately equal scores on the pretest. In no group did
members’ scores on the pretest differ by more than 4 points. As shown in Table
1, 48 groups participated in the study (12 male and 12 female groups under
O$-Ho$, 12 male and 12 female under P$-Hp$). The group averages on the
Jeopardy pretest ranged from 2.2 to 23.0 with a mean of 11.2.


3. Procedure 


For groups playing under O$-Ho$, the following procedure was followed.
Each group met for approximately one hour to play the Jeopardy game. After
learning the game rules and engaging in two practice plays, a group was
informed that the game would be played in two portions, each portion
containing 15 plays. During the O$ portion, the group was given a $6.00 stake of
actual cash and was permitted to keep any profits accumulated over the 15
plays. If the group’s accumulated total after 15 plays was equal to or less than
$6.00, the group won no money. During the Ho$ portion, the group was staked
to $6.00 of hypothetical money and merely instructed to perform as well as
possible. The order of presenting these two incentive conditions was
counterbalanced within each sex.

For groups playing under P$-Hp$, one alteration was made. To create the
P$ condition, each group member had to submit $2.00 of his own money, with
the knowledge that a profit might be realized but also that some or all of the
$2.00 might be lost.
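
The payoff rules for the three kinds of incentive can be summarized in a short sketch. It is offered only as an illustration. The function names are hypothetical, amounts are held in cents, and two points are assumptions rather than reported details: that the three $2.00 contributions under P$ form a pooled $6.00 stake, and that a group's loss under P$ cannot exceed that stake.

```python
STAKE_CENTS = 600  # $6.00 stake per group for each 15-play portion


def final_total(bets_cents, outcomes, stake=STAKE_CENTS):
    """Add each bet after a correct play and subtract it after an incorrect one."""
    total = stake
    for bet, correct in zip(bets_cents, outcomes):
        total += bet if correct else -bet
    return total


def cash_result(condition, total, stake=STAKE_CENTS):
    """Net change in real cash for the group, in cents."""
    if condition == "O$":
        # Other's money: profits are kept, losses are not charged to the group.
        return max(0, total - stake)
    if condition == "P$":
        # Personal money: profits are kept and losses are borne.
        # Assumption: the loss is capped at the pooled stake.
        return max(-stake, total - stake)
    if condition in ("Ho$", "Hp$"):
        # Hypothetical money: no cash changes hands.
        return 0
    raise ValueError(f"unknown condition: {condition}")


total = final_total([100, 50, 20], [True, False, False])
print(total, cash_result("O$", total), cash_result("P$", total))
```

Under this reading, the same final total yields nothing under O$ when it falls below the stake, but a real loss under P$.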

Immediately before playing under each incentive condition, Ss were asked to
state the strength of their motives to “win as much money as possible” and “to
lose as little money as possible.” The Ss recorded their responses on five-point
scales ranging from “indifferent to winning (losing) money” to “strong desire
to win as much (lose as little) money as possible.” By averaging the members’
responses, composite measures of each group’s motives to maximize gains and
minimize losses were obtained. In addition, each S responded to the question,
“During this part of the game, which do you expect will influence your
decisions more: the desire to win as much money as possible or the desire to lose
as little money as possible?” Responses provided an ancillary measure of Ss’
motives.


TABLE 1
DESIGN OF THE STUDY

                              Incentive conditions II:
                              Real (O$ or P$) vs. hypothetical
Incentive conditions I:       Period 1        Period 2
Order and sex*                (15 plays)      (15 plays)

O$-Ho$
  Real-hypothetical
    F                         O$              Ho$
    M                         O$              Ho$
  Hypothetical-real
    F                         Ho$             O$
    M                         Ho$             O$
P$-Hp$
  Real-hypothetical
    F                         P$              Hp$
    M                         P$              Hp$
  Hypothetical-real
    F                         Hp$             P$
    M                         Hp$             P$

* N = 6 groups for each sex within each order under each of the incentive
conditions I.


To begin a play, each S wrote privately the amount he felt the team should
bet on the play. Bets were permitted to range from 20¢ to $1.00, in multiples
of 5¢. Though not informed in advance of the subject matter of the item, Ss
were told that low bets would yield easy items, high bets difficult items.²

After all group members had indicated their personal betting preferences,
they conferred with each other and reached consensus on a group bet. The
value of the group bet determined the bet category from which E randomly
selected an item and read aloud the category and answer. Each S privately
wrote what he believed the correct question to be, following which the group
again reached consensus and announced its decision. The group was informed
whether or not it had provided the correct question, and the bet was accordingly
added to or subtracted from the cumulative total.
Upon completing 30 plays, Ss filled out a brief questionnaire which solicited
their reactions to the study and to each other. The purpose of the study was
then explained, and Ss were dismissed.

² Items of difficulty level 1 - q were assigned to bet category X according to the
equation X(1 - q) = 15¢. For example, items with a 1 - q value of .30 were
assigned to the 50¢ bet category, regardless of subject matter; items with a 1 - q
value of .15 were assigned to the $1 bet category, etc. Thus the expected gain attached
to any bet always equalled 15¢. But expected losses were not similarly equated, the
easy items having an expected loss of less than 15¢ and the difficult items an expected
loss exceeding 15¢. Since the total expected value was positive for easy items and
negative for difficult items, the reward structure of the game favored conservative
betting.
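
The footnote on bet categories implies a simple expected-value schedule, which the following sketch works out. It assumes, as the footnote's examples suggest, that 1 - q is the probability of a correct response for an item in bet category X, so that X(1 - q) = 15¢. The function names are illustrative only.

```python
from fractions import Fraction

EXPECTED_GAIN = 15  # cents; X * (1 - q) is held at 15 for every bet category


def success_probability(bet_cents):
    """Value of 1 - q implied by X * (1 - q) = 15 cents."""
    return Fraction(EXPECTED_GAIN, bet_cents)


def expected_loss(bet_cents):
    """Bet size times the probability of an incorrect response, q."""
    return bet_cents * (1 - success_probability(bet_cents))


def expected_value(bet_cents):
    """Expected gain minus expected loss for one play."""
    return EXPECTED_GAIN - expected_loss(bet_cents)


# Bets ran from 20 cents to $1.00 in multiples of 5 cents.
for bet in range(20, 105, 5):
    print(bet, float(success_probability(bet)), float(expected_value(bet)))
```

Under these assumptions the expected value reduces to 30¢ minus the bet, so it is positive only for bets below 30¢ and turns increasingly negative as the bet rises, which is the sense in which the reward structure favored conservative betting.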


C. RESULTS
1. Selection of Covariates


Analysis of covariance was selected as an appropriate method for evaluating
the betting data. This design permitted the removal from the error term of
any significant covariance between group betting and (a) group motive to
maximize gains, (b) motive to minimize losses, and (c) group ability.³ Before
analysis of covariance was performed, each of these three intended covariates
was correlated with group betting across 48 groups with use of the method of
partial correlation. The results revealed significant correlations of -.45 (p <
.01) between group betting and group motive to minimize losses, and +.60
(p < .01) between group betting and group ability. The intended covariate,
motive to maximize gains, failed to correlate significantly with betting (r =
-.03, p > .20) and was dropped.
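
For readers unfamiliar with partial correlation, a minimal sketch of the first-order case follows. The text does not state which variables were partialled out of each coefficient, so the choice of a single control variable here is an assumption, and the data are invented solely to show the arithmetic.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def partial_r(x, y, z):
    """Correlation between x and y with z held constant (first-order partial)."""
    rxy, rxz, ryz = pearson_r(x, y), pearson_r(x, z), pearson_r(y, z)
    return (rxy - rxz * ryz) / sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


# Invented values for five groups: betting sums, ability, loss-avoidance motive
bets = [6.0, 7.5, 8.0, 9.0, 9.5]
ability = [5.0, 9.0, 11.0, 14.0, 18.0]
motive = [4.5, 4.0, 3.0, 3.5, 2.0]
print(partial_r(bets, motive, ability))
```

When the control variable is unrelated to both of the other variables, the partial coefficient equals the ordinary correlation.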


2. Relation of Covariates to Treatment Conditions 


To determine whether any covariate was related to treatment conditions,
each covariate was submitted to a Plan 9, Latin-square design for repeated
measures (11, pp. 554-563). The design paralleled that shown in Table 1.
Included were three between-Ss variables: incentive conditions I (either
playing with other’s money or with personal money relative to hypothetical
money); sex; and order of incentive conditions (either real-hypothetical or
hypothetical-real). Also included were two within-Ss variables: incentive
conditions II (either real or hypothetical money) and period of play (either first
or second 15 plays).


³ The use of betting preferences as a measure of risk taking presumes that the
probabilities of gain and loss attached to each betting option are the same for each
group and that payoffs are governed by chance. But when the betting occurs in a
context involving a skill that participants possess in differing degrees, as in this study,
such a presumption is not tenable. The probabilities of gain and loss attached to each
bet were not the same for groups of differing ability. Indeed, the correlation of +.60
between group ability and group bets indicated that group betting per se could not be
used as a measure of risk taking, since any group’s bets would reflect its ability as well
as its inclination toward risk. So that a group’s bets would provide an index of its
tendency toward risk independent of its ability, the covariance between group ability
and group betting was extracted from the analysis of bets.


TABLE 2
PROPORTIONAL STRENGTH OF MOTIVES UNDER EACH INCENTIVE CONDITION

                            Incentive condition
Motive                Ho$        O$         Hp$        P$
Maximizing gains      .694_a     .681_a     .583_ab    .486_b
Minimizing losses     .306_a     .319_a     .417_ab    .514_b

Note: All means except those sharing the same subscript are significantly different from
one another at the .01 level.


One significant relationship emerged. Groups playing with their own money
(P$) expressed a greater desire to minimize losses than groups playing under
the other’s hypothetical money (Ho$) condition (F = 7.42, p < .01). These
data provided limited support for the hypothesis that different monetary
conditions would alter the players’ motivational state, in this case their desire to
minimize losses.


3. Effects of Incentive Conditions on Motivation


To examine the effect of incentives on motives more fully, the ancillary
measures of Ss’ motives were examined. Prior to playing under each incentive
condition, each S answered the question, “Which do you expect will influence your
decisions more: the desire to win as much as possible or the desire to lose as
little as possible?” Responses were assumed to reflect the relative strength of
Ss’ motives to maximize gains or to minimize losses. Frequencies of choices
under each incentive condition (Ho$, O$, Hp$, P$) were tallied and
converted to proportions. As shown in Table 2, Ss playing under P$ were
significantly more influenced by a desire to minimize losses than were Ss playing under
Ho$ or O$. Indeed, Ss playing under Ho$ and O$ reflected a significant
preference for maximizing gains. Altogether these data suggested that the Ho$,
O$, and Hp$ conditions did not differ in their effect on players’ motives, but that the P$
condition induced a heightened concern for minimizing losses.

This ancillary motivational measure provided a clearer picture of incentive
influences on motives than the interval ratings of motive strength,
recommending the former over the latter measure in the analysis of covariance of betting
data which followed. But the nominal nature of the ancillary measure
precluded its use in the analysis of covariance, whereas the interval ratings, even
if less revealing, satisfied the requirements for inclusion.


4. Analysis of Group Betting


Each set of 15 bets made by a team playing under a specified incentive
condition was summed, providing data for a Latin-square analysis of variance
analogous to that performed on each covariate. The results revealed three
significant sources of variance in the betting data: incentive conditions I (F =
9.41, p < .001); incentive conditions II (F = 10.56, p < .001); and period
of play (F = 31.99, p < .001).

Analysis of covariance, using the two covariates found to correlate with
betting, was then performed upon the same data and, once again, the same
three sources of variance emerged as significant at the .001 level. As shown in
Table 3, the results indicated that more conservative betting occurred (a)
when groups played under P$-Hp$ conditions than when they played under
O$-Ho$ conditions, and (b) when groups played under real monetary
incentives than when they played under hypothetical monetary incentives. However,
the interaction between incentive conditions approached significance (F =
3.75, p < .10), suggesting certain qualifications to these findings. As
hypothesized, significantly more conservative betting occurred when a group’s
personal money (P$) was gambled. But inconsistent with expectation,
gambling with other’s real money (O$) did not yield greater conservatism than
gambling with hypothetical money (H$). In fact, greater risk was observed
under O$ than under Hp$, whereas no reliable difference was observed between
O$ and Ho$ conditions.


TABLE 3
ADJUSTED MEAN BETTING SUMS UNDER EACH INCENTIVE CONDITION

                            Incentive conditions II
Incentive                   Real money          Hypothetical money
conditions I                (N = 48)            (N = 48)
O$-Ho$ (N = 24)             O$ = $8.43_a        Ho$ = $8.73_a
P$-Hp$ (N = 24)             P$ = 6.23           Hp$ = 7.54

Mean                        7.34                8.12

Note: Within each category, all means except those sharing the same subscript are
significantly different from one another at the .01 level.

Analysis of the significant periods effect indicated that greater risk taking
occurred during the second half of the game than during the first half,
regardless of the incentive condition operating at the time.


D. Discussion 


As predicted, more conservative betting was evidenced by groups gambling
with their own money than with other’s money or with hypothetical money.
Contrary to prediction, groups gambling with other’s money bet no more
conservatively than groups gambling with hypothetical money. Confirmation of
the former prediction and disconfirmation of the latter prediction is
understandable in view of the influence of the incentives on the motive to minimize
losses, a motive tending to induce conservatism. Whereas the H$ and O$
conditions produced no differential effect on the motive, the P$ condition
increased its strength. Thus the overall findings supported Locke’s proposition
that monetary incentives affect task performance, in this case risk taking, only
to the extent that incentives modify the performers’ motivational state.

Notwithstanding the general coherence of the findings, reservations must be 
registered. The data, while indicating that the motive to minimize losses is a 
significant factor mediating the influence of monetary incentives on group risk 
taking, point to the presence of other mediating factors. After all, the
significant relationship between incentive conditions and risk taking was not basically
altered by extracting the covariance attributable to the motive along with the 
covariance due to feedback and ability. What might these other mediating 
factors be? The motive to maximize gains appears not to be an important vari- 
able on the basis of the present evidence, though questions about the crudity of 
its measurement might be raised. On the other hand, the mediating effect of a 
response set was suggested. The most anomalous finding in the configuration 
of results—the relative conservatism displayed by groups under Hp$—probably 
resulted from a generalized conservatism induced by the condition with which 
it was paired, the P$ condition. If this interpretation is correct, future
investigations in which incentives are manipulated within-Ss must attempt to control
for this factor. 

The increased tendency toward risk during the second portion of the game 
recommends the temporal factor as a potentially important mediator. In fact, 
earlier studies revealed a differential effect of H$ and O$ monetary conditions 
only after numerous trials, a finding that might have been replicated in this 
study if groups had played longer. 

Finally, group structural and membership variables, totally ignored in this 
investigation, should be explored. Had the groups interacted sufficiently for a 
stable influence hierarchy to emerge, weighted measures of the motivational 
variables could have been derived on the basis of the degree of influence
exercised by each person on the group’s bets. As always, the use of American college
students as subjects raises questions about the generality of results.

Beyond the specific findings, this study suggested that a broader perspective 
be adopted than heretofore when examining the influence of monetary incen- 
tives on risk taking. It is not enough to ask merely: Do real and hypothetical 
incentives make a difference? One must consider the sources of real money, 
the extent to which players are held accountable for losses, and the impact of 
incentives on the players’ motivational intentions.


E. SUMMARY


The effect upon group risk taking of three types of monetary incentive was
examined: gambling with hypothetical money (H$), gambling with other’s
real money with the provision that profits be kept but losses not be reimbursed
(O$), and gambling with one’s personal money with the possibility of realizing
profits or losses (P$). The most conservative betting was expected under the
P$ condition, the most risky under the H$ condition.


A total of 24 three-member groups played the “Jeopardy” game under the
O$ and H$ conditions, another 24 teams under the P$ and H$ conditions. As
predicted, more conservative betting occurred under P$ than under O$ or H$.
Contrary to prediction, the O$ condition did not produce more conservative
betting than the H$ condition. The results were attributed to the mediating
influence of the motive to minimize losses. Whereas H$ and O$ had no
differential effect on the motive, P$ increased the motive’s strength, eventuating in
more conservative betting.


REFERENCES


1. Edwards, W. The reliability of probability preferences. Amer. J. Psychol., 1954,
67, 68-95.

2. Goodnow, J. J. Determinants of choice distribution in two-choice situations. Amer.
J. Psychol., 1955, 68, 106-116.

3. Katz, L. Monetary incentive and range of payoffs as determiners of risk taking.
J. Exper. Psychol., 1962, 64, 541-544.

4. Kogan, N., & Wallach, M. A. Risk taking as a function of the situation, the
person, and the group. In T. M. Newcomb (Ed.), New Directions in Psychology
III. New York: Holt, Rinehart & Winston, 1967. Pp. 111-278.

5. Locke, E. A. Toward a theory of task motivation and incentives. Organiz. Behav.
& Hum. Perform., 1968, 3, 157-189.

6. Myers, J. L., & Sadler, E. Effects of range of payoffs as a variable in risk taking.
J. Exper. Psychol., 1960, 60, 306-309.

7. Myers, J. L., Fort, J. G., Katz, L., & Suydam, M. M. Differential monetary gains
and losses and event probability in a two-choice situation. J. Exper. Psychol.,
1963, 66, 521-522.

8. Petersen, R. C., & Kanarick, A. F. Effects of incentive as within- and between-
subjects variables in risk taking. Psychon. Sci., 1968, 12(6), 279-280.

9. Siegel, S., & Andrews, J. M. Magnitude of reinforcement and choice behavior in
children. J. Exper. Psychol., 1962, 63, 337-341.

10. Swensson, R. G. Incentive shifts in a three-choice decision situation. Psychon.
Sci., 1965, 2, 101-102.

11. Winer, B. J. Statistical Principles in Experimental Design. New York: McGraw-
Hill, 1962. Pp. 554-563.


Department of Psychology
Memphis State University
Memphis, Tennessee 38111


Published as a separate and in The Journal of Psychology, 1972, 80, 283-288. 


SPATIO-TEMPORAL SAMPLING OF COMPLEX VISUAL 
CUES AND BEHAVIORAL TYPOLOGY** 


University of Pennsylvania and University of Tennessee 


THOMAS E. WEBB AND JAMES M. ANKER


A. INTRODUCTION 


Kagan (6) reviewed research suggesting that selectivity and duration of 
attention may be related to cognitive development and differentiation in young 
children. In the present report, we describe a simple attentional-behavioral model
based on the notion that differential sampling of environmental cues may also 
relate to stable behavioral patterns in adulthood. We will show first that our 
measures of selectivity and duration represent two uncorrelated attentional 
dimensions; second, that behaviorally defined groups sample information 
differently along these dimensions; and, third, that these group differences are 
accentuated under conditions of task uncertainty. 

Our model is two dimensional. A spatial dimension delineates the extensity 
or breadth of cues sampled within a given unit of time. The second dimension 
is temporal, denoting the extensity of cues sampled over units of time. The 
definition of temporal units was determined by serial modifications in the infor- 
mative patterns themselves suggesting the passage of time. Attentional strategies 
therefore could be expressed as vectorial combinations of spatial-temporal
polarities: high spatial-low temporal sampling (HS-LT), low spatial-high
temporal sampling (LS-HT), high spatial-high temporal sampling (HS-HT), 
and finally low spatial-low temporal sampling (LS-LT). 

Behavioral correlates for the four coding strategies were formulated. Exten- 
sive scanning of immediate cues but limited sequential search for additional 
information, the HS-LT strategy, suggested impressionistic cognition and re- 
sponse impulsivity, clinically described as hysteroid in character (Group I). 
Conversely, restricted and focal sampling at any given moment but extended 
sequential searching for information, the LS-HT strategy, was thought to 
correlate with exacting and overly restrained behavior coupled with ruminative 
indecision and obsessional tendencies (Group II). Under conditions of
maximal cue sampling in the HS-HT strategy, we expected high arousal and a
blending of impressionistic inference with vigilant cautiousness clinically
related to paranoid features (Group III). On the other hand, the minimal
input strategy, LS-LT, was expected to correlate with inappropriate and bizarre
schizophrenic behavior dominated by either idiosyncratic sampling of internal
cues or inadequate sampling of external, fragmentary ones (Group IV). To
test these hypothesized relations, an experiment was designed to classify
individuals into the four behavioral types clinically related to hysteroid, obsessive,
paranoid, and schizophrenic (nonparanoid) patterns and to evaluate their
spatio-temporal sampling preferences in a decision task. “Normal” subjects,
expected to be intermediate on both dimensions, were not studied.


B. METHOD
1. Subjects 


Forty-nine males whose behaviors were sufficiently deviant to warrant 
psychiatric intervention served as Ss. The men were from seven inpatient and 
outpatient facilities in western Illinois. The only preselection criteria were 
age (18-40 years), absence of organicity or visual anomalies, nonchronic 
institutionalization (56 weeks maximum hospitalization history), and at least 
average verbal intelligence (WAIS vocabulary scale ≥ 9). The sample was
characterized by young adult males representing a diversity of functional
disturbances, including 18 diagnosed as schizophrenic, 23 with various neurotic
and personality disorders, and eight without formal psychiatric diagnoses.

Behavioral and self-report data obtained during a structured interview, the
Psychiatric Status Schedule (7), and from psychometric testing with the
Minnesota Multiphasic Personality Inventory and the 16 Personality Factor
Questionnaire were combined to permit an estimate to be made of the
probability with which each S’s assessed characteristics matched each theoretical
group (2). Classification was determined by the individual’s pattern of mental
and emotional status as indicated on 54 criterion variables. The relative
frequency with which each criterion was expected to appear within a given
classification was estimated from available normative data (3, 5, 7) on four
psychiatric subgroups that seemed stereotypical of our behavioral typology:
conversion hysterics, obsessive-compulsives, paranoids, and nonparanoid
schizophrenics for Groups I through IV, respectively. These joint probabilities were
determined for each criterion variable as the proportion of cases in each
normative group expected to equal or exceed a cutoff score set at ±1.5σ within a
general psychiatric population, direction being determined by the nature of the
continuous variable.


Each S was assigned to the classification that provided the most probable
match with his unique pattern of attributes (e.g., anxiety, guilt-proneness,
social isolation, depression). The resultant groups were quite similar with
respect to age (F < 1) and estimated intelligence (F < 1). Groups I and II,
however, had averaged six weeks total hospitalization, while Groups III and
IV averaged 21 and 23 weeks. The classification procedure also produced
unequal Ns: 18, 9, 15, and 7 in Groups I through IV, respectively.
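
One way to picture a "most probable match" assignment is sketched below. This is not the authors' procedure, whose computational details are given in their reference (2). The sketch assumes that each criterion is scored as present or absent relative to its cutoff and that criteria are treated as independent, and it uses three invented criteria with invented rates in place of the 54 criterion variables.

```python
from math import log

# Hypothetical proportions of each normative group expected to reach the
# cutoff on each criterion (invented numbers, for illustration only).
NORMATIVE_RATES = {
    "I": {"anxiety": 0.30, "guilt_proneness": 0.20, "social_isolation": 0.10},
    "II": {"anxiety": 0.60, "guilt_proneness": 0.70, "social_isolation": 0.30},
    "III": {"anxiety": 0.40, "guilt_proneness": 0.20, "social_isolation": 0.50},
    "IV": {"anxiety": 0.30, "guilt_proneness": 0.10, "social_isolation": 0.80},
}


def log_match(profile, rates):
    """Log probability of a present/absent profile, assuming independent criteria."""
    return sum(
        log(p if profile[criterion] else 1.0 - p)
        for criterion, p in rates.items()
    )


def classify(profile, normative_rates=NORMATIVE_RATES):
    """Return the group whose normative rates best match the profile."""
    return max(normative_rates, key=lambda g: log_match(profile, normative_rates[g]))


subject = {"anxiety": True, "guilt_proneness": True, "social_isolation": False}
print(classify(subject))
```

Working in logarithms avoids numerical underflow when many criteria are multiplied together, which would matter with 54 variables.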


2. Procedure 


To test our model, the decisional task required an informational source 
whose spatial and temporal characteristics could be modulated independently. 
After several pilot studies, we developed a procedure utilizing kinesic informa- 
tion (body movement), a source considered by some investigators to be 
important in the judgment of human affect (1, 4). With this type of stimulus 
material, cue distribution could vary spatially from narrow head areas to the 
broad inclusion of contextual features. Furthermore, serial presentations could 
be used to connote temporal factors associated with the emergence and waning 
of affective states. Accordingly, 20 stimulus pictures, showing two male life- 
size manikins (stimulus and reference figures) in different postures and 
orientations to each other, were selected empirically to simulate either pleasant- 
low arousal (relaxation), unpleasant-low arousal (hopelessness), pleasant-high 
arousal (eagerness), or unpleasant-high arousal (irritation). Five pictures 
were serialized to simulate each of these affects. 

S sat three feet from a 16 × 18 inch opaque screen onto which the stimuli
were rear projected by a stereo slide projector. Below and in front of the
screen were six illuminated response keys. Four were used to express S’s
judgment (Relaxed, Eager, Hopeless, or Irritated). One key signalled E to
present the next picture within a series, and the last indicated a desired change
in area of exposure (accomplished by changing paired apertures in the
projector). Every picture was presented initially with full contextual cues,
including both figures, while subsequent reductions showed a full view of the
stimulus figure, or the torso and head area, or finally the head area only. S
was instructed that he would be shown pictures on the screen of a man facing
him portraying different feelings such as relaxed, eager, irritated, or hopeless,
and that he should decide which of the four emotions was being portrayed.
After being shown preliminary practice trials illustrating the four stages of
spatial reduction and the five successive visual patterns, S was told that for
each series there would be five picture clues and that he could express his
decision at any point within the series. The four stimulus series were randomly
presented within three replicating blocks. Dependent measures were (a) for
spatial sampling, the average number of reductions per series, and (b) for
temporal sampling, the average number of successive patterns viewed prior to
expressing a decision.
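
The two dependent measures are simple averages over series. The sketch below shows one way to compute them; the record layout, the function name, and the permitted ranges (0 to 3 reductions, 1 to 5 patterns, inferred from the four exposure stages and five pictures per series) are assumptions made for illustration.

```python
from statistics import mean


def sampling_scores(series_records):
    """Return (spatial, temporal) scores for one S.

    series_records: list of dicts, one per series, with hypothetical keys
    "reductions" (0-3 changes of exposure area) and "patterns_viewed"
    (1-5 successive pictures seen before a decision).
    """
    for record in series_records:
        if not 0 <= record["reductions"] <= 3:
            raise ValueError("reductions must lie between 0 and 3")
        if not 1 <= record["patterns_viewed"] <= 5:
            raise ValueError("patterns_viewed must lie between 1 and 5")
    spatial = mean(r["reductions"] for r in series_records)
    temporal = mean(r["patterns_viewed"] for r in series_records)
    return spatial, temporal


# Invented records for the four series of one replication block
block = [
    {"reductions": 0, "patterns_viewed": 5},
    {"reductions": 3, "patterns_viewed": 1},
    {"reductions": 2, "patterns_viewed": 3},
    {"reductions": 1, "patterns_viewed": 3},
]
print(sampling_scores(block))
```

Scores computed separately for each of the three replication blocks would allow the change across replications to be examined.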


C. RESULTS AND DISCUSSION


Results indicated that across replications, spatial and temporal measures
were largely independent as required by the model (Pearson rs = -.13,
-.14, and -.08). Despite unequal Ns among the groups, homogeneity of
variance was indicated for both measures. Statistical tests revealed that
although no significant group difference emerged, there was a significant
group × replication interaction for both spatial and temporal measures
(F = 4.48, df = 6, p < .01; F = 3.16, df = 6, 18, p < .05, respectively). The
effect largely occurred during initial presentations (see Figure 1). Individual
means were compared for this initial level (9), and results indicated that
Group IV differed substantially from Groups II and III on the spatial sampling
dimension (p < .05), the latter two being virtually equivalent. On the
temporal dimension, Group II deviated significantly from all other groups
(p < .05). Differences among the groups were largely diminished on both
measures by the third replication.

The results support the general hypothesis of a correspondence between
individual differences in the manifestation of psychopathology and strategic
differences in coding visual information. But our intuition concerning the
nature of specific relationships proved to be rather weak. On the spatial
measure (Figure 1, right side) Group IV showed a preference for reduced
exposures particularly in initial stimulus presentations, while Groups II and
III preferred more extensive cues. On the temporal measure (Figure 1, left
side), Group II displayed a preference for a prolonged number of successive
stimulus patterns during initial trials, whereas other groups utilized
substantially fewer patterns prior to expressing a decision. From the standpoint of
their dimensional combinations, Group IV displayed an LS-LT strategy as
expected; but contrary to prediction, Group II manifested HS-HT coding,
and Group III demonstrated HS-LT coding. Furthermore, the predicted
HS-LT strategy for Group I was contraindicated by their stable intermediate
performance.

[FIGURE 1. SPATIAL AND TEMPORAL SAMPLING PERFORMANCES]

These data thus suggest that during inferential tasks individuals may show 
considerable variation in the selectivity and duration of information sampled, 
and that such attentional strategies may relate to stable behavioral and per- 
sonality attributes. More specifically, individuals with obsessional or compul- 
sive tendencies (Group II) seemed to prefer extensive information along both 
spatial and temporal parameters. Conversely, persons displaying disorganized 
schizophrenic-like ideation and bizarre behavior (Group IV) demonstrated 
a preference for highly restricted spatial and temporal information, even when 
as in the spatial condition more task effort was required. Those individuals 
with confused, paranoid ideation but without bizarre mannerisms (Group 
III) seemed to prefer cue distributions that were spatially extensive but
temporally brief. For the combination involving restricted spatial selection
and prolonged temporal search, no behavioral correlates were identified. 

The fact that strategic differences occurred largely as a primacy effect led 
us to suspect that extreme sampling along either spatial or temporal dimensions 
might involve an anticipatory reaction to situational or task uncertainty. Some 
theorists have considered proclivities toward anxious arousal to be an important 
determinant in modifying attentional characteristics (8). Upon reexamination
of interview data, we found that Groups II and IV had indeed obtained
significantly higher anxiety ratings than the other groups (p < .01). Such
arousal or apprehension may have been a motivational factor in the highly
reduced spatial and temporal sampling of Group IV, while similar arousal in
Group II produced the opposite effect, namely extensive spatial and temporal
sampling. Thus while behavioral and personality factors may correlate with
directional differences in attentional strategies, motivational arousal associated 
with external uncertainties may determine the magnitude of their deployment. 

In conclusion, attentional differences in selectivity and duration may develop
in early life, playing a critical role in increasing vulnerability to
psychopathology during adulthood. Clearly more understanding of attentional processes
in children and normal adults is needed.


D. SUMMARY


An attentional-behavioral model was proposed and tested. Forty-nine male
psychiatric patients, classified into four behavioral groups and presented an
affect discrimination task, were predicted to differ in sampling information
along dimensions of spatial selectivity and temporal duration. Results indicated
that across subjects, measures of the spatial and temporal dimensions were
uncorrelated, and that along these dimensions, groups of subjects differed
significantly. However, attentional strategies were not generally in the direction
predicted, and major differences occurred only on initial trials. It was
concluded that preferential spatio-temporal sampling is correlated with
psychopathological behavioral patterns, and these preferences are accentuated by
anticipatory uncertainty.


CRISIS WAVES: THE GROWTH AND DECLINE OF
WAR-RELATED BEHAVIORAL EVENTS*


JOHN VOEVODSKY¹


A. INTRODUCTION


Previous analyses by the author and others (1, 6, 7, 12, 15) have shown 
orderly and patterned behavior as wars are entered upon and gradually become 
resolved. Because the quantitative events that are measured (expenditures, troop 
commitments, casualties, firepower, etc.) show a wavelike progression, a single 
cycle of their growth and decline is here called a crisis wave. 

Lying behind a crisis wave are innumerable decisions made by policy leaders 
in the opposing nations, by their commanders in the field, and by friendly, 
neutral, and unfriendly nations not directly involved in the conflict. When 
such a very large number of factors contributes to a result, the numerous 
equilibrating relationships often produce a surprising lawfulness in the
large-scale consequences. This paper is directed toward proposing a mathematical
theory that accounts for this lawfulness by analyzing the mechanism underly- 
ing the formation of a crisis wave. 

The author’s intent in proposing this theory is not to enhance the art of war, 
although it can be used for this purpose, but rather to help national decision 
makers realize the final catastrophic consequences of their initial low magnitude 
decisions made years earlier, so that ultimately war behavior may be diverted 
at the beginning instead of at the end of the war. 


B. THE WAVELIKE COURSE OF UNITED STATES MILITARY
COMMITMENTS, 1939-1969


Analysis of the wavelike nature of war-related events has shown, first, that 
these events follow an orderly pattern (15) ; second, that large patterns appear 
to be periodic in time, with a periodicity of approximately 25 years (1); and, 
third, that the percentage of conflicts settled as a function of time shows an
observed regularity (6). Although, as the UNESCO charter states it, wars begin

* Recommended by Ernest R. Hilgard of the Editorial Board, received in the
Editorial Office on December 20, 1971, and published immediately at Provincetown,
Massachusetts. Copyright by The Journal Press.
1 The author wishes to express his sincere appreciation for ideas and encouragement
to James R. McGregor, Professor of Mathematics, and Ernest R. Hilgard, Professor
of Psychology, Stanford University, without whom this paper would not have been
possible.
in the minds of men, the pattern of violence which periodically racks human
societies is quantitatively tractable and mathematically analyzable, and may 
be described verbally as wavelike in nature. 

Quantitative techniques that have been recently applied in the analysis of
nations engaged in warfare permit a mathematical and graphic representation
of complex military, political, social, and economic events. An example is
shown in Figure 1, which is a graphical representation of the U. S. vital
statistics during major U. S. military commitments from 1939 to 1969. The
wavelike nature of the statistics should be noted.

Starting at the top of the figure and working downward, one observes that
U. S. defense expenditures in current dollars per calendar year are plotted.²
The dots are the actual data through which the curves are smoothly drawn. 
Breaks in the curves indicate where data sources were changed. Notice the 
characteristic wave pattern that begins in 1939 and crests in 1945 as World 
War II ends. Then notice the drop to a low, prior to the Korean War, with 
the same characteristic wave shape. The same characteristic upswing of a new 
wave emerges for the Korean War, and then the predictable break takes place, 
resulting in relative disarmament during the period 1954-56. Beginning again 
in 1956 there are increasing waves in the expenditures, as the war in southeast 
Asia builds to its cresting point in the latter half of 1969. 

There is one data point significantly off the smooth curve for the U. S. 
defense expenditures. This occurs in 1949. Two behavioral events caused the 
expenditures to go above the mathematical projections. They were the Berlin 
crisis and a Communist government that assumed power over China. Compared
with the other three major behavioral events of the past 30 years (World
War II, Korea, and Vietnam), the events of 1948-49 had small effect
on the U. S. They are ignored in order to show better the repeating behavioral
patterns of the U. S. during major armed conflict and the remaining peacetime
disarmament periods.

Plotted directly under U. S. defense expenditures are the U. S. S. R. defense 
expenditures in current rubles per calendar year.³ Notice that the U. S. S. R.
budget tracks the U. S. budget with the exception of one of the two points 
which are off the smooth curve. The first point off the curve occurs simulta- 
neously with that of the U. S. in 1948-49 when the Berlin and China crises 
occurred. The second point occurs in 1956 when Poland and Hungary revolted 
against the U. S. S. R. without U. S. intervention.

A very interesting and politically sensitive series of behavioral events has 
been noted to occur as the crisis waves crest. This series of behavioral events
is composed of the leaders of the opposing sides being removed from office, most
of them by death. For example, for World War II (see Figure 1), Roosevelt
died 13 April 1945, Hitler died 2 May 1945, and Tojo died 22 December
1945, and of course Churchill was voted out. For the Korean War, Stalin 
died 5 March 1953, and Truman retired from office in January, 1953. Similarly 
for the Vietnam conflict, Diem died 2 November 1963, and Kennedy died 22 


2 United States defense expenditures in billions of current dollars per calendar year. 
Data for 1939-1949 from The Congressional Quarterly, December 1, 1967. P, zat 
Data for 1949-1969 from S.I.P.R.I. Yearbook of World Armaments and Disarmament
1968/69. New York: Humanities Press, 1970. Pp. 202-3.

3 Soviet Union defense expenditures in billions of current rubles per calendar year.
Data from Ibid., 204-5.
November 1963 as the 1961-1964 Vietnam crisis wave crested. As the Vietnam
War of 1965-1969 crested, President Johnson was retired, and Ho Chi Minh
died 3 September 1969. There are several speculations in relation to these
deaths or career terminations. The first speculation is that these events may, of
course, be just coincidences. A second possibility is that subsequent events are
influenced by the death or withdrawal of a strong leader. There is no evidence
for this in the data. A third possibility is that the period of rising involvement
places a strain upon the leader (who may die of natural causes or resign because
of age or fatigue), or adds turmoil in the nation, putting the life of the leader
in jeopardy, or both. Whatever the facts may be, these events are
interesting, but are incidental to the analysis that follows.

Plotted directly under the economic data are manpower and casualty data
for the U. S. Total U. S. Department of Defense personnel on active duty at
any instant is the first curve.⁴ The second curve is the total Army personnel
on active duty at any instant. Each curve in the third set of curves pertains
only to the particular war being fought at the moment. These refer to U. S.
Army battle strength, i.e., the total number of army personnel on active duty,
both officers and enlisted men, in the battle zone at a given instant. Reading
from left to right in Figure 1, U. S. Army battle strength on the far left is the
total U. S. Army personnel deployed outside Continental U. S. for World
War II.⁵ For the Korean War, the U. S. Army battle strength is the
U. S. Army personnel in Korea at any instant (11). For the Vietnam War,
battle strength begins with the first military mission established in South
Vietnam by President Truman at the beginning of the Korean War in 1950.
It is expanded during the Eisenhower administration in 1956. It is expanded a
second time during the Kennedy administration in 1960, and a third time
during the Johnson administration in 1965. Currently it is being reduced
during the Nixon administration dating from the latter half of 1969 (4).

The final curves at the bottom of Figure 1 are the accumulated battle
casualties for each war. These are the accumulated total of the dead, wounded,
missing, and captured at any instant from the beginning of the war. (For more


4 Total Department of Defense personnel and total Army personnel on active
duty. Part A of the curves from June 1940 to June 1953: data from Pocket Data
Supplement by Department of the Army. Washington, D.C.: Govt. Printing Office,
1958. Part B of the curves from June 1954 to June 1963: data from Selected
Manpower Statistics 1968. Washington, D.C.: Directorate for Statistical Services,
Office of the Secretary of Defense, April 15, 1969. P. 19. Part C of the curves from
1965 to June 1969: data from Military Strength Figures. Washington, D.C.:
Office of the Assistant Secretary of Defense (Public Affairs), 1969.

5 World War II U. S. Army battle strength data from Annual Report of the
Secretary of the Army, 1948. Washington, D.C.: Govt. Printing Office, 1949. P. 295.
accurate definitions the reader is referred to the data sources.) The casualty
curve has the appearance of forcing the crisis wave upward. When the casualties
cease to rise, the economic and manpower statistics that define and describe
the crisis wave appear to crest and then fall back toward lower values. U. S.
Army battle casualties for World War II are shown on the left (2). U. S.
Army battle casualties for the Korean War are shown in the middle (3), and 
U. S. Army battle casualties for the Vietnam War are shown on the right in 
Figure 1 (4). 

Dividing the financial data from the manpower statistics is a shaded
line which is the Marshall (9) World War II estimate of the total number
of U. S. men mentally and physically fit to bear arms. This estimate
was 11.4 percent of the U. S. population at that time. This estimate has been
projected with the increase in population of the U. S. for the 30 years under
review.

A final note of interest in the figure pertains to the vertical pillars which 
indicate U. S. presidential election years. Over the past 30 years approximately 
85 percent of the upward or downward growth of each U. S. crisis wave has 
occurred between election years.

To summarize, when quantitative data describing a nation’s military posture 
are set into the form as shown in Figure 1, wavelike patterns appear. Addi- 
tionally, Figure 1 is a convenient way of studying large variations in a nation’s 
defense posture over a considerable period of time. It also appears to aid in 
understanding the position that the U. S. finds itself in now. The current 
position as of this writing (October 1971) is on the downward side of the
Vietnam crisis wave. 


C. A MATHEMATICAL THEORY OF THE CRISIS WAVE MECHANISM
ANALYZED WITH RESPECT TO TROOP STRENGTH, FIREPOWER, AND
THE RESULTANT CASUALTIES


A possible mechanism for the formation of a crisis wave is that a given 
number of troops can inject a given energy rate into the battle zone. This 
energy rate causes the enemy to suffer a given casualty rate. As the casualties
mount, commanders call for new troops not only to replace those that have 
fallen but also as reinforcements. As the troop size builds higher, higher total 
energy rates are delivered into the battle zone by the entire army, even though 
individual soldiers’ firing rates are found to be decreasing. The higher total 
energy rates cause more casualties at still higher rates, which in turn bring in 
still more troops. Thus the crisis wave grows in a predictable fashion. The 
crisis waves crest when the average individual soldier’s firing rate becomes 
small, and one or both of the opposing sides’ manpower pools have been
consumed by casualties (16). A decision to end the war or significantly widen the
war by the introduction of new allied armies or new weapon systems is made
at that level of conflict.

To be mathematically specific, let $S_1(t)$ represent the battle strength of one
side and $S_2(t)$ represent the battle strength of the other side. Furthermore, let
$C_1(t)$ and $C_2(t)$ represent the total battle casualties suffered by each side
respectively.

The change over time of the casualties of opposite sides has been found to be
similar (16). In fact the data indicate that the casualties vary on either side by
a fixed ratio. This ratio can be written as


$$C_1(t) = j_c\,C_2(t) \qquad [1]$$


where $j_c$ is the Lanchester exchange ratio. Not only do the data (16) support
this simple linear relationship, but also it has been rationalized by Lanchester
in his Principle of Concentration (7). This relationship is shown in Figure 2
for the French versus German army battle casualties in World War I and is
described in detail in reference (16) for the British versus German army battle
casualties and battle deaths during World War I, as well as for the Union and
Confederate army battle casualties and deaths during the U. S. Civil War.
Because of this similarity, due to the Lanchester exchange ratio, the mechanics
of the wave may be discussed in terms of either side’s data. For notational
simplicity, herein, the mathematics describing the wave mechanics will be in
terms of side number one.

This paper is concerned with two specific responses among the many alternative
responses that the Commander in Chief of, for example, side one has at any
instant in time to the stimulation caused by the opposite side. The first response
is to increase or decrease the energy delivery rate, $dE_1/dt$, of the army’s weapon
systems in the battle zone. The second specific response of concern here is to
increase or decrease the army’s battle strength, $S_1(t)$. The first response is
considered initially.

A given number of side one’s soldiers can cause their weapon systems to
discharge into the battle zone at a given energy rate. Lanchester (7) has defined
this energy rate as the horsepower of the nation as a fighting machine. This
horsepower is expressed mathematically as


$$\frac{dE_1}{dt} = e_1(t)\,S_1(t) \qquad [2]$$


where $e_1(t)$ is a multiplier that represents the average energy rate per soldier
comprising side one’s fighting machine and is defined here as the average energy
yield multiplier. Equation 2 is a specific case of the more general lateral pressure
equations which are power functions proposed by North and Lagerstrom (10).
Equation 2 might be better expressed as a power function. However, in the
absence of any confirming data, the simpler linear form is assumed. The
energy yield multiplier, $e_1(t)$, appears to depend upon how long the war has
been going on. Historical observation of the progress of wars shows that initially
the energy yield multiplier is a maximum. However, as side one’s casualties
mount, its energy yield multiplier appears to decrease. It approaches zero as the
crisis wave peaks; that is, it approaches zero when the number of soldiers in
the battle zone reaches its maximum. Of course, the exact variation of this
multiplier as the war progresses is not known; however, a very simple
assumption can be made. If an inverse linear relationship with strength in the
battle zone is assumed,


$$e_1(t) = e_{m_1}\left[1 - \frac{S_1(t)}{S_{\infty_1}}\right], \qquad [3]$$


the form of the final equations will correspond to the observed behavior. The
constant, $e_{m_1}$, represents the maximum average energy yield rate per soldier
possible for the given weapon systems being used.

$S_{\infty_1}$ is the ultimate number of soldiers that are to be engaged by side one for
that particular crisis wave. $S_{\infty_1}$ is defined as a function of the population of a
nation by


$$S_{\infty_1} = \gamma_1 P_1 \qquad [4]$$

and

$$S_{\infty_2} = \gamma_2 P_2. \qquad [5]$$

$P_1$ and $P_2$ are the total population of each side respectively. $\gamma_1$ and $\gamma_2$ are
multipliers whose values are equal to or less than .114 for any given crisis wave,
where $\gamma_{max}$ for the U. S. was estimated by Marshall (9) to be .114. However,
$\gamma$ for the U. S. can be much less for any particular conflict but cannot be more,
because there are simply no more mentally and physically fit persons to bear
arms in a general population distribution. The value of $\gamma$ appears to depend
upon where the fighting is taking place and the type of equipment being used.
The value of $\gamma$ will be maximum if a nation’s army is fighting within its own
borders. It will be less than that when fighting outside its own borders because
of the necessity of maintaining a ready reserve force at home. The values of $\gamma$
below .114 cannot at present be computed prior to the outbreak of the war.
Values of $S_{\infty_1}$ and hence of $\gamma$ can be empirically determined after the war begins.
Values for $S_{\infty_1}$ for the wars under discussion are given in Figure 4.

Thus equation 3 states that as a war progresses and more troops are committed
to the battle zone thereby increasing $S_1(t)$, the average energy yield
multiplier per soldier, $e_1(t)$, becomes progressively smaller in a linear fashion.
Substituting equation 3 into equation 2 produces the following equation:


$$\frac{dE_1}{dt} = e_{m_1}\left[1 - \frac{S_1(t)}{S_{\infty_1}}\right] S_1(t). \qquad [6]$$


This is the relationship between the number of troops in the battle zone, $S_1(t)$,
the duration of the battle as measured by the total number of troops committed,
$S_1(t)/S_{\infty_1}$, the maximum average energy yield rate per soldier, $e_{m_1}$, and
the energy rates delivered into the battle zone, $dE_1/dt$, by side one.
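
The linear assumption of equation 3 and its consequence in equation 6 can be made concrete with a short numerical sketch. The function names and the numbers used below (`S_INF`, `E_MAX`) are hypothetical choices made only for illustration; they are not values from the paper.

```python
# Sketch of equations 3 and 6 as read here. All numeric values are
# hypothetical and serve only to show the shape of the two relationships.

def energy_yield_multiplier(strength, ultimate_strength, max_yield):
    """Equation 3: average energy rate per soldier, falling linearly with strength."""
    return max_yield * (1.0 - strength / ultimate_strength)


def total_energy_rate(strength, ultimate_strength, max_yield):
    """Equation 6: energy rate of the whole army (equation 3 times strength)."""
    return energy_yield_multiplier(strength, ultimate_strength, max_yield) * strength


S_INF = 500_000.0  # hypothetical ultimate strength
E_MAX = 1.0        # hypothetical maximum yield per soldier, arbitrary units

for fraction in (0.1, 0.25, 0.5, 0.75, 1.0):
    s = fraction * S_INF
    print(fraction,
          energy_yield_multiplier(s, S_INF, E_MAX),
          total_energy_rate(s, S_INF, E_MAX))
```

In this sketch the per-soldier multiplier falls throughout the buildup, while the total rate of equation 6 is a parabola in the strength: it rises until the strength reaches half of the ultimate strength and declines to zero as the ultimate strength is approached.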


By defining another multiplier, $\mu_1$, as the number of enemy casualties per
unit energy expended by side one, the relationship between the energy rate which
produces a given casualty rate in the enemy’s ranks can be written as


$$\mu_1 \frac{dE_1}{dt} = \frac{dC_2}{dt}, \qquad [7]$$

and if equation 1 is substituted into equation 7

$$j_c\,\mu_1 \frac{dE_1}{dt} = \frac{dC_1}{dt}, \qquad [8]$$


where $\mu_1$ is a function of the hit probability, the hardness of the target, and
some other factors that do not directly concern this paper. By substituting
equation 8 into equation 6, the first relationship between casualties and troop
strength is obtained:


$$\frac{dC_1}{dt} = j_c\,e_{m_1}\mu_1\left[1 - \frac{S_1(t)}{S_{\infty_1}}\right] S_1(t). \qquad [9]$$


It is interesting to note that this equation has the same form as a linear
combination of Lanchester’s (7) two proposed basic laws for the generation of
casualties during a war.

In order to solve for casualties or strength, another equation relating these 
two variables is required. At the beginning of this mathematical discussion it 
was pointed out that the Commander in Chief of an army has two specific 
responses important to this discussion. They are to change the army’s energy 
delivery rate or to change the army’s size. The first response results in equation 
9. Turning now to the second response, a second equation is required so that 
the mathematical form of the two variables, casualties and strength, can be 
obtained. 

As casualties mount, the data (16) show that troop commitments also
mount, as shown in Figure 3. As a first approximation,


$$S_1(t) = \alpha_1\,C_1(t)^{\beta_1}, \qquad [10]$$


where $\alpha_1$ is the theoretical initial strength in the battle zone when the first
casualty occurs, and $\beta_1$ is the replacement-reinforcement coefficient, both of
which are constants for a given war. The theoretical initial strength coefficient,
$\alpha_1$, was found to be equal to .3 × 10³ for the U. S. Army in Korea, 1 × 10³
for Vietnam War I, 5 × 10³ for Vietnam War II, 11 × 10³ for the British
Army in World War I, and 31 × 10³ for the U. S. Army in World War II.
The replacement-reinforcement coefficient, $\beta_1$, for the U. S. in Korea was .58;
$\beta_1$ for both phases of the Vietnam War was .36; $\beta_1$ for the U. S. in World
War II was .38; $\beta_1$ for the British Army strength in France during World
War I was .37; and $\beta_1$ for the Union Army during the U. S. Civil War was
approximately .33. For the buildup and cresting of a war, equation 10 is of
sufficient complexity. However, if the portion of the strength curve after the
shooting stops, as well as the details of the very beginnings of the wave, is to
be considered, then a more complex relationship between troop commitment,
$S_1(t)$, and casualties suffered, $C_1(t)$, than equation 10 is required. This more
complex form will be defined later in the paper as equation 22.
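
One way to read the replacement-reinforcement coefficient is to ask how much the battle strength grows when cumulative casualties double. Under the power law of equation 10 the answer depends only on the coefficient, so the sketch below needs only the coefficient values quoted in the text. The helper names and the doubling example are assumptions made for illustration.

```python
# Equation 10, S = alpha * C**beta, with the beta values quoted in the text.
# Helper names and the doubling example are illustrative assumptions.

BETA = {
    "Korea": 0.58,
    "Vietnam (both phases)": 0.36,
    "World War II": 0.38,
    "British Army, World War I": 0.37,
    "Union Army, Civil War": 0.33,
}


def strength(casualties, alpha, beta):
    """Equation 10: battle strength implied by cumulative casualties."""
    return alpha * casualties ** beta


def growth_factor(beta, casualty_multiple=2.0):
    """Factor by which strength grows when casualties grow by casualty_multiple."""
    return casualty_multiple ** beta


for war, beta in BETA.items():
    print(f"{war}: doubling casualties multiplies strength by {growth_factor(beta):.2f}")
```

With a coefficient near 1/3, a doubling of casualties corresponds to roughly a one-quarter increase in strength, while the larger Korean coefficient corresponds to roughly a one-half increase.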

Figure 3 is a graphic representation of a summary of United States Army
statistics for the Vietnam War, the Korean War, World War II, World
War I, and the British Army participation in World War I. While it is
acknowledged that consideration of five wars is not necessarily proof of the
existence of a phenomenon, an examination of the relationships between wars
suggests that an important relationship between ultimate strength, $S_{\infty_1}$, and
ultimate casualties, $C_{\infty_1}$, can be drawn.

This relationship is that there are limits in strength buildups and casualties
that a nation will sustain, beyond which it either accepts defeat, changes its
leadership, or acquires new allies. This idea is graphically illustrated in Figure 3 by
a line defined here as a lower boundary level, which relates the points where
battle casualties and battle strengths are equal.

This line, in effect, represents a boundary to the right of which a nation 
cannot seem to pass and still sustain its own war effort. For example, Great 
Britain at the end of World War I suffered roughly two million casualties 
with a total strength in France at the end of the war of two million soldiers.
Since the population of Great Britain at the time was 60 million, this represents 
3.33 percent of her 1918 population. It is revealing to note that 3.33 percent 
of the United States population (110 million during World War I) is 3.67 
million combat troops, or casualties, which is the point where the United 
States experience in World War I intersects the lower boundary when it is 
extended as shown by the dashed lines in Figure 3. Also if a nation is capable 
of raising only a one man army, then as soon as he becomes a casualty the 
army is finished. Similarly, as soon as all members of a 22 million man army 
become casualties, it is finished. 

Furthermore, if for the two United States wars in Asia, the Korean War
and Vietnam War II, only the combat troops (approximately 50 percent for
the U. S.) are plotted instead of the total army, both curves intersect the lower
boundary level. The U. S. Marine Corps had suffered 100 percent casualties
in Vietnam by June of 1969. Also, the estimated absolute ceiling on the number
of men physically fit for active war service for the United States during
World War II was between 15 and 16 million, which again is the intercept of
the extension of the United States World War II curve with the lower boundary
level. This point represents 11.4 percent of the United States population of
131 million in 1940. Finally, in World War I Russia mobilized 15.5 million
men. If it is assumed that they could deploy the same percentage of men
mobilized as the U. S. did for World War I, which was 52.7 percent, then
theoretically they could suffer 8.1 million casualties. They actually suffered
7.9 million, and then their government collapsed (5). Thus a warring nation
cannot seem to pass the lower boundary level and still sustain its own war
effort on its own. New allies must be introduced, or the war settled.
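
The lower-boundary examples above rest on a few lines of arithmetic, which can be checked directly. The sketch below uses only the figures stated in the text; the variable names are illustrative.

```python
# Worked arithmetic for the lower-boundary examples quoted in the text.
# Every input is a figure stated in the paper; populations are in millions.

def percent_of(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Great Britain, World War I: about 2 million casualties, population 60 million.
british_pct = percent_of(2.0, 60.0)

# The same percentage applied to the U. S. World War I population (110 million).
us_ww1_intercept = british_pct / 100.0 * 110.0

# Marshall's 11.4 percent applied to the 1940 U. S. population (131 million).
us_ww2_ceiling = 0.114 * 131.0

# Russia, World War I: 15.5 million mobilized, 52.7 percent assumed deployed.
russia_limit = 15.5 * 0.527

print(round(british_pct, 2), round(us_ww1_intercept, 2),
      round(us_ww2_ceiling, 1), round(russia_limit, 1))
```

The first two results reproduce the 3.33 percent and 3.67 million quoted in the text. The last two come to about 14.9 million and about 8.2 million, close to the "between 15 and 16 million" and "8.1 million" quoted in the text.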

Based on the reasoning that once a nation’s committed manpower become
casualties, the nation can no longer sustain its own war effort, a final commitment
ratio, $\Lambda_1$, is defined which equals the ratio of the ultimate battle casualties
to be suffered for this particular crisis wave, $C_{\infty_1}$, to the ultimate number of
soldiers that are to be committed, $S_{\infty_1}$. Thus, $\Lambda_1$ equals the ratio $C_{\infty_1}/S_{\infty_1}$. At
most the final commitment ratio, $\Lambda_1$, can equal one, as was the case for Britain
in World War I. It can be less than one as shown, for example, in Figure 3, by
the U. S. in Korea where $\Lambda_1$ equals .4.
Equation 9 and equation 10 are two equations expressed in terms of two
unknowns, strength, $S_1(t)$, and casualties, $C_1(t)$. Hence they can readily be
solved for the variation of the troop strength with respect to time and the
variation of casualties with respect to time, as the crisis wave builds to a
crest. By substituting equation 10 into equation 9 and solving for the buildup
in casualties, the differential equation

$$\frac{dC_1}{dt} = j_c\,e_{m_1}\mu_1\left[1 - \frac{\alpha_1}{S_{\infty_1}}\,C_1(t)^{\beta_1}\right]\alpha_1\,C_1(t)^{\beta_1} \qquad [11]$$

is obtained. Critical to the solution of equation 11 is the value of the
replacement-reinforcement coefficient, $\beta_1$. Values of $\beta_1$ of primary concern
in this paper are $\beta_1$ equal to 1, 1/2, and 1/3. For the Union Army in the
U. S. Civil War, for the British Army in France in World War I, for
the U. S. Army in World War II, and for the U. S. Army in both
phases of the Vietnam War the value of $\beta_1$ was approximately 1/3. Only
in the case of the Korean War was a larger $\beta_1$ found, which equaled 1/2.
At the very beginning of a war, larger values of $\beta_1$ in the neighborhood of 1 are
more appropriate. $\beta_1$ equal to 1 is found to be a good approximation for the
first 10 percent of a war.

Because of the nonlinear nature of equation 11, it cannot have a closed
form solution without $\beta_1$ being an integer value. Thus the simplest solution
occurs when $\beta_1$ equals 1. In this case the solution to equation 11 for $\beta_1$ equal
to unity is called the logistic curve and is S-shaped in nature (8). For values
of $\beta_1$ smaller than 1, the S-shaped nature of the curve is retained. For values
of $\beta_1$ bigger than 1, the casualties begin abruptly. The form of the solution to
equation 11 for values of $\beta_1$ equal to or less than 1 matches the form of the
known buildup in casualties during 1965 for Vietnam War II as drawn in
Figure 1.
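
The logistic case can be checked numerically. The sketch below integrates equation 11 with the replacement-reinforcement coefficient set to 1 and compares the result with the standard closed-form logistic curve. The product of the exchange ratio, the maximum energy yield, and the casualties-per-energy multiplier is lumped into a single constant `k`; that constant and every numeric value are hypothetical, and the fourth-order Runge-Kutta integrator is simply a convenient choice made here, not a method used in the paper.

```python
import math

# Numerical check of equation 11 for beta = 1 against the logistic curve.
# k lumps j_c * e_m * mu; all numeric values are hypothetical.


def casualty_rate(c, k, alpha, beta, s_inf):
    """Right-hand side of equation 11 (equation 10 substituted into equation 9)."""
    s = alpha * c ** beta               # equation 10
    return k * (1.0 - s / s_inf) * s    # equation 9


def integrate_rk4(c0, t_end, steps, k, alpha, beta, s_inf):
    """Integrate equation 11 from 0 to t_end with classical Runge-Kutta steps."""
    h = t_end / steps
    c = c0
    for _ in range(steps):
        k1 = casualty_rate(c, k, alpha, beta, s_inf)
        k2 = casualty_rate(c + 0.5 * h * k1, k, alpha, beta, s_inf)
        k3 = casualty_rate(c + 0.5 * h * k2, k, alpha, beta, s_inf)
        k4 = casualty_rate(c + h * k3, k, alpha, beta, s_inf)
        c += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return c


def logistic(t, c0, k, alpha, s_inf):
    """Closed-form solution of equation 11 when beta equals 1."""
    c_inf = s_inf / alpha               # ultimate casualties for beta = 1
    rate = k * alpha
    return c_inf / (1.0 + (c_inf / c0 - 1.0) * math.exp(-rate * t))


K, ALPHA, S_INF, C0 = 0.5, 2.0, 1000.0, 1.0
for t in (2.0, 6.0, 12.0):
    numeric = integrate_rk4(C0, t, int(t * 100), K, ALPHA, 1.0, S_INF)
    print(t, round(numeric, 3), round(logistic(t, C0, K, ALPHA, S_INF), 3))
```

The same `casualty_rate` function accepts fractional coefficients such as 1/3 or 1/2, so the integrator can be reused to inspect the other cases discussed in the text, for which no closed form is given here.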

The form of the solution of equation 11 matches the known data for the
Korean War. However, the data forming the bottom part of the S-shaped
curve for Korea, as in the case for most wars, are not known. The solution
to equation 11 also matches the top portion of the S-shaped data curve as
casualties built up in the Vietnam War from December, 1961 through
December, 1964. The bottom part of this portion of the curve is not shown in
Figure 1 and is not known prior to December, 1961 (16). Equation 11
matches the casualty curve for the Vietnam War from 1965 through June,
1969. In this case the casualty data forming the bottom part of the S-curve
can be seen for the year 1965.

The equation for the buildup in battle strength as casualties mount is
obtained by again substituting equation 10 into equation 9 and solving this
time for the strength, $S_1(t)$. This substitution yields


$$\frac{dS_1}{dt} = \frac{j_c\,e_{m_1}\mu_1}{\rho_1\sigma_1}\left[1 - \frac{S_1(t)}{S_{\infty_1}}\right] S_1(t)^{2-\rho_1} \qquad [12]$$

where

$$\rho_1 = 1/\beta_1 \qquad [13]$$

and

$$\sigma_1 = \frac{1}{\alpha_1^{\rho_1}}. \qquad [14]$$


Equation 12 is solved for the fraction $S_1(t)/S_{\infty_1}$ for the case when $\rho_1 = 3$.
The approximate solution is


$$\frac{S_1(t)}{S_{\infty_1}} = 1 - \exp\left[-\frac{j_c\,e_{m_1}\mu_1}{3\Lambda_1}\,t\right] \qquad [15]$$


where the first assumption for mathematical simplicity is a small ratio of
initial strength, $S_1(0)$, to ultimate strength, $S_{\infty_1}$. For example, the ratio of
$S_1(0)/S_{\infty_1}$ is 1/20th for both phases of the Vietnam War. The second
assumption for mathematical simplicity is that the ratio
$[S_{\infty_1} - S_1(t)]/S_{\infty_1}$ is
small when compared to $\log[S_{\infty_1} - S_1(t)]$. The result of these assumptions
is equation 15 which produces a curve that can be easily fit to the strength
data for major U. S. wars.

Figure 4 shows the data plotted and equation 15 drawn as the smooth
curves for the U. S. in World War I, for Britain in World War I, for the
U. S. in World War II, the Korean War, and the two phases of the Vietnam
War. For the data for Figure 4, see Voevodsky (16). The value of the
ultimate strength for the U. S., $S_{\infty_1}$, for each of these wars, plus the numerical
value of the time constant, $\tau$, for each war, are also given in Figure 4.
The specific case of the Vietnam War from June, 1965 to June, 1969 is
shown in Figure 5. Plotted in the top part of the figure is the total Department
of Defense strength in South Vietnam for the four years. The reciprocal
strength of $S_{\infty_1} - S_1(t)$ is plotted in the bottom part of the figure. The
agreement between the theory as expressed by equation 12 and the actual data for
the war is evident by the nearness of the fit. Notice that by December of 1966 an
accurate estimate of $S_{\infty_1}$ could have been made. Thus accurate forecasts for
the remainder of the war could have been made at that time on the basis of
the mechanics of the crisis wave presented in this paper.

Defining the time constant, $\tau$, for the case when $\rho_1 = 3$ as


$$\tau = \frac{3\Lambda_1}{j_c\,e_{m_1}\mu_1}, \qquad [16]$$

the variation of troop strength, $S_1(t)$, with respect to time is characterized.
For the case of the Vietnam War from June, 1965 to June, 1969 one finds
that $\tau$ = 1.56 years. Thus in the first 1.56 years, 63 percent of the troops that
were to be committed to the battle zone were committed. After the next 1.56
years, 86 percent of the troops were committed; and after the next 1.56 years
(yielding a total of 4.68 years) 95 percent of the ultimate forces were committed.

The general form of the equation for the time constant, $\tau$, is given by


$$\tau = \frac{\Lambda_1}{j_c\,e_{m_1}\mu_1\beta_1} \qquad [17]$$


where $\beta_1$ can have noninteger values.
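
The percentages quoted for the Vietnam buildup follow from the exponential form of equation 15, and the dependence of the time constant on its factors follows from equation 17. The sketch below illustrates both. The function and parameter names are assumptions made here, and equation 17 is used in the form read from the text, namely the final commitment ratio divided by the product of the exchange ratio, the maximum energy yield, the casualties-per-energy multiplier, and the replacement-reinforcement coefficient.

```python
import math

# Sketch of equations 15 and 17 as read here. Names are illustrative.


def time_constant(commitment_ratio, exchange_ratio, max_yield,
                  casualties_per_energy, beta):
    """Equation 17: tau = Lambda / (j_c * e_m * mu * beta)."""
    return commitment_ratio / (
        exchange_ratio * max_yield * casualties_per_energy * beta)


def committed_fraction(t, tau):
    """Equation 15: fraction of the ultimate strength committed after time t."""
    return 1.0 - math.exp(-t / tau)


TAU_VIETNAM = 1.56  # years, the value quoted in the text

for n in (1, 2, 3):
    t = n * TAU_VIETNAM
    pct = 100.0 * committed_fraction(t, TAU_VIETNAM)
    print(f"{t:.2f} years: {pct:.0f} percent committed")
```

Run as written, the loop reports 63, 86, and 95 percent after one, two, and three time constants, matching the figures in the text. Calling `time_constant` with a coefficient of 1/2 instead of 1/3, all else equal, gives a smaller time constant, which is the Korean versus Vietnam contrast the text draws.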

Equation 17 determines the magnitude of the time constant of a war, the
determining factors of which appear to be set at the earliest stages of the
war (16). All other factors remaining constant, the time constant for the war,
$\tau$, is made smaller, and hence a decision to stop the war or significantly expand
the war is reached sooner, if the exchange ratio, $j_c$, is made larger. If the maximum
average energy yield per soldier, $e_{m_1}$, is increased, then the time constant, $\tau$,
decreases; for instance, by converting to automatic weapons from semiautomatic
weapons or to nuclear explosives in contrast to conventional explosives.
The time constant can be made smaller by increasing the number of casualties
per energy unit expended, $\mu_1$. In contrast to the enemy’s offensive weapon
systems, enemy defensive systems, such as antiballistic missile, antiaircraft,
and antisubmarine systems, tend to decrease $\mu_1$, thus prolonging the time that
it takes for a war to arrive at a decision level. The time constant, $\tau$, can be made
smaller by decreasing the final commitment ratio, $\Lambda_1 = C_{\infty_1}/S_{\infty_1}$. The smaller
the percentage of casualties to troop commitment that the nation is prepared
[Figure: The time variation in total DOD strength in South Vietnam. Panel a: Strength vs duration.]
to invest in the crisis, the faster the war will reach the decision level. Finally,
the bigger the replacement-reinforcement coefficient, $\beta_1$, the smaller the time
constant. For example, in the Korean War $\beta_1$ was larger than in the Vietnam
War, thereby indicating that reinforcement troops were more vigorously
employed in Korea than in Vietnam.
A mathematical property of the equation expressing the solution of equation
9 and equation 10 in terms of $C_1(t)$, $C_2(t)$, $S_1(t)$, or $S_2(t)$ is that this
solution always has the same time constant. The data (16) also support this
finding where, for the U. S. Civil War, the buildups in Union and Confederate
battle strengths, Union and Confederate battle casualties, and Union
and Confederate battle deaths were found to have identical time constants.
Found also to have identical time constants were the buildup in the mutually
caused French and German battle casualties and the buildups in mutually
caused British and German battle casualties for World War I. Because of
the identical time constants the following identity is found:


$$j_c = \frac{e_{m_2}}{e_{m_1}}\,\frac{\mu_2}{\mu_1}\,\frac{\beta_2}{\beta_1}\,\frac{\Lambda_1}{\Lambda_2}. \qquad [18]$$


This equation states that the Lanchester exchange ratio, $j_c$, depends upon the
product of the ratios: first, the ratio of the maximum average energy yield
rate per soldier of each side, $e_{m_2}/e_{m_1}$; second, the ratio of the number of
enemy casualties per energy unit expended for each side, $\mu_2/\mu_1$; third, the
ratio of the replacement-reinforcement coefficients, $\beta_2/\beta_1$; fourth, the ratio of
the final commitment ratios, $\Lambda_1/\Lambda_2$, for that particular crisis wave.

One method for the exploration of the beginning of war can be accomplished
by setting $\beta_1$ equal to 1. In this case the equations representing the mechanics
of warfare yield the classic logistic curve. The beginning portion of this
S-shaped curve has the initial time constant, $\tau$,


$$\tau = \frac{\Lambda_1}{j_c\,e_{m_1}\mu_1} \qquad [19]$$


which is obtained by setting $\beta_1$ equal to 1 in equation 17. This time constant
describes the rapidity of the troop and casualty buildup at the very early
stages of the war; that is, where $S_1(t)$ is equal to or less than 10 percent of
the ultimate $S_{\infty_1}$. This positively accelerating portion of the curve occurs when
the initial troop strengths and initial casualties are small in comparison to what
is to come. The actual data on this part of a war are often hidden from public
view because they are so small and because governments classically hide the
usual undercover beginnings of war from the respective citizens. Huge wars
can begin just as fast as small wars because they have identical time constants.
The determining factors are only in the magnitude of the exchange ratio,
the magnitude of the average energy yield multiplier per soldier, the
magnitude of the casualties per energy unit expended multiplier, μ; and the
final commitment ratio, A. Clearly with the exception of the final commitment
ratio the larger any of these factors become, the faster the war accelerates.
As a side note, when the assumption is that β equals 1, then the initial
commitment rate, dS/dt, is zero but builds in an exponential manner as
fighting progresses.
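
As an illustration of the early, positively accelerating portion of a logistic curve, the following sketch compares a generic logistic with the pure exponential that shares approximately the same initial time constant. It is not equation 17 of this paper: the functional form, the function names, and the numerical values (initial strength, ceiling, and τ) are assumptions chosen only for illustration.

```python
import math


def logistic(t, s0, s_max, tau):
    """Generic logistic curve that starts at s0 and saturates at s_max."""
    k = (s_max - s0) / s0
    return s_max / (1.0 + k * math.exp(-t / tau))


def exponential(t, s0, tau):
    """Pure exponential growth from s0 with time constant tau."""
    return s0 * math.exp(t / tau)


if __name__ == "__main__":
    # Hypothetical values, not taken from the historical data.
    s0, s_max, tau = 1.0, 1000.0, 2.0
    for t in (0.0, 2.0, 4.0, 8.0, 12.0):
        lo = logistic(t, s0, s_max, tau)
        ex = exponential(t, s0, tau)
        print(f"t={t:5.1f}  logistic={lo:9.2f}  exponential={ex:9.2f}  "
              f"fraction of ceiling={lo / s_max:.3f}")
```

Under these assumed values the two curves stay close while the logistic is below about 10 percent of its ceiling and then separate as saturation sets in, which is the sense in which the start-up phase can be described by a single time constant.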
The start-up rates for the general case depend critically on the valu 
the troop reinforcement coefficient, 8, or inversely on the value of p. 
general equation for the start-up troop rate is obtained from equat 
and is ag 
d8;(O) : [1 —81(0)/S.,] Mi 

[eee = Iobbiém, O81 Tes: (0)-2ae 


where S(Q) is the initial troop strength at the beginning of the war. T 
for the most general case where p = 3 


a8;(0) pus BIO }4] 
dt S2,81(0) 1 


Equation 21 states that the initial troop buildup rate depends in’ ers 
the initial troop strength, $(O). Hence small initial troop strength | 


’ 
Se Fotl1ém 1 [ 


The war also starts abruptly with a troop commitment rate given | by 
tion 21. i 

If the portion of the strength curve after the war is over and the shooting
stops is to be considered, then a more complex relationship between troop
commitment, S1(t), and casualties suffered, C1(t), than equation 10 is
required. In fact, historically, additional troops appear to be committed not only
as a function of casualties, but also as a function of casualty rates. If one
also wants to consider the fact that biological systems (13, 14, 1 ...) have
inertia-like terms, as well as thresholds for stimulation, a very complex form
of equation 10 can be written as


bier ia bo aC, “e 


dt de | 
PS (t) ie 
dt? oe 


+ S(t) 
where the biological system is considered as a second order system. The
two threshold constants are values above which the total casualties and casualty
rate, respectively, must be, for a change in the troop size to take place. In
actual practice these are generally zero because the data are not recorded
until a war is under way. The terms α1, β1, a, b, and c are constants of the
biological system. The values of the factors a, b, and c can be determined after
the crisis wave crests and the shooting has stopped as shown in Figure 1. The
values for α1 and β1 can be determined at the start-up phase of the war where
the initial casualties, C(t), are too small to be a dominant factor. The values
of α and β are determined as has been done in this paper. Notice that equation
22 reduces to the simpler equation 10 in the steady state; that is, if changes are
not rapidly occurring.

When the crisis wave breaks, equation 10 does not describe the observed 
behavior. In this case equation 22 used in conjunction with equation 9 will 
yield the exponential decay of troops in the battle zone, as well as the expo- 
nential buildups. The detailed solutions are not included here because of 
their complexity. 

To conclude, a mechanism for the formulation of a crisis wave has been
proposed. The mathematical equations derived from these proposed mechanisms
yield curves that approximate closely the observed behavior as described by 
the recorded data. 


D. Summary 


On the basis of the U. S. warfare vital statistics for the 30 years from 
1939 to 1969, one can see that the growth and decline of war-related be- 
havioral events are wavelike in nature. In this paper a mathematical theory 
uniting the factors underlying the mechanism of the formation of the wave
phenomena expressed in terms of a nation’s strength, firepower, and resultant
casualties is proposed. The mathematical equations derived from these proposed 
mechanisms yield curves that approximate closely the observed behavior of 
nations engaged in war, as described by the numerical historical data recorded 
not only during the past 30 years but also over the past century. 
DIFFERENCES IN DEGREE OF GENERALIZATION IN
EXTINCTION OF COMPULSIVE DRINKING
BETWEEN ONE TYPE AND MULTITYPE
BEVERAGE DRINKERS* 1 2


California State College at Los Angeles 


DANNY J. COHEN AND JOSEPH G. PHELAN


A. Introduction


Efforts at treating alcoholism have met with limited degrees of success. The 
typical reports of researchers who employ the more traditional talk therapeutic 
methods indicate various rates of achieved abstinence. Tillotson and Fleming
(21) report 15% abstinence at the end of 18 months, while Wall and Allen 
(24) achieved as much as 24% complete abstinence at the termination of 
periods of between three and eight years. 

Kantorovich (13) reported the first scientific study of the use of electrical 
aversive conditioning in treating alcoholism. Shock was used as the uncondi- 
tioned stimulus, and 14 of 17 of his patients remained abstinent from “one 
to several months.” 

Looking back over the history, researchers note a feeling of disappointment
concerning the use of the chemical aversion therapy method. In particular the
Rachman and Teasdale studies (17) cite quite a few disadvantages of this
approach. Some of their more obvious objections to it are that the timing
of the conditioned stimulus (CS) and the unconditioned stimulus (UCS)
proved too difficult to control. Also there was confusion in attempting to
identify just what the unconditioned response (UCR) is. Is it the vomiting
or is it the nausea?

A review of these studies (16) concluded that the use of shock, rather than
the use of the emetic drugs, offered the advantage of more precise controls.

Some of the more recently reported studies have been even more encouraging
in their view of the potentially increasing effectiveness of the more sophisticated
methods, using electrical aversion conditioning. This is particularly true of
the work of Hsu (10) and Blake (1, 2). A serious defect of all these studies,
in our opinion, is their consistent failure to use control groups. Nevertheless,
they do point to the possibility of bringing the compulsive alcohol problem
under reasonable control, for some patients, by the careful administration
of the newer electrical aversion conditioning methodologies.


* Received in the Editorial Office on December 23, 1971, and published immediately
at Provincetown, Massachusetts. Copyright by The Journal Press.
aversive conditioning paradigms in the treatment of alcoholism,” a Ph.D. dissertation
Services of the Sepulveda Veterans Administration Hospital, Sepulveda, California,
Dr. Mathew Buttiglieri, Dr. Elaine Holmes, and Dr. Armond Alkire of the Department
of Psychology, Sepulveda Veterans Administration Hospital.

In another type of study, Farrar et al. (4) investigated the efficacy of
extinction through a one-trial punishment of avoidance-response paradigm.
The response was consumption of alcohol; the punishing agent was succinylcholine
chloride dihydrate. In a six-month follow-up of the study, nine of the
original 12 Ss were ultimately located. Four of the Ss at this late date had 
managed to remain totally abstinent. After a 12-month period two of the Ss 
continued to remain totally abstinent. One of these Ss was able to report the 
experience of a very broad generalization of his aversion. This subject not only 
became extremely anxious about anything to do with alcohol, but he also 
claimed to experience violent anxiety reactions to such relatively innocuous 
solutions as mouthwash and hairspray. 

Our study intended to explore the individual responses to stimulus generalization.
The literature on this phenomenon has been contradictory and confusing.
Lemere and Voegtlin (14) reported two cases in which experimentally
induced conditioning turned out to be highly specific to the beverages used in
conditioning. According to their report the conditioning effect never generalized,
and their patients were able to drink alcoholic beverages in forms other
than the one to which they were conditioned with no distress. In yet another
study, Voegtlin (22) reported the occurrence of generalization of conditioned
aversion to alcoholic stimuli other than those that had been present during the
training, as well as to snuff. Franks (7) objected to the assumption, made by
some, that conditioned aversions are always specific and therefore require
separate and individual conditioning to every kind of beverage. One of the
purposes of our research was to study the extent and kinds of generalization
that can and do occur.

It has been hypothesized by James and Rotter (12) that those alcoholics
who see the locus of reinforcement for their action as externally (chance)
controlled are more likely to resist extinction. This increased resistance to
extinction should be reflected in broader generalization of conditioned aversion
to alcoholic beverages. Our study proposed to determine whether experimental
and control Ss habitually regard reinforcements as externally or internally
controlled and to compare this locus of control with resistance to extinction
and generalization of conditioned aversion to alcohol.

Among other things, the James and Rotter (12) study investigated the 
effects of partial versus 100% reinforcement schedules upon trials to extinction.
Some Ss preferred to see the situation as one of personal skill in coping
(internal locus of control); others saw the task result as one of chance (external).
Under the skill conditions (internal locus of reinforcement) the
superiority of partial reinforcement in resisting extinction did not obtain. The 
50% reinforced (internal) group extinguished much faster than the 50% 
chance (externally controlled) group. This increased resistance to extinction 
of the external (chance) Ss should result in broader generalization of condi- 
tioned aversion to alcoholic beverages. 

Our study, which followed from an earlier dissertation study, involved two
training paradigms: avoidance and escape. We planned to execute this using
the learning variables that Feldman and McCulloch (6) and Franks (8)
have suggested can be relied on to increase resistance to extinction. These
consisted of (a) use of spaced rather than massed trials, as suggested by
Feldman (5); (b) partial reinforcement (11); (c) delay of reinforcement
(3); (d) contiguity of UCS offset and the instrumental response (19); (e)
instrumental rather than classical conditioning (19); and (f) varying the
quantity of reinforcement (15).

Vogler et al. (23) demonstrated that the procedures we intended to use
in our study have the potential at least of effectiveness. Using the escape paradigm
only, they obtained estimates of 70% abstinence after an eight-month
follow-up. The Vogler et al. study leaves unanswered, however, some challenges
put forth by Feldman and McCulloch (6) and Solomon et al. (20): namely,
that avoidance conditioning, not escape conditioning, typically results in resistance
to extinction or prolonged abstinence from alcohol. We therefore combined
in our procedures the best features of both the Vogler et al. and the
Feldman and McCulloch studies.

The work of Hearst (9) with approach and avoidance gradients suggests 
that avoidance gradients are usually flat and elevated and, more importantly, 
that flat generalization gradients are associated with increased resistance to
extinction.

In this study our hypotheses were that (a) there would occur generalization
when aversive conditioning was used; (b) an aversion to one type of alcoholic
drink, once established, would generalize to other similar beverages; (c)
stimulus generalization would occur and be measurable by an increase in pulse
rate when alcoholic beverages, not previously conditioned, were presented; (d)
externally controlled Ss would show a greater amount of generalized aversion
than internally controlled Ss; and (e) those Ss who indiscriminately and randomly
switched from the “hard” beverages (bourbon, vodka) to the “soft” (beer,
wine) would generalize in extinction to a significantly greater degree.


B. Method
1. Subjects 


Ss were 23 adult male veterans (ages 24 to 47), admitted to the alcoholic
research and treatment program at the Sepulveda (California) Veterans Administration
Hospital. In this particular program the criteria for acceptance
included (a) alcoholism as the primary diagnosis and (b) no major psychiatric
or medical problems other than alcoholism that would require further hospitalization.
Ss were randomly assigned to one of three groups: avoidance, escape,
and control.


2. Apparatus 


The shock was administered by a Harvard Induction Stimulator, Model 
#330, which has a maximum of 500 ma output. Finger electrodes of stainless 
steel were attached to the index and little fingers of the nondominant hand.
A motorized spittoon served as the receptacle for expelled alcohol. The opening
of the spittoon was operated by a bar press. A green light attached to the top
of the spittoon indicated the beginning of a trial. A Lafayette polygraph was
used for plethysmograph recordings. Pulse rate was used as a physiological 
measure of the respondent aspects of the conditioning process, or conditioned 
anxiety. Readings were taken from the temporal artery. 

All Ss were administered the Rotter (18) I-E Test to determine internal 
vs. external locus of control of reinforcement. Ss were divided into two groups 
based on external-internal orientation. 


3. Procedure 


Ss were instructed to take a sip of alcohol (the conditioned stimulus [CS])
when a green light appeared, and to hold the sip as long as it was pleasurable.
Ss were given a choice as to what single beverage (e.g., bourbon), what type
of beverage (e.g., beer or wine), or what combination of beverages they
preferred to use in the avoidance conditioning and stimulus generalization
experiment.

While each S held alcohol in his mouth, shock (the unconditioned stimulus
[UCS]) was administered on a randomized schedule of reinforced, nonreinforced,
and delay trials. For escape Ss, shock onset was contiguous with CS
and terminated with the instrumental (spitting) response. For avoidance Ss,
shock onset occurred 10 seconds following CS and terminated with the
instrumental (spitting) response.

The control group repeated the performance of the instrumental response 
without shock in order to determine whether the effective agent in the condi- 
tioning was the performance of the instrumental response rather than the 
shock itself. 

Conditioning sessions were approximately 30 minutes long, comprising 25 
trials twice a day for 16 sessions. Sessions 1 through 7 were on a 100% rein- 
forcement schedule, while sessions 8 through 16 were on partial reinforcement. 
This was then followed by eight extinction sessions. The test for stimulus 
generalization was made following session 16 of acquisition and following 
session 8 of extinction. It consisted of randomly presenting six or seven types 
of alcohol, including mouthwash as a control. The alcohol was placed behind 
the Ss in small paper cups. A sip of alcohol was taken and held for 15 seconds, 
then expelled. Fifteen seconds later a sip of water was taken and swallowed. 
Each S was tested with the alcohol he used in conditioning. For purposes of
making comparisons as to duration and extent of generalization, each S was also
tested with the other types of alcohol.

Next a six-second sample of heartbeat was taken and multiplied by 10 in order
to obtain heart rate per minute readings. This six-second sample was taken
before and after the sip of alcohol was ingested. The means obtained on
before/after heart rate were utilized in immediate analysis of the data. Types
of alcohol and other beverages used in this study were bourbon, scotch, beer,
vodka, gin, wine, and mouthwash.
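
The scaling of a six-second pulse count to a per-minute rate, and the averaging of before and after readings, can be sketched as follows. The function names and the sample counts in the usage note are hypothetical and are given only to make the arithmetic concrete; they are not data from this study.

```python
from statistics import mean


def beats_per_minute(beats_counted, sample_seconds=6):
    """Scale a short pulse count to a per-minute rate (a 6-second count times 10)."""
    return beats_counted * 60 / sample_seconds


def mean_before_after(before_counts, after_counts):
    """Return the mean per-minute heart rate before and after the sip."""
    before = mean(beats_per_minute(b) for b in before_counts)
    after = mean(beats_per_minute(a) for a in after_counts)
    return before, after
```

For example, hypothetical six-second counts of 7 and 8 beats before the sip and 9 and 9 beats after it would give mean rates of 75 and 90 beats per minute.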


C. Results


Stimulus generalization occurred to a significant extent in both avoidance 
and escape conditioning in this experiment. As the test stimulus became increasingly
dissimilar from the conditioned stimulus, the generalization effect
became weaker.

In measurement of the generalization effect, we used heartbeat rate as
a prime indicator of the conditioned and/or generalization effects. In conditioning
alcoholics to demonstrate anxiety effects to some alcoholic beverages
and not to others, we expected that the highest heartbeat rate would occur
when the subject was confronted with the kind of drink to which he had been
conditioned (C) and that this would be naturally followed by other types of
alcoholic beverages to which he was exposed (O), the generalization effect.
We thought that we could expect to encounter the lowest heartbeat rate when
the subject was given mouthwash (M). Since the mouthwash was so dissimilar
to the original conditioned stimulus, it was felt that the generalization
effect was least likely to occur in this case. Generalization, in our case, can be
defined as C > O > M.

It was determined that the most efficient way to bring out the generalization
phenomena in half our subjects was to accept a one heartbeat criterion
between the conditioned alcohol stimulus, the generalized response to other
types of alcohol, and the mouthwash response, so long as the heartbeat measured
was always in the predicted direction of C > O > M.
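
One way to read this criterion is sketched below. It follows the expectation stated earlier that heart rate should be highest for the conditioned beverage (C), intermediate for other alcoholic beverages (O), and lowest for mouthwash (M). Treating the "one heartbeat criterion" as a minimum difference of one beat per minute at each step is an assumption, as are the function and parameter names.

```python
def shows_generalization(c_rate, o_rate, m_rate, min_diff=1):
    """True when heart rates fall in the predicted order C > O > M,
    with each step differing by at least min_diff beats per minute
    (min_diff=1 is an assumed reading of the one heartbeat criterion)."""
    return (c_rate - o_rate) >= min_diff and (o_rate - m_rate) >= min_diff
```

Under this reading, mean rates of 82, 80, and 79 would count as generalization, whereas 80, 80, and 79 would not.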

Generalization of conditioned anxiety seemed to occur to a significant degree
in the extinction trials for those Ss conditioned to a combination of “hard-soft”
alcoholic beverages, but not for those Ss conditioned to only one drink or to
one type of beverage (hard or soft). The chi-square test was used to determine
whether the generalization effect was occurring to a significant degree
here between those Ss who happened to be conditioned to one particular
alcoholic beverage (bourbon, bourbon) or to one type of alcoholic beverage
(either bourbon-vodka or beer-wine), and those Ss conditioned to a “hard-soft”
combination (e.g., bourbon and beer). Ss in avoidance, escape, and
control groups were combined so that the data could be considered and
analyzed in terms of relation of types of alcohol used in conditioning to degree
of stimulus generalization. In the acquisition stage, stimulus generalization
did not occur differentially for the three types of drinkers (χ² = 4.257, p <
.20). In extinction, however, the results were quite different. Generalization
did occur differentially for the three types of drinkers (χ² = 7.06, p <
.05). Analysis of the individual cells shows that generalization occurred most
frequently with the hard-soft combination Ss and least with those Ss who
consistently chose to drink one particular alcoholic beverage (i.e., bourbon,
bourbon). Those Ss who stuck to one type, i.e., “hard” (bourbon-vodka) or
“soft” (beer-wine), fell between these two in rate of generalization.
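
The reported significance levels can be checked with a short calculation. The sketch below assumes that each test compared three drinker types on a two-way outcome (generalized or did not generalize), which gives 2 degrees of freedom; the degrees of freedom are not stated in the text, so this is an assumption. With 2 degrees of freedom the upper-tail probability of a chi-square value x is exactly exp(-x/2), so no statistical library is needed.

```python
import math


def chi2_upper_tail_df2(chi2_value):
    """Upper-tail probability of a chi-square statistic with 2 degrees of freedom."""
    return math.exp(-chi2_value / 2.0)


if __name__ == "__main__":
    # Reported statistics; the 2 degrees of freedom are assumed, not reported.
    for label, value in (("acquisition", 4.257), ("extinction", 7.06)):
        print(f"{label}: chi-square = {value}, p = {chi2_upper_tail_df2(value):.3f}")
```

Under this assumption the acquisition value falls between the .10 and .20 levels and the extinction value falls below the .05 level, in line with the reported p < .20 and p < .05.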

All Ss were administered the Rotter (I-E) Internal vs. External Locus of
Reinforcement questionnaire. Externals generalized significantly more often
than internals (χ² = 7.759, p < .05). All Ss who persistently chose one
kind of beverage (bourbon, bourbon) were internalizers; five of seven drinkers
who chose the hard-soft combination were externalizers and successfully generalized
in the extinction phase; Ss who stuck with hard (bourbon and vodka)
or soft (beer and wine) were split 50-50 on the basis of External-Internal
Locus of Reinforcement.
D. Discussion


Generalization occurred significantly often only in the extinction phase. 
The acquisition of a conditioned response seems not to be enough to bring 
about generalized anxiety in response to those alcoholic beverages not used in 
the conditioning itself. In the treatment of alcoholism it is important that 
generalization occur because having to go to “specific” conditioning to every 
type of alcoholic beverage would turn out to be an extremely laborious process.

Generalization in extinction occurs for those who combine hard-soft drinks 
and who also happen to be predominantly externally oriented. Differences in 
our populations as to the extent of generalization might be due to the fact that 
various Ss have characteristically different ways of viewing the conditioning 
situation. Internals (one beverage drinkers) tend to perceive the training 
event (conditioning) as contingent upon their own behavior or on their own 
relatively permanent characteristics. When a change in the training occurs 
from partial reinforcement (acquisition) to extinction (no reinforcement), 
internalizers, who have already learned drinking habits and regard them as 
being under their control, tend not to resist extinction. They feel they have
already learned to drink as a consequence of their own actions. Not resisting
extinction, they generalize less. 

External Ss seem to drink any combination of hard and soft beverages. They 
tend to perceive reinforcement as following some action of their own but not 
being entirely contingent upon their own behavior. For them, reinforcement 
might be contingent upon chance, luck, or the control of someone else. When, 
in the experiment, there occurs a change from partial reinforcement to extinc- 
tion, the external Ss regard their learning of a conditioned response as being 
under someone else’s control and not just as an undoing of a drinking habit 
that was under their own control. External Ss, in extinction, tend not to 
unlearn the conditioning because they regard it as under someone else’s control 
and so beyond their power to change. 

Resistance to extinction is seen in our Ss as reflected in a greater tendency
to stimulus generalization when they describe themselves as experiencing the
locus of control as external (χ² = 7.759, p < .05). Alcoholic externalizers
tend to drink hard-soft combinations (bourbon-beer). On the assumption that
externalizers generally see society as at fault for their present condition,
they might tend to drink “anything” because they are not in control of their
own destiny.

In alcohol studies, pursuit of hypotheses suggested by stimulus generalization
research has been largely neglected. Our findings, relating as they do the
occurrence of generalization to type of alcoholic personality, weltanschauung,
and kinds of beverage usually consumed, lead us to conclude that further
research along these lines might be productive. Especially is this true of such
a suggestive finding as the relationship of external orientation to the wide
choice (hard and soft) of beverages consumed.


REFERENCES 


1. Blake, B. G. Application of behavior therapy to the treatment of alcoholism.
Behav. Res. & Ther., 1965, 3, 75-85.

2. Blake, B. G. A follow-up of alcoholics treated by behavior therapy. Behav. Res. &
Ther., 1967, 5, 89-94.

3. Crum, J., Brown, W., & Bitterman, M. The effect of partial and delayed
reinforcement on resistance to extinction. Amer. J. Psychol., 1951, 64, 228-237.

4. Farrar, C., Powell, B., & Martin, L. Punishment of alcohol consumption by apneic
paralysis. Behav. Res. & Ther., 1968, 6, 13-16.

5. Feldman, M. A reconsideration of the extinction hypothesis of warm up in motor
behavior. Psychol. Bull., 1963, 60, 452-459.

6. Feldman, M. P., & McCulloch, M. J. The application of anticipatory avoidance
learning to the treatment of homosexuality: I. Theory, technique and
preliminary results. Behav. Res. & Ther., 1965, 2, 165-183.

7. Franks, C. M. Alcohol, alcoholism and conditioning. Reprinted in Eysenck, H. J.
(Ed.), Behavior Therapy and the Neuroses. Elmsford, N.Y.: Pergamon Press,
1960.

8. Franks, C. M. Conditioning and conditioned aversion therapies in the treatment of
the alcoholic. Internat. J. Addictions, 1966, 1 (2), 61-98.

9. Hearst, E. Stress-induced breakdown of an appetitive discrimination. J. Exper.
Anal. Behav., 1965, 8, 135-146.

10. Hsu, J. Electroconditioning therapy of alcoholics: A preliminary report. Quart.
J. Stud. Alcohol, 1965, 26, 449-459.

11. Humphreys, L. The effect of random alternation of reinforcement on the
acquisition and extinction of conditioned eyelid reaction. J. Exper. Psychol., 1939,
25, 141-158.

12. James, W., & Rotter, J. B. Partial and 100% reinforcement under chance and
skill conditions. J. Exper. Psychol., 1959, 55, 397-403.

13. Kantorovich, N. An attempt at association reflex therapy in alcoholism. Psychol.
Abst., 1935, 10, 279.

14. Lemere, F., & Voegtlin, W. Conditioned reflex therapy of alcoholic addiction:
Specificity of conditioning against chronic alcoholism. Calif. & Western Med.,
1940, 53, 268-269.

15. McClelland, D., & McGowan, D. The effect of variable food reinforcement on
the strength of a secondary reward. J. Comp. & Physiol. Psychol., 1953, 90

16. Rachman, S. Aversion therapy: Chemical or electrical? Behav. Res. & Ther.,
1965, 2, 289-300.

17. Rachman, S., & Teasdale, J. Aversion Therapy and Behavior Disorders: An
Analysis. Miami, Fla.: Univ. Miami Press, 1969.

18. Rotter, J. B. Generalized expectancies for internal versus external control of
reinforcement. Psychol. Monog., 1966, 80(No. 1), Whole No. 609.

19. Solomon, R., & Brush, E. Experimentally derived conceptions of anxiety and
aversion. In Jones, N. R. (Ed.), Nebraska Symposium on Motivation. Lincoln:
Univ. Nebraska Press, 1956.

20. Solomon, R. C., Kamin, L. J., & Wynne, L. C. Traumatic avoidance learning:
The outcomes of several extinction procedures with dogs. J. Abn. & Soc.
Psychol., 1953, 48, 291-302.

21. Tillotson, K., & Fleming, R. Personality and sociologic factors in the prognosis
and treatment of chronic alcoholism. New Eng. J. Med., 1937, 217, 611-615.

22. Voegtlin, W. L. The treatment of alcoholism by establishing a conditioned
reflex. Amer. J. Med. Sci., 1940, 199, 802-810.

23. Vogler, R., Post, K., Johnson, G., Lundy, S., & Martin, P. Electrical aversion
conditioning with chronic alcoholics. Paper presented at Western Psychological
Association meeting in Vancouver, Canada, 1969.

24. Wall, J., & Allen, E. Results of hospital treatment of alcoholism. Amer. J.
Psychiat., 1944, 100, 474-479.


California State College at Los Angeles 
5151 State College Drive 
Los Angeles, California 90032 



Published as a separate and in The Journal of Psychology, 1972, 80, 319-322.


ALTRUISM: REWARD OR PUNISHMENT?*


Virginia Commonwealth University; and Central Rehabilitation Center, 
Petersburg, Virginia


ROBERT M. TIPTON AND SHARON BROWNING


A. Introduction


In recent years scientific literature has reflected considerable interest in a
class of behavior that seems to have particular relevance for this time in
history: altruism. A number of hypotheses have been offered to explain
altruistic behavior of American Ss. These include the existence of a social
responsibility norm (5, 10); guilt reduction (8, 11); dependency and obligation
(4, 9); empathy (2); modeling (6, 10); and reinforcement (1, 3).
Weiss et al. found altruistic behavior to be inherently rewarding; and, further,
they found that an altruistic act could be sufficient reinforcement to condition
an instrumental response (12).

The purpose of the present research was to determine the differential effects 
of reward and punishment (expressions of verbal reward or reproof by the
person to whom the altruistic behavior is directed) of an altruistic act on
immediate subsequent behavior in a situation calling for an altruistic response 
of a different type. 

B. Method


1. Subjects


The study took place in the parking lot of a shopping center, and the 
subjects were not aware that they were part of an experiment. In order to 
control for certain variables that have been found to be related to altruistic 
behavior, only adult, white, female Ss were selected. Since the design of the 
experiment necessitated that the Ss perform an altruistic act (helping an 
individual who dropped packages), this was also a prerequisite to their being 
included as Ss in this investigation. 


2. Procedure 
Two confederates (Cs) participated in the experiment. A 61-year-old and
a 51-year-old woman alternated in the role of CA (experimental conditions
were randomly assigned to each of the two Cs). CA, holding a bag of
groceries, stationed herself in the parking lot of the shopping center. As a
potential S got out of her automobile and came within 10 to 15 feet of CA,
CA dropped her groceries without looking at the S and immediately started
picking them up. If the S attempted to assist the C, she was either rewarded
verbally with the statement, “How clumsy of me; thank you very much,”
or reproved by the statement, “Go on; I don’t want any help and I don’t
need any help.” In a control condition randomly selected Ss were not exposed
to the grocery dropping situation.

* Received in the Editorial Office on December 24, 1971, and published immediately
at Provincetown, Massachusetts. Copyright by The Journal Press.

CB, a 23-year-old female in a wheelchair, who was unaware of which, if
either, of the experimental conditions the S had been exposed to, positioned
herself somewhat in the path of the S and appeared to be unable to get onto
the curb in her wheelchair. The dependent variable was measured dichotomously
in terms of whether or not the Ss helped CB in the wheelchair.


C. Results and Discussion


Not one of the 30 Ss in either of the two treatment groups, reward or
punishment, subsequently offered assistance to the C in the wheelchair. Eleven
of the 30 Ss (36%) in the control group (subjects who had not previously
been exposed to the grocery dropping situation) offered assistance to the C
in the wheelchair. A 2 × 2 chi square computed between the control group
and each of the two treatment groups yielded highly significant identical χ²
values of 11.13 (df = 1, p < .001).
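
The reported value can be reproduced with a short calculation. The sketch below assumes that each treatment group contained 30 Ss and that Yates' continuity correction was applied; neither point is stated explicitly in the text, and the function name is hypothetical.

```python
def yates_chi_square(a, b, c, d):
    """Yates-corrected chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    # The continuity correction subtracts n/2 from |ad - bc|, floored at zero.
    corrected = max(abs(a * d - b * c) - n / 2.0, 0.0)
    return n * corrected ** 2 / (row1 * row2 * col1 * col2)


if __name__ == "__main__":
    # Treatment group: 0 of 30 helped; control group: 11 of 30 helped.
    print(round(yates_chi_square(0, 30, 11, 19), 2))
```

Under these assumptions the calculation returns 11.13, matching the value reported for the comparison of the control group with either treatment group.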

In this study neither the verbal reward nor the Ss’ awareness that they had 
behaved altruistically in the grocery dropping situation was sufficient to in- 
fluence them to act altruistically in the second situation (offering assistance 
to the C in the wheelchair) immediately following. To the contrary, acting 
altruistically in the first situation, regardless of whether they were verbally 
rewarded or verbally punished, apparently influenced Ss not to act altruistically 
in the second situation. 

These findings appear to be incompatible with those of Weiss et al., who
found altruistic behavior to be inherently rewarding (12). In the Weiss et al.
study the task of the S, ostensibly, was to evaluate the performance of a C
under stress (receiving electric shock) by setting some dials and, after receiving
a signal, recording his evaluation (rating) by pressing a record button.
Immediately after the S pressed the button to record his evaluation, the
apparent shock to the C was obviously terminated. In this study, then, the
deliverance of someone from suffering a continuous electric shock was an
altruistic act which was found to reinforce the button pressing response,
thereby reducing response latency on subsequent trials (12).
Several differences between the present study and the aforementioned one
could account for the apparent difference in results. The altruistic task in 
the Weiss et al. study required less physical effort, was more highly structured, 
and required less interpersonal involvement than the present study. It is 
felt that while the above mentioned differences are important, the factor 
having the greatest impact on the results of this study was the demand made 
upon the Ss. In the Weiss et al. study no demand was made on the Ss to give 
up their time or alter their plans, since they were already committed to 
participate in the study under the guise of evaluating performance. In contrast, 
Ss in the present study were, in all likelihood, on a specific errand at the 
shopping center, and performance of the altruistic acts required deviation 
from their previous plans. Thus performance of one or both of the altruistic 
acts in the present study was inconsistent with the Ss’ goal-directed behavior 
and therefore a possible source of frustration, whereas the altruistic act in 
the Weiss et al. study was completely compatible with their Ss’ goal-directed
behavior. It seems that the performance of one of the two altruistic tasks 
could be handled by many of the shopping center customers, but the demands 
of the two successive tasks were excessive, and the second task was avoided because
of the potential frustration.

One other feature of this study which may have influenced the findings is 
that the Ss’ attention was not controlled and could not be directed to the 
would-be recipients of the altruistic acts. Many of the experimental subjects 
physically avoided the C in the wheelchair by walking a wide berth around her, 
and it is likely that many also avoided her “mentally.” 

An alternative explanation for the results and one that is not incompatible 
with the above is that the first altruistic act, regardless of whether accompanied 
by verbal reward or reproof, served as punishment. That is, the physical effort 
required and personal demands made upon an S acted as punishment and
served as a short term deterrent to the performance of other altruistic acts.
This interpretation is consistent with a series of studies by Estes (7) which
indicate that in many cases punishment temporarily suppresses a response but
does not weaken it.


D. Summary 


This study was conducted in a parking lot of a shopping center. The Ss,
female customers of the shopping center, were not aware that they were part
of an experiment. Confederate CA dropped her bag of groceries as a potential
S approached her in the shopping center. If the customer offered help, she
became an S in the study and was either verbally rewarded or reproved. None
of the Ss in either of these two experimental groups subsequently helped CB,
who had positioned herself in the path of the S and appeared to be having
trouble getting her wheelchair up on a curb. Of the control Ss (Ss not
exposed to the bag dropping situation) 36% helped CB in the wheelchair.
Verbal reward for an altruistic act did not positively influence an S to behave
altruistically in another situation immediately following, but, to the contrary,
it decreased the likelihood of this occurring.
A. Introduction


Attitudes are in part culturally dependent (4), and attitudes toward the 
mentally retarded largely determine whether they obtain the acceptance, en- 
couragement, and guidance necessary for socially useful and independent lives 
(8). The present study constituted part of an effort directed by the second 
author to examine attitude-behaviors toward the retarded in several nations, 
using similar samples and design and employing an instrument based on 
Jordan’s (17-20) application of Guttman’s (1, 5, 9-12) approach to the 
components (first-order: content; higher-order: intensity, closure, involvement)
and directions (joint and lateral) of attitudes.

The study was conducted in the Federal Republic of Germany, included 
in the international study because of its (a) similarity to the United States 
in economic and technological development and (b) differences in language,
history, and educational provisions for the retarded. In contrast to the United 
States, the Germans have traditionally maintained a separate school system 
for the educable retarded and other disability groups (16), while education 
for the trainable or moderately retarded has not even been considered until 
very recently (21). 


1. Facet Theory Attitude Research Design 


Although the concept attitude is usually thought of as a predisposition to
behavior, Guttman (9) defined attitude as a “delimited totality of behavior
with respect to something” (p. 51) and later proposed that such a delimited
totality of behavior be viewed as an attitude universe which could be sub- 
structured into subuniverses which are systematically related according to the 
number of identical conceptual elements they hold in common (10). 

An attitude universe is structured and defined by what Guttman (5, 10) 
has termed “facets” and “facet elements.” Five facets proposed by Jordan 
(17, 20), each with two elements, were used to define the attitude universe 
and subuniverses in the present research. These facets and elements are as 
follows: 


Facet A—Referent (others a₁ vs. self a₂).
Facet B—Referent behavior (belief b₁ vs. experience b₂).
Facet C—Actor (others c₁ vs. self c₂).
Facet D—Actor’s intergroup behavior (comparison d₁ vs. interaction d₂).
Facet E—Domain of actor’s behavior (hypothetical e₁ vs. operational e₂).


Attitude statements can be derived from combinations of the above facet 
elements, and if (a) only one element from each facet is represented in any 
given attitude statement, and if (b) each element in each facet is viewed as
either “weak” (subscript 1 elements) or “strong” (subscript 2 elements), and 
if (c) levels of attitude strength are defined according to the number of strong 
facet elements present in the semantic combinations, then the number of levels 
of attitude strength is equal to the number of facets plus one, in this case six.

Multiplication of the elements in the five facets indicates that a total of 32 
semantic combinations or attitude profiles are theoretically possible within the 
six levels of attitude strength, although Maierle (25) has reported that only 
12 of the 32 combinations appear to be semantically and logically consistent. 
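The counting argument above can be made concrete with a short enumeration. This is a minimal sketch, not part of the original study: it simply lists every way of choosing one element from each of the five two-element facets and groups the resulting profiles by the number of strong (subscript 2) elements they contain. The variable and function names are illustrative.

```python
from collections import Counter
from itertools import product

FACETS = ("A", "B", "C", "D", "E")  # five facets, two elements each

# Each profile picks element 1 (weak) or element 2 (strong) from every facet.
profiles = list(product((1, 2), repeat=len(FACETS)))


def strength(profile):
    """Number of strong (subscript 2) elements in a profile."""
    return sum(1 for element in profile if element == 2)


profiles_per_strength = Counter(strength(p) for p in profiles)

print(len(profiles))               # total semantic combinations
print(len(profiles_per_strength))  # number of attitude-strength levels
print(sorted(profiles_per_strength.items()))
```

The enumeration yields 32 profiles spread over six strength values (zero through five strong elements), which is the "number of facets plus one" rule stated in the text. It says nothing about which profiles are semantically consistent; that judgment is Maierle's (25).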

Construction of the six level Attitude-Behavior Scale: Mental Retardation 
(ABS-MR) used in this research was guided by the six combinations of facet
elements shown below. The six profiles were chosen because they appear
potentially capable of instrumentation, because of the relevance implied in
the six levels or “types,” and because they form a simplex order. The six
ABS-MR subscales range on an abstract-impersonal to concrete-behavioral
continuum, from a stereotypic level to reports of actual behavioral interaction
with the retarded. Correspondingly, each profile moving from level 1 to level 6
is characterized by one additional strong facet element in a range from no
strong elements to all strong elements, as follows:


Level 1—Societal Stereotype (a₁ b₁ c₁ d₁ e₁). Others believe others
hypothetically compare to the mentally retarded.

Level 2—Societal Norm (a₁ b₁ c₁ d₂ e₁). Others believe others
hypothetically (should) interact with the retarded.

Level 3—Personal Moral Evaluation (a₂ b₁ c₁ d₂ e₁). I believe others
hypothetically (should) interact with the retarded.

Level 4—Personal Hypothetical Action (a₂ b₁ c₂ d₂ e₁). I believe I
hypothetically (would) interact with the mentally retarded.

Level 5—Personal Feeling (a₂ b₂ c₂ d₂ e₁). I experience (affect) when I
hypothetically interact with (think about) the retarded.

Level 6—Personal Action (a₂ b₂ c₂ d₂ e₂). I have experience myself in
operationally (actually) interacting with the retarded.
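The ordering of the six levels can be checked mechanically. The sketch below encodes each level as a tuple of element subscripts for facets A through E, as read from the verbal definitions above (for example, "I believe others hypothetically interact" is taken as a₂ b₁ c₁ d₂ e₁). The encoding is an illustrative assumption, not a reproduction of the scale's scoring procedure.

```python
# Element subscripts for facets (A, B, C, D, E); 1 = weak, 2 = strong.
LEVEL_PROFILES = {
    1: (1, 1, 1, 1, 1),  # Societal Stereotype
    2: (1, 1, 1, 2, 1),  # Societal Norm
    3: (2, 1, 1, 2, 1),  # Personal Moral Evaluation
    4: (2, 1, 2, 2, 1),  # Personal Hypothetical Action
    5: (2, 2, 2, 2, 1),  # Personal Feeling
    6: (2, 2, 2, 2, 2),  # Personal Action
}

# Count of strong elements at each level.
strong_counts = [sum(e == 2 for e in LEVEL_PROFILES[k]) for k in sorted(LEVEL_PROFILES)]

# Number of facets that change between each pair of adjacent levels.
facets_changed = [
    sum(x != y for x, y in zip(LEVEL_PROFILES[k], LEVEL_PROFILES[k + 1]))
    for k in range(1, 6)
]

print(strong_counts)
print(facets_changed)
```

Each level adds exactly one strong element to the one before it, so adjacent levels differ in a single facet while distant levels differ in several. That graded semantic distance is what motivates the expectation of a simplex ordering among the level correlations.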


Whereas facets A through E defined the preceding six levels of attitude 
strength, five additional facets—F through J—were used to specify the content
of each item, while two final facets—K and L—indicated the degree of 
favorableness and intensity with which a subject could respond to items 
structured by facets A through J. The relationship among all the facets used 
to structure the ABS-MR is contained in detail elsewhere (20). 

Thus, a subject’s response mode to each ABS-MR item corresponds to a 
combination of elements in facets K and L, while the items (see Table 1 for 
examples) themselves correspond to various combinations of facets F through 
J. Construction of attitude items by this systematic a priori method obviously 
heightens content validity, while Jordan (18-20), Morin (26), and Vurdelja 
(32) have reported evidence for concurrent validity in that the ABS-MR has 
differentiated groups with known diverse characteristics in six different lan- 
guages. In support of construct validity, the former authors also obtained 
a simplex correlational relationship among the scale levels predicted by Gutt- 
man’s (10) contiguity hypothesis in 19 of 25 groups tested. In addition, 
reliability figures reported by Jordan (20) and Morin (26) compare favorably 
with the bulk of those of other attitude scales described by Shaw and 
Wright (31). 


TABLE 1
ITEM EXAMPLES (EXCEPT FORMAT) FROM THE ABS-MRᵃ ILLUSTRATING
SIX LEVEL STRUCTURE AND THE DIRECTIONS FOR EACH LEVEL

Level  Item examples

1      Directions: Other people generally believe the following things [...]
       mentally retarded as compared to those who are not retarded:
       Items:   1. Energy and vitality (1. less, 2. same, 3. more).
                7. Can maintain a good marriage (1. less able, 2. same,
                   3. more able).

2      Directions: Other people generally believe that mentally retarded [...]
       ought:
       Items:  57. To date a person who is not mentally retarded (1. dis-
                   approved, 2. undecided, 3. approved).
               71. To be regarded as dangerous (1. so regarded, 2. [...],
                   3. not so regarded).

3      Directions: In respect to people who are mentally retarded, do [...]
       that it is usually right or usually wrong:
       Items: 111. To provide special laws for their protection (1. wrong,
                   2. undecided, 3. right).
              117. To put all mentally retarded in separate classes, [...]
                   normal children (1. right, 2. not sure, 3. wrong).

4      Directions: In respect to a mentally retarded person, would you:
       Items: 127. Live in the next-door house or apartment (1. no,
                   2. don’t know, 3. yes).
              149. Hire the mentally retarded if you were an employer
                   (1. no, 2. don’t know, 3. yes).

5ᵇ     Directions: How do you actually feel toward persons who are mentally
       retarded compared to others who are not mentally retarded:
       Items:   9. Dismay (1. more, 2. same, 3. less).
               25. At ease (1. less, 2. same, 3. more).

6      Directions: [...] or contacts with the mentally retarded:
       Items:  47. Worked in the same place (1. no, 2. uncertain, 3. yes).
               73. Worked to get jobs for them (1. no, 2. not certain,
                   3. yes).

ᵃ On the 1968 edition.
ᵇ Item numbers start over on level 5 because of the use of IBM answer sheets.


2. Hypotheses


Harrelson (15) reviewed a large number of studies concerned with attitudes
toward mental retardation and found that none of the studies employed an
attitude scale based on facet theory, few were predictive or correlational in
design, and the results were largely inconsistent and sometimes contradictory.
Although no consensus existed, a number of variables seemed related to
attitudes toward the retarded, and these generally fell into four categories earlier
suggested by Jordan (17): (a) demographic factors, (b) general value
orientation, (c) knowledge of retardation, and (d) contact with the retarded.

On the basis of Harrelson’s (15) review of the literature and Jordan’s
(17) international study of attitudes toward the physically disabled, it was
hypothesized that (a) contact with the retarded, (b) age, (c) education,
(d) a belief in man’s ability to control his natural and social environment, and
(e) knowledge about retardation would all be positively related to attitudes
toward the retarded. An additional hypothesis grew out of Guttman’s (10)
contention that it should be possible to predict a pattern or structure of
relative sizes of correlations between attitude scale levels purely on the basis
of the semantic facets from which they were derived: i.e., his contiguity
hypothesis, which states that “attitude levels closer to each other in the semantic
scale of their definitions will also be closer statistically” (p. 324). Stated
differently, the matrix of level correlations will exhibit what Guttman terms
a “simplex” ordering. A simplex is characterized by (a) decreasing correlational
magnitudes moving from the diagonal where the levels intersect themselves
to the end points of both axes, and (b) closer correlations between
adjacent levels than between levels separated by one or more levels (see
hypothetical simplex matrix in Table 4).
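The two defining properties of a simplex can be expressed as a simple check on a correlation matrix. The sketch below is illustrative and is not Kaiser's procedure. It builds the hypothetical matrix of Table 4 (correlations of .90 between adjacent levels, falling by .10 for each further step) and confirms that correlations shrink with distance from the diagonal; it then shows that swapping two levels breaks the pattern. The function names are hypothetical.

```python
def is_simplex(r):
    """True if, for every i < j < k, r[i][k] is no larger than r[i][j] or r[j][k]."""
    n = len(r)
    for i in range(n):
        for j in range(i + 1, n):
            for k in range(j + 1, n):
                if r[i][k] > r[i][j] or r[i][k] > r[j][k]:
                    return False
    return True


def hypothetical_matrix(n=6):
    """Correlations of .90, .80, ... for levels 1, 2, ... steps apart."""
    return [[(100 - 10 * abs(i - j)) / 100 for j in range(n)] for i in range(n)]


def reorder(r, order):
    """Return the matrix with rows and columns permuted by `order`."""
    return [[r[i][j] for j in order] for i in order]


hyp = hypothetical_matrix()
swapped = reorder(hyp, [0, 2, 1, 3, 4, 5])  # exchange levels 2 and 3

print(is_simplex(hyp))
print(is_simplex(swapped))
```

The check is ordinal: it looks only at the relative sizes of the correlations, which is the sense in which the contiguity hypothesis predicts structure from the semantic definitions alone.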


B. Method
The design of the study was dictated by the more inclusive requirements
of the parent cross-cultural project which specified that equivalent samples,
instrumentation, and analysis procedures be used in each participating nation.
A detailed description of the methodology may be found in Harrelson (15).


1. Subjects 

Five different groups were sampled in West Germany. One sample consisted 
of 83 manager-executives (81 male, 2 female) who represented an 80 percent 
return rate in a mail survey of white collar office workers either studying for 
or actually employed in middle management positions in industry and com- 
merce throughout the Federal Republic. The other samples were drawn from 
the industrialized Dortmund area of western Germany and were tested in 
groups in parents’ assemblies, teachers’ meetings, and college classrooms. These
groups consisted of (a) 144 parents of moderately retarded children (68 male,
76 female) attending or awaiting admittance to a special school; (b) 70
parents of normal children (21 male, 49 female) enrolled in a regular
elementary school; (c) 148 teachers of the mildly retarded (86 male, 62 female)
who were either certified or working toward certification in special education
in this specialty; and (d) 74 full-time regular elementary school teachers (28
male, 46 female) taking additional college work. The subjects were assured
complete anonymity and were assumed to be fairly representative of their
respective groups in that area of Germany. All sampling and testing was
directed by the third author, a native German experienced in special education, 
educational psychology, and sociology in Germany. 


2. Instrumentation 
The criterion ABS-MR,⁴ whose development through facet theory was
previously described, was translated into German for use in this study. The
instrument consists of six subscales of 40 items each, with three response
alternatives of favorableness and three alternatives of intensity or certainty
of response for each item. The response alternatives (content and intensity)
were combined into one composite⁵ criterion score for the analysis with a
possible range of scores of 1-9 (very unfavorable to very favorable) for each
item. A questionnaire following the ABS-MR contained measures of standard
demographic variables, while additional independent variables in the
questionnaire included (a) a 16 item scale measuring knowledge about mental
retardation adapted from Haring, Stern, and Cruickshank (14), and (b) a nine
item scale adapted from Wolf (36) termed Efficacy which measures degree of
confidence in man’s ability to control his natural and social environment.

4 The 32 page ABS-MR test booklet has been deposited with NAPS. For complete
booklet order NAPS Document 01362 from CCMIC National Auxiliary Publication [...]


3. Analysis 


The data were analyzed by computer at Michigan State University. In
addition to product-moment correlations and analysis of variance statistics, a
procedure described by Kaiser (22) was used to test the degree to which a simplex
was considered approximated. Kaiser’s procedure sorts and rearranges all
possible arrangements of adjacent pairs of correlation coefficients so as to generate
the best empirically possible simplex approximation and assigns a descriptive
statistic, Q², to the original and rearranged matrices. Q² is a descriptive
statistic with a range of .00 to 1.00, and Hamersma (13) suggested that a value of
.70 be used to consider a matrix of attitude level correlations as having
approximated a simplex.

As pointed out by Guttman,⁶ Kaiser’s procedure is limited to a simplex of
the form $r_{jk} = a_j / a_k$ ($j < k$), and alternate means or procedures for simplex
analysis have been developed by Mukherjee (27) and Schonemann (30). The
recent nonmetric “smallest space analysis” programs of Guttman and Lingoes
(2, 11, 24) permit a more general analysis of a simplex of any form. These
show whether the data lie on a one-dimensional curve, even though more than
one Euclidean dimension is required; such a curve would conform to the facet
theory, yet not generate a simplex matrix. At the time of this research these
alternate methods of analysis were not available to the authors, but research in
progress by Jordan will reanalyze the simplex aspects of this research to explore
alternate methods of simplex analysis more fully.
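The restriction noted by Guttman can be illustrated with a small numerical sketch. It assumes positive, increasing scale values for the levels (the values below are hypothetical and chosen only for easy arithmetic) and builds the correlation matrix implied by the ratio form, in which the correlation between a lower and a higher level is the lower scale value divided by the higher one.

```python
def ratio_simplex(a):
    """Correlation matrix with r[j][k] = a[j] / a[k] for j < k (symmetric, unit diagonal)."""
    n = len(a)
    r = [[1.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(j + 1, n):
            r[j][k] = r[k][j] = a[j] / a[k]
    return r


# Hypothetical scale values for six levels; any positive increasing sequence would do.
a = [1, 2, 4, 8, 16, 32]
r = ratio_simplex(a)

all_positive = all(value > 0 for row in r for value in row)
print(r[0][1], r[0][2], r[0][5])
print(all_positive)
```

Under these assumptions every off-diagonal entry lies between 0 and 1, and the correlation between distant levels is the product of the correlations along the chain of adjacent levels. A matrix containing negative correlations therefore cannot be an exact simplex of this special form, whatever its ordinal pattern.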


5 The current study combines content and intensity into one single score, whereas
other studies have used content only (15, 19, 20).

6 Personal discussion between [...] and Guttman, Massachusetts Institute [...],
November [...].
C. Results


Kuder-Richardson type reliability coefficients were obtained for each of the 
samples on each of the ABS-MR scale levels via an analysis of variance proce- 
dure described by Winer (35) and are shown in Table 2. The reliability 
“estimates” (actually lower bounds) ranged from .60 to .90, with the lowest
figures appearing on the moral evaluation level 3 which measures the subject’s
perceptions of how other people should behave toward the retarded. Both the
magnitude and pattern of reliabilities are similar to those reported by Jordan 
(20) and Morin (26), and compare favorably to those of a variety of attitude 
scales discussed by Shaw and Wright (31). 

TABLE 2
ABS-MR ANALYSIS OF VARIANCE RESULTS: RELIABILITY COEFFICIENTS,
SAMPLE SIZES, AND SCALE LEVEL MEANS

                                        ABS-MR scale levels
Group                    n        1        2        3        4        5        6

Reliability coefficients: Lower bounds
Teachers of retarded   148      .64      .79      .61      .74      .79      .68
Teachers of normals     74      .76      .82      .60      .81      .88      .72
Parents of retarded    144      .74      .79      .68      .78      .84      .62
Parents of normals      70      .82      .80      .77      .90      .76      .68
Manager-executives      83      .71      .79      .65      .84      .83      .61

Meansᵃ: Combined content/intensity
Teachers of retarded   148    49.97    69.19   135.39   127.31   108.88    83.39
Teachers of normals     74    63.57    72.95   133.75   108.34    91.47    77.18
Parents of retarded    144    77.64    77.60   127.39   124.34   107.57    93.10
Parents of normals      70    89.67    89.99   121.55    95.66   102.77    67.39
Manager-executives      83    70.36    78.03   126.34   103.83    89.80    63.84
F                             73.37*   14.10*    9.20*   30.16*   23.00*   58.66*

ᵃ Adjusted for sex and sample size.
* p < .001.

The ability of the ABS-MR scale levels to differentiate among various
groups is demonstrated by the analysis of variance results shown in Table 2,
which reveal a significant F on every level. Table 2 further illustrates the
multidimensionality of the ABS-MR and the utility of the facet theory
approach to attitude scale construction. If the mean scores of the special educators
and parents of normals are added into total mean scores for these two groups,
it will be seen that the resultant totals are quite close: i.e., 574.13 and 567.03,
respectively. Yet these very similar “total” scores are made up of strikingly
dissimilar individual scale means; i.e., the parents of normals scored higher
than all other groups on levels 1 and 2 where attitude-behaviors are attributed
to “others,” while the teachers of retarded scored lowest of all groups on these
two levels. On levels 3 and 4, where attitudes are attributed to oneself, just
the reverse occurred, with special educators scoring highest and parents of
normals lowest of all groups. Further examination of Table 2 will also reveal
similar patterns for the special and regular teachers on the one hand, and the
parents of normals and manager-executives on the other with respect to
differences between scores on levels 1 and 2 versus scores on levels 3-6, while the
parents of retarded were uniformly high on all scale levels.
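The comparison of totals described above is simple arithmetic on the Table 2 means and can be verified directly. In the sketch below the level 6 mean for teachers of the retarded is read as 83.39, an assumption made because that entry is damaged in the scanned table and 83.39 is the value for which the stated total of 574.13 holds. Decimal arithmetic is used so the sums are exact.

```python
from decimal import Decimal

# Scale level means (levels 1-6) for the two groups compared in the text.
MEANS = {
    "Teachers of retarded": ["49.97", "69.19", "135.39", "127.31", "108.88", "83.39"],
    "Parents of normals": ["89.67", "89.99", "121.55", "95.66", "102.77", "67.39"],
}

totals = {group: sum(Decimal(v) for v in values) for group, values in MEANS.items()}

# Levels 1-2 attribute attitudes to "others"; levels 3-4 attribute them to oneself.
others = {group: Decimal(v[0]) + Decimal(v[1]) for group, v in MEANS.items()}
oneself = {group: Decimal(v[2]) + Decimal(v[3]) for group, v in MEANS.items()}

for group in MEANS:
    print(group, totals[group], others[group], oneself[group])
```

The totals differ by about seven points, while the two groups are roughly sixty points apart, in opposite directions, on the "others" levels and on the "oneself" levels. A single summed score would hide exactly the contrast the six-level structure is meant to expose.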

Correlations between the predictor variables and the various ABS-MR
levels for the total sample are presented in Table 3. Generally, the correlations
are of relatively low magnitude although a number of them reach statistical
significance. The results suggest that knowledge of retardation is weakly and
inversely related to attitudes on three of the six scale levels, an interpretation
tempered by the fact that the knowledge scale was not cross validated in
Germany, while a belief in man’s ability to control his environment (Efficacy) was
weakly and positively related to attitudes only on the abstract-impersonal
ABS-MR levels 1 and 2.

TABLE 3
ABS-MR CORRELATIONS WITH INDEPENDENT VARIABLES FOR TOTAL SAMPLEᵃ

Independent                   ABS-MR scale level correlations
variables                    1            2            3         4    5    6

Contact with retarded   [illegible]    -.21***      .19***    [...]
Age                        -.03         -.07         -.07     [...]
Education                  -.53***      -.26***   [illegible] [...]
Efficacy                [illegible]      .09*        .06      [...]
Knowledge of retardation   -.15**       -.06         -.05     [...]

ᵃ N = 519.
* p < .05.
** p < .01.
*** p < .001.

Amount of education and amount of contact with the retarded were
similarly related to ABS-MR attitudes in the German sample. It is noted that
the “referent” in the facet design on which the ABS-MR is based (the referent
being the person or persons to whom the attitude or behavior is attributed
by the subject) shifts from “others” on levels 1 and 2 to “self” on levels 3-6.
Both variables were negatively related to attitudes on levels 1 and 2 where
attitudes are attributed to others, and positively to levels 3 and 4 where
attitude-behaviors are attributed to oneself. In addition, the contact variable was
also positively related to the feeling and action levels 5 and 6. Age was also
positively related to the more concrete-behavioral levels 4-6 but, in contrast to
contact and education, was unrelated to the more abstract-impersonal levels
1-3.

An hypothetical matrix illustrating the characteristics of a simplex is
presented in Table 4 along with the matrix actually obtained from the total
German sample and that matrix as reordered by Kaiser’s (22) procedure
into the “best” empirically possible simplex approximation given that set of
data. The Q² statistic is also presented for each matrix.

TABLE 4
Q²s FOR HYPOTHETICAL MATRIX ILLUSTRATING SIMPLEX CHARACTERISTICS AND
OBTAINED AND REORDERED MATRICES FOR TOTAL SAMPLEᵃ

                    Six level ABS-MR correlation matrices
Matrix             1       2       3       4       5       6      Q²

Hypothetical                                                      .99
  1                -
  2              .90       -
  3              .80     .90       -
  4              .70     .80     .90       -
  5              .60     .70     .80     .90       -
  6              .50     .60     .70     .80     .90       -

Obtained                                                          .87
  1                -
  2              .50       -
  3             -.11     .06       -
  4             -.06     .02     .54       -
  5      [illegible]     .12     .18     .23       -
  6             -.05    -.14     .19     .43     .25       -

Reordered                                                         .94
  1                -
  2              .50       -
  3              .12  [illegible]  -
  4             -.14    -.05     .25       -
  5             -.06    -.11     .18     .19       -
  6              .02     .06     .25     .43     .54       -

ᵃ N = 519.
While Guttman’s (10) contiguity hypothesis places no restriction on sign
of correlation, Kaiser’s (22) statistical simplex approximation procedure tests
only a simplex model that requires positive signs, and Table 4 shows that a
total of four negative correlations occurred in the matrix obtained from the
total German sample. A similar result occurred in several matrices obtained
from the individual samples (15), and all of the negative correlations were
between either level 1 or 2 and one of the remaining scales, and reflected
negative relationships between how the subjects perceived others relating to
the retarded and how they perceived themselves relating to this [...]. The
negative signs were ignored in computing the obtained and reordered [...]
approximation Q² values shown in Table 4, but were felt to constitute a
sufficient threat to validity that a simplex of the special type tested [...]
could not be considered unequivocally approximated in the obtained [...], in
spite of a Q² in excess of Hamersma’s (13) criterion value of .70.


D. Discussion 

The extent to which the findings reported here are unique to the
groups studied or generalizable to the German nation as a whole is
unknown, while the extent to which the findings are generalizable to
groups in other countries and/or cultures participating in the international
study will be determined when all of the data have been collected and analyzed.
Perhaps the most surprising finding to emerge, in view of the historical
stereotype of the authoritarian German personality, was the apparent tendency of
the subjects to see little relationship between the attitude-behaviors of others
and their own attitude-behaviors toward the retarded. The failure of the
contiguity hypothesis, as well as at least two of the correlational hypotheses
(contact and education) to receive stronger support, may be attributed to
this phenomenon.

In retrospect, it has become clear both that the correlational hypotheses were
too general and that the ABS-MR is indeed multidimensional and is
measuring different dimensions of attitude-behaviors on the various scale levels.
The instrument seemed to discriminate adequately and differentially among
the various groups, and a real weakness in the present research lies in the
lack of coherent and systematic theoretical underpinnings at the [...]
causation level. In short, the attitude predictors, more so than the [...],
were largely inadequate in this research.

Facet theory and design may hold promise for revealing structure or [...]tic
order in psychological concepts in other areas. Hopefully the findings
presented here have demonstrated the feasibility and utility of the facet theory
approach to the study of attitudes and to the view of “measurement as structural
theory” (12). This approach appears to provide the most efficient means of
examining the “components” and “directions” of attitudes and has
found further application in the measurement of attitude-behaviors toward
the mentally ill (25, 33), attitudes toward the deaf (29), racial [...]
(3, 6, 7, 13, 34), and drug addiction/dependency (23, 28).


E. Summary 
A six level instrument, constructed according to principles of facet theory
and measuring attitudes on an abstract-impersonal to concrete-behavioral
continuum, was used to assess the attitude-behaviors of regular teachers, teachers
of the retarded, manager-executives, and parents of normal and retarded chil- 
dren in West Germany. The relationship between attitude-behaviors and de- 
mographic variables, contact with the retarded, knowledge of retardation, and 
a belief in man’s ability to control his natural and social environment was also 
measured. The instrument differentially discriminated among the groups as 
well as the predictor variables on the various scale levels. Unexpectedly, in 
view of the stereotyped authoritarian German personality, the subjects per- 
ceived little relationship between the attitudes of others and their own attitude- 
behaviors. 


FAMILY, PEER, AND ACADEMIC ORIENTATION OF
IRANIAN ADOLESCENTS* 


Pahlavi University, Shiraz, Iran 
A. A. Razavieh and A. A. Hosseini


A. Introduction

Adolescent studies in America provide evidence as to the strong orientation 
of youth toward peers (2, 5, 7, 9). There are indications in some of these 
studies that peer groups constitute a more important reference group for 
adolescents than family, teachers, and educational institutions (2). These 
findings have resulted in theoretical speculations as to the existence of a 
“society,” “culture,” or “subculture” with norms and values different from 
those held in adult society (2, 4). Some of these writers go as far as to suggest 
that it is the characteristic of this developmental period to be against the larger 
adult society, its values, and features (4). 

A number of other psychologists are skeptical about the existence of a sub- 
culture or society of adolescents and maintain that although strongly oriented 
toward peers, adolescents are basically inspired by and observe the values of 
their parents and the adult society (3, 6, 8, 10). These psychologists refuse to 
accept that a break with values of adults is the characteristic feature of adoles- 
cence. The observed deviation of adolescents from the society of adults, accord- 
ing to these writers, stems from the liberal tendencies of young adults. 

It is conceivable, however, that the strong orientation of adolescents toward 
peers and away from parents and teachers, as observed in the United States,
is a local, rather than a universal, phenomenon. Studying adolescents in other
cultures could provide new evidence that might be helpful in the settlement of 
this social, psychological, and educational issue. The present study was designed 
with this aim in mind. It seemed to the authors that data on Iranian
adolescents, compared with those obtained in the United States and considered
in light of the sharp contrasts that separate the two cultures, would contribute
to a better understanding of the adolescent period.

B. Procedures

The sample consisted of 975 adolescents selected randomly from among
approximately 17,000 applicants who were taking the Pahlavi University
entrance examination for admission to the undergraduate program. A
questionnaire composed of 32 questions was given to these subjects immediately after
they took the entrance examination in June 1969.¹ The subjects had just
finished high school and had an average age of almost 18. The ratio of boys to
girls was approximately four to one. They came from all parts of the country,
mostly from cities and middle-class families. They were applicants for the
different major areas of medicine, engineering, agriculture, and arts and
sciences. They had specialized in high school in one of three branches
(mathematics, natural sciences, and literature), with a greater frequency in natural
sciences and a lesser frequency in literature, as is the general trend in Iranian
high schools.

* Received in the Editorial Office on December 30, 1971, and published immediately
at Provincetown, Massachusetts. Copyright by The Journal Press.

The questionnaire consisted of two parts: the first part dealt with personal 
information regarding the subjects, such as their sex, age, fathers’ age and
occupation, mothers’ age and occupation, birth order, and so on. The second
part consisted of 20 questions aimed at finding the aspirations of the subjects,
their academic orientations, their family orientations, and their peer
orientations. Some of the latter questions were adaptations from Coleman (2) to
make possible comparison of the findings with those of his study on American
adolescents.


C. Results


One question was aimed directly at finding out whether the subjects were
more oriented toward parents, teachers, or peers. It was stated as follows:

Which one of the following would be hardest for you to take? 1) Your
parents’ disapproval; 2) your teachers’ disapproval; 3) breaking with
friends.

The responses given by the subjects to this question are summarized in
Table 1. These results show a rather strong orientation of the subjects toward
parents.
Only 19 percent of the girls and 24 percent of the boys chose peers and
teachers, and a strong majority stated that parents’ disapproval was the hardest
for them to take. The same question for adolescents in the United States
resulted in an even split of the responses between parents and peers (2). In
Coleman’s study 53.8 percent of the boys and 52.9 percent of the girls named
parents’ disapproval as the hardest to take, and the percentage of subjects who
stated peers’ disapproval was hardest reached 42.7 for boys and 43.4 for girls.


1 The questionnaire was given to all applicants at the time of taking the entrance
examination, and later a sample of 975 was selected randomly for analysis.
TABLE 1
PERCENTAGES OF ADOLESCENT BOYS AND GIRLS WHO RESPONDED TO THE QUESTION
ASKING IF PARENTS’, TEACHERS’, OR PEERS’ DISAPPROVAL WAS THE
HARDEST FOR THEM TO TAKE


Category of response       % of boys    % of girls
Parents’ disapproval           70           81
Teachers’ disapproval           8            7
Breaking with friends          16           12
No response                     6            0


It is apparent, from comparing these data, that Iranian adolescents are more 
oriented toward parents than their American counterparts. 

The above percentages should by no means be interpreted as the subjects’ 
lack of orientation toward teachers and peers. In fact, there are areas in which 
the subjects’ respect and regard for teachers’ views exceed their respect and 
regard for those of parents. An examination of the subjects’ responses to three 
similar questions asking for their reactions to parents’, teachers’, and peers’ 
opposition to doing something that the subjects wish to do makes this point 
clear. The questions were as follows: 

Suppose you have always wanted to join a particular club in your school, 
and now you are notified that your membership is approved. Further, suppose 
that you have just found out that your (parents) (teachers) (close friends) 
are against your membership. What do you think you would do? Would you 
think you would 1) definitely join; 2) probably join; 3) probably not join; 
4) definitely not join. 

Table 2 shows the responses of adolescent boys and girls to these three questions.
The results indicate that teachers’ opposition to the subjects’ membership
in a school club is more conducive to the stepping back of the subjects than
either parents’ or peers’ opposition. The examination of the same table shows


TABLE 2 
RESPONSES OF BOYS AND GIRLS TO THE QUESTIONS OF ACCEPTANCE OR REJECTION
OF PARENTS’, TEACHERS’, AND PEERS’ OPPOSITION TO THEIR
MEMBERSHIP IN A SCHOOL CLUB


Percentages responding to opposition of


Parents Teachers Peers
Response category Boys Girls Boys Girls Boys Girls
Definitely join 15 8 9 7 2 16
Probably join 33 28 16 28 a1 36
Probably not join 21 21 29 36 2 
Definitely not join 2942 4 O85 22 $ 
No response 2 1 2 0 3 




340 JOURNAL OF PSYCHOLOGY 


that girls tend to change their actions more than boys when parents oppose 
their membership, but the boys’ tendency is to step back when teachers are 
against their membership in their favorite club. 

A comparison of these results with those obtained by Coleman (2), using 
similar questions, shows a rather striking difference between the orientations 
of Iranian adolescents and those of American youth. Coleman’s data indicate 
a rather weak orientation of the subjects toward teachers and about an equal 
split between friends and parents. Commenting on his results, Coleman writes 
as follows (2, p. 141): 

These data indicate clearly that the teachers are not an intermediate
step between a child’s orientation to his parents and his orientation to
peers. Neither the elites nor their followers indicate that they would pay
much attention to a teacher’s wishes outside the classroom—even if it is
their favorite teacher.


The subjects’ strong orientation toward teachers is also revealed in their 
responses to another question. This question was stated as follows: 

If you could be remembered at your high school next year, which one of the 
following groups would you want to be remembered by? 1) teachers; 2) stu- 
dents. 

Sixty-seven percent of the boys and 77 percent of the girls indicated that 
they wanted to be remembered by their teachers. This strong tendency of 
Iranian youth to respect their teachers seems to be related to the great value 
they attach to school and education. This can be seen from the subjects’ re- 
sponses to a question asking about their educational ambitions. The question 
was as follows: 

What are your plans for the future? Do you intend to get 1) a Bachelor’s 
degree; 2) a Master's degree; 3) a Doctoral degree. 

Seventy percent of the boys and 66 percent of the girls indicated they intended
to enter a doctoral program. Twenty percent of the boys and 23 percent
of the girls said they would want to get a Master’s degree. The number of
subjects who indicated that they wanted to get only a Bachelor’s degree was
less than 10 percent.

The strong orientation of Iranian adolescents toward education and toward 
their teachers seems to stem from home and receive reinforcement from parents. 
A fact that supports this assumption is the subjects’ responses to the following 
question:

What do your parents think? Do they believe 1) You should have more education.
2) Your going to college would not affect your future happiness. 3) You should
give up the idea of going to college and start to work. 4) They have not stated
any opinion.

Eighty-one percent of the boys and 84 percent of the girls indicated that
their parents believe that they should go to college. Only one percent of the
subjects indicated that their parents think they should give up the idea of
going to college.

The orientation of Iranian adolescents toward peer groups is by no means
as strong as it is in the United States. No indication was found in the data as
to the existence of what is referred to as adolescent society, culture, or subculture.
The subjects’ responses to a number of questions showed some tendency
to associate with friends and talk to them when faced with problems. But it
seems that these tendencies hold only so long as they do not violate
their strong identification with their parents. The modest percentages obtained
by asking questions in the areas where one expects strong, if not full, orientation
toward peers may be taken as support for this viewpoint.

In one question the subjects were to respond to the following statement:

Suppose on a holiday you are invited to two parties: one a family party with
your parents, the other a friend’s party with a number of close friends. Which
one would you choose to attend? 1) the family party; 2) the friend’s party.

Sixty percent of the boys and 69 percent of the girls chose the friend’s party. 
Some thirty-seven percent of the boys and 31 percent of the girls indicated that 
they would go to the family party. 

The same idea was put in another question in a more direct way. The subjects
were to respond to a question in which they were asked which one (family
members, friends, or teachers) they would most enjoy being with. Although
the company of peers and friends seems to be preferable at all ages, the number
of subjects who favored peers over family members and teachers dropped considerably.
Forty-six percent of the boys and 59 percent of the girls indicated
that the company of friends was more enjoyable.

In still another question the subjects were asked about the persons with whom
they would discuss their problems and talk about their feelings. The question
was as follows:

Which of the following do you talk to about your problems and feelings?
1) father; 2) mother; 3) sister; 4) brother; 5) friends; 6) teachers; 7) none.

Only thirty-nine percent of the boys and 31 percent of the girls indicated
that they talked about their problems and feelings to friends. The rest chose
either mother, father, sister, brother, or teacher as the person of their choice for
discussing their personal feelings and problems.


D. Discussion


It is apparent from the data that the subjects are strongly oriented toward 
parents and also toward teachers and education. Evidence is not as strong, 
however, when the question is about the subjects’ orientation toward peer 
groups. It seems, from the data, that Iranian youth tend to like peers to the 
extent of socializing and associating with them, but not depending on them or 
using them as sources of status and support. Only a small minority of the 
subjects chose peers over parents when they were to make a choice. The data 
provide little indication, if any, that Iranian adolescents constitute a “society” 
or “subculture” in which values and norms deviate from the values and norms 
of adult society. 

Strong ties with and orientation toward parents in Iranian society are not 
beyond expectation. Dependence on parents and high respect for the family 
have long been distinct Persian traditions. The children of an Iranian family 
are under no pressure to leave home and seek financial independence. They 
enjoy family care and support even after they are married. These conditions
seem to alleviate the anxiety and the stress which are believed to be the char- 
acteristic features of the adolescent period. 

This is in contrast with the conditions present in the United States. The 
American adolescent is pushed toward autonomy, competence, and self- 
sufficiency. American culture moves its adolescents away from seeking comfort 
and reliance on parents. It also pushes its adults to withdraw emotional 
support, care, and affection from their adolescent boys and girls (6). These 
conditions, plus the heavy developmental tasks set forth for American adolescents
and the high expectations of society from its young adults, contribute to
the transitional anxiety and tension of the adolescent period.


Moving toward other sources of support, such as peers, and turning away
from parents and family seem to be a consequence of such social and cultural
arrangements. The strong family ties and the Iranian adolescents’ orientation
toward family and parents may change as a result of future industrialization.
At the present time, however, the Iranian family seems to provide its adolescents,
generally speaking, with a frame of reference, and adolescents seem to
observe the values of the family.

The strong orientation of the subjects toward education seems to stem from
the value education has in present Iran. Education in Iran, a rapidly developing
country, provides a major path to individual success and upward
mobility. It is the aspiration of most Iranian families, at least in urban areas,
to send their children to college and to provide them with as much education
as they can get. Academic success is almost the only criterion for evaluating
youth. Secondary school programs are basically college preparatory. Extracurricular
activities, which are an important part of school programs in the
United States, have little place in Iranian schools. All that is done in
secondary schools is academically related, and these activities constitute the
criteria for evaluating schools. Teachers in the Iranian educational system are
the central figures and provide their pupils with models who are well-liked,
well-treated, and well-received. This is probably the main reason for the
stronger orientation of the subjects toward teachers than toward parents in
the areas of academic activity.

It is possible, therefore, to understand why Iranian adolescents are not 
as strongly oriented toward peers as their American counterparts. The low 
pressure applied to these youths to leave home and to strive for financial 
independence and the strong orientation toward education are the two major 
contributors to their modest orientation toward peers. 

Ausubel (1) believes that the heavy and difficult developmental task set forth
for American adolescents by American society and the high pressure applied
to them to leave home and to seek autonomy result in their seeking a temporary
source of status other than derived status (the status bestowed on children)
and primary status (the status of adults). Ausubel calls this temporary status, 
“interim status,” the source of which is peer groups. American adolescents, 
between their puberty, when they lose derived status, and almost the age of 
21, when the majority of them obtain primary status and declare financial 
independence from their families, turn to their age-mates for social and psy- 
chological support. During this period the peer group becomes the most im- 
portant frame of reference, and friends of the same age turn out to be the 
most influential persons in their lives. 

These conditions are obviously absent in present Iranian society. Further 
industrialization of Iran might result in stronger ties with peers and weaker 
orientation toward parents. But the data presented in this paper suggest that
adolescent anxiety is in part affected by cultural and local conditions, and
adolescence by no means necessitates the loosening of family ties and the
founding of a subculture with values and norms different from those of adult society.


E. Summary 


A sample of 975 Iranian adolescents was given a questionnaire designed
to find out about their orientation toward family, peers, and teachers. Analysis
of the data revealed that Iranian adolescents are strongly oriented toward
parents, teachers, and education. The subjects’ orientation toward peers was
found to be rather modest, and little evidence was found regarding the 
existence of a “subculture” or “society” in which values and norms deviate 
from those held in adult society. An interpretation of the findings suggested 
that adolescent anxiety is partly affected by local and cultural conditions, and 
the difficulty of the adolescent period does not necessitate a subculture in which 
the values and norms are apart from those held in the society of adults. 


IMITATION OF FACIAL EXPRESSION OF EMOTION* 1 2


Washington State University 


Marshall L. Hamilton


A. Introduction


Bandura (1, 2) discusses the evidence that there can be vicarious influences 
on emotional responses. Most of the authors he refers to have studied this 
process in terms of vicarious conditioning of physiological and psychological 
responses to models in films. Some of the work of Schachter and his associates 
differs in that, while the research was conducted for other reasons, it did involve 
a more direct focus on the subject adopting the behavioral indications of emo- 
tion displayed by a model. Schachter and Singer (5) varied level of physio- 
logical arousal and available emotional cues to determine the effects of each 
factor in emotion. They found that subjects in a state of physiological arousal 
(induced by epinephrine injections), but without an adequate explanation for 
their symptoms, imitated the behavior of models displaying either anger or
euphoria. The behaviors involved included angry comments and stomping out
of the room when the model was depicting anger, or flying paper planes and
whirling a hula-hoop when the model was depicting euphoria. Imitation occurred
to a lesser degree with subjects who had been led to expect symptoms of
physiological arousal or with those who received injections of a saline solution.

The present study was intended to provide a demonstration of imitation of 
facial expressions, the most commonly used overt indication of specific emotions. 
Tomkins and McCarter (6) have developed a set of descriptions of facial ex- 
pressions for eight primary affects plus neutrality. They found that judges’ 
labelings of photographs had an average point biserial correlation of .858 with 
the affect the models had been instructed to pose. This approach apparently 
represents a significant improvement over many of the earlier efforts to find 
adequate reliability in judgments of facial expressions. 


* Received in the Editorial Office on December 31, 1971, and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 

1 This investigation was supported in part by U. S. Public Health Service Research 
Grant No. MH15591-01, with the author as principal investigator, and in part by 
funds provided by the Graduate School Research Funds of Washington State University.

The present study also examined the effects of directing the attention
of the subject to the facial expressions presented, and the effects of previous,
imitative interaction with the model. These manipulations were chosen on a
strictly exploratory basis.


B. Method


1. Subjects 
The subjects were 54 Washington State University Nursery School children
averaging 57 months of age (range = 50-67). They can best be characterized
as white and upper-middle class. The children were randomly assigned to one
of the three conditions described below, except that the groups were equated
on the age and sex distribution of the subjects.


2. Procedure

Condition A. The children were taken individually to a small room adjacent
to the nursery school by an experimenter. Slides of one model posing eight
emotional expressions (interest-excitement, enjoyment-joy, surprise-startle,
distress-anguish, fear-terror, shame-humiliation, contempt-disgust, anger-rage)
and neutrality were shown to each subject. The poses were intended to match
as closely as possible the details of facial expression described by Tomkins and
McCarter (6). The subject got an approximately life-size view of the head
and shoulders of a 31-year-old female in the slides. The slides were shown
from behind and slightly to the side of the subject onto a panel of white poster
material about four feet in front of the child. In an adjacent room an observer
watched the child from behind a one-way mirror located above the panel
on which the slides were shown. Each of the three people who served as observers
had practiced identifying the nine expressions earlier in a series of six
presentations. The slides were randomly ordered for presentation to each
subject. The task of the observer was to judge what expression the child’s
face was showing as the child viewed each slide for 15 seconds. The observer
was encouraged to make the best guess possible even when the child showed no
change in expression or made several expressions during the slide presentation.
The observer had no means of matching her judgments with the expression
actually shown in each slide other than by chance. During the presentation of
each slide, the observer was instructed to judge which one of the nine possible
facial expressions best characterized the subject. The observer was told to make
independent judgments during each slide, and that it was permissible to use
the same facial expression category more than once, if that category best
characterized the subject on more than one slide presentation. The instructions
to the subject were as follows: “In this game I want you to look at some
pictures. Be very quiet and don’t say a word until I show you all the pictures.
When you see the pictures up here (pointing to the panel), the lady will make
a face.” The experimenter
then dimmed the lights and started the automatic presentation of the slides,
while the observer rated the child’s responses from behind the mirror. The 
session was ended by allowing the subject to look at cartoon characters in a 
View-Master. 

Condition B. This condition differed from A in that the child was instructed 
that after viewing each slide he would be asked to pick out a face “just like 
the one the lady made” from a set of drawings that were shown to him. The 
drawings were nine stereotyped faces depicting the expressions used in the 
slides and presented on 3 × 5 cards displayed inside a manila folder. The observer
made her choices during the presentation of the slide, but as an added
precaution the child’s choices from the drawings were concealed from the observer.

Condition C. This condition differed from A in that it was preceded by a 
“follow-the-leader” game in which the subject was asked to imitate a series of 
behaviors by the model whom he would later see in the slides. In this interac- 
tion of about three minutes duration, the model exhibited nine behaviors such 
as touching her nose with her finger, clapping hands, and standing up and sit- 
ting down again. The model refrained from smiles or other facial expressions 
during this time. The child was praised by the experimenter with comments of 
“that’s right” or “that’s good” as he imitated each behavior. The model then 
left the room, and the slides were presented as in condition A. 


C. Results


Application of a single classification analysis of variance did not indicate 
a significant difference between the three experimental conditions (F = 1.28, 
p > .05). The mean number of emotional expressions that the observer could 
match was higher in conditions B (2.68) and C (2.82) than it was in A 
(1.92), but variability was high within conditions. 

The matching of expressions determined from the child’s face with those of
the model in the slide occurred at levels far above chance (one in nine) for sev- 
eral children in each of the conditions. There were 24 children for whom there 
were matches on three or more expressions, and the probability of this occurring 
for any single child is at about the .069 level. In both conditions B and C 
there were matches on as high as seven out of the nine slides. Application of a 
coefficient of contingency (4) to the match between the nine projected and 




nine judged expressions for all subjects yielded a Z score of 8.60 (p < .001). 
This reflected the fact that for each expression the plurality of judged ex- 
pressions corresponded with the expression actually projected on that trial. 
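
The chance level of about .069 quoted above for three or more matches can be checked with a short calculation. The sketch below is illustrative only and is not part of the original analysis: it assumes that each of the nine slides is an independent trial with a one-in-nine chance of a correct match, and the function name `prob_at_least` is hypothetical.

```python
from math import comb


def prob_at_least(k_min: int, n_trials: int, p_success: float) -> float:
    """Upper-tail binomial probability P(X >= k_min).

    Assumes independent trials with a constant success probability,
    which is an idealization of the observer's guessing task.
    """
    return sum(
        comb(n_trials, k) * p_success**k * (1 - p_success) ** (n_trials - k)
        for k in range(k_min, n_trials + 1)
    )


# Nine slides; the chance of a correct match on each is taken to be 1/9.
p_three_or_more = prob_at_least(3, 9, 1 / 9)
print(round(p_three_or_more, 3))  # 0.069
```

Under these assumptions the tail probability agrees with the .069 level reported for a single child.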

Data obtained from two observers rating 15 of the children, five from each 
experimental condition, indicated an average 27.4% rate of interobserver agree- 
ment in judging the children’s facial expressions. Additional data on interob- 
server agreement were obtained the following year with use of a group of 
children in the same nursery school who were essentially identical to the previ- 
ous sample in age and social class. These observers were also trained briefly 
by the procedure described earlier. Two observers rated 20 of the children (11 
females, nine males) as they participated in an experimental procedure like that 
of condition A. The observers were asked to indicate which of their nine judg- 
ments on each child were essentially guesses, and which were based on some 
confidence that a fairly clear emotional expression had been evident. The ob- 
servers were in agreement in 18.6% of the judgments which they regarded as 
guesses, and only 11.5% of the judgments coincided correctly with the facial 
expression in the slide. In the judgments made with some confidence there was 
a 57.9% rate of agreement and a 36.8% rate of correspondence with the pro- 
jected expression. Prior to the experimental procedure the two observers ob- 
tained simultaneous but independent three-minute observations of the facial 
expressions of each of 29 children as they engaged in their normal nursery school 
activities. The 29 children included the 20 who were later observed in the 
experiment. The observers averaged an 84.5% rate of agreement on the num- 
ber of emotional expressions shown by each child, and they averaged 76.1% 
agreement on the specific expressions observed.

An indication of the children’s ability to recognize the projected expressions
and match them with stereotyped drawings of the same expressions was obtained
in condition B. The children accurately matched slides with drawings
at a rate of only 25.9%. There was no indication that accurate pairing of
expressions by the child was related to the observer’s success in identifying the
child’s facial expression, since the observers identified slightly more expressions
(30.8%) when the subject mismatched expressions than when he
matched expressions (26.2%).


D. Discussion 


On the assumption that response to a live model parallels the response to
a model presented in slides, it is apparent that children spontaneously imitate
the facial expressions of a model. This occurred even in the relatively uninvolving
condition in which a child simply sat and watched nine expressions presented
at 15-second intervals. There were obvious individual differences in the imitative
tendencies of the children. In a few cases the failure of the observers to
identify correctly nine out of the nine expressions presented may have been due 
only to the difficulty in judging expressions accurately, as the imitative efforts 
of the children were quite evident. At the other extreme a number of children 
remained quite expressionless throughout the presentation. 

The trend of the differences between conditions suggests that the highest 
rates of imitation may occur in situations requiring more attention to the 
model’s expressions, or when there is a history of interaction with the model. 
The lack of a significant difference between experimental conditions may reflect 
both the weakness of the exploratory manipulations and the fact that there are 
several points where errors in the transmission of expressions can occur in the 
present procedure.

The age of the subjects in the present study may not have been optimal for 
the accurate recognition of emotions. Their inaccuracy in matching slides and 
drawings of expressions seems consistent with Davitz’ (3, p. 181) finding that 
the performance of five-year-olds in recognizing vocal expression of emotions 
was about half as accurate as that of 12-year-olds. The ability of the child to 
imitate effectively an expression that he has recognized is, of course, an addi- 
tional, unevaluated variable. 

The figures obtained for interobserver agreement indicate the difficulty of 
the task performed by the observers. The agreement obtained as the observers 
watched the spontaneous expressions of the children in their nursery school 
activities was at a satisfactory level. However, it is quite possible that the task 
of the observers was easier in that setting where they could observe both the 
event that preceded the child’s emotional expression and the expression itself. 
Knowledge of the situation has been found to facilitate the identification of 
emotional expressions in past research. Interobserver agreement was lower in 
the experimental situation, particularly in the instances in which the observers 
felt that they were guessing. Those instances usually involved either a child 
who made no particular response during the presentation of the slide, or a 
child who made several facial expressions during the presentation, none of 
which seemed dominant. Both the agreement between observers and the rate of 
correspondence between judged expressions and expressions projected improved 
on the trials in which the observers expressed some confidence that they had 
observed a dominant facial expression. 

It should be emphasized that the evidence obtained regarding the children’s
imitation of the facial expressions of a model could not reasonably be disregarded
because of the opportunity for error that was evident in the children’s
identification of modeled expressions and in the observers’ identification of the
children’s expressions. Such errors should have tended to reduce the findings to
chance levels, rather than cause the above chance levels of matching that were 
found. 

The present study did not, of course, demonstrate the vicarious occurrence 
of a “whole” emotional response. However, facial expressions are regarded by 
most investigators of emotion as central in denoting specific emotions. The 
present study was one step in demonstrating vicarious influences on specific emo- 
tional responses. 


E. Summary 


In an effort to demonstrate imitative reactions to facial expressions of emo- 
tion, preschool children were exposed to a series of slides in which a model 
posed different emotions. Conditions of directing attention to the model’s ex- 
pressions or prior interaction with the model were varied. An observer sought 
to identify what emotion the child was seeing from his facial reactions. While 
the conditions of presentation did not yield significant differences, the children 
imitated facial expressions at above chance levels. 


AN ADDITIVE METHOD FOR COMBINING PROBABILITY
VALUES FROM INDEPENDENT EXPERIMENTS* 


The University of Calgary 


Eugene S. Edgington


A. Introduction


Jones and Fiske (2) described a method of combining probability values 
from two or more independent experiments. The method is that which is now 
familiar to psychologists and involves determination of the significance of a set 
of probability values by reference to chi-square tables. The significance shows 
the probability under the null hypothesis of getting such a small product of 
probability values as an obtained product. In this paper that method will be 
called the multiplicative method, and the term additive method will refer to a 
proposed alternative which combines the probability values by addition. 


B. The Multiplicative Method


The multiplicative method is discussed in several psychological statistics
books (e.g., 1, p. 248-250; 4, p. 231-233; 6, p. 43-45). To combine probabilities
$p_1, p_2, \ldots, p_n$ from n independent experiments, the first step is to determine
from tables the natural logarithms of the reciprocals of the values. The second 
step is to sum the logarithms. The sum is multiplied by 2, and this product is 
treated as a chi-square value with 2n degrees of freedom, where n is the number 
of ps that are pooled, to determine the significance of the overall combination 
of ps. Although the arithmetic operation performed by the user of this method 
is addition, it is logarithms that are added, and therefore in fact it is the 
significance of the product, not the sum, that is determined. 

For example, suppose three experiments provided these probability values:
.10, .10, and .20. The reciprocals are 10, 10, and 5, and the natural logarithms
of the reciprocals are 2.3026, 2.3026, and 1.6094, which total to 6.2146. Two
times 6.2146 is 12.4292. Chi-square tables show that the value 12.4292 with
2n = 6 degrees of freedom is significant at approximately the .05 level (the
tabled .05 critical value is 12.59, and the exact tail probability is about .053).
That is, the product (.10) (.10) (.20) = .002 is significant at approximately
the .05 level. Any other three
probability values whose product is .002 would yield the same sum of logarithms
and therefore would be significant at the same level.
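
The steps of the multiplicative method can be sketched in a few lines of code. This is a minimal illustration rather than a reference implementation: the function name `multiplicative_combined_p` is hypothetical, and instead of consulting chi-square tables it uses the closed-form upper tail of the chi-square distribution, which is exact when the degrees of freedom are even (here 2n).

```python
from math import exp, factorial, log


def multiplicative_combined_p(p_values):
    """Combine independent p values by the product (chi-square) method.

    Returns the chi-square statistic (2n degrees of freedom) and its
    upper-tail probability.
    """
    n = len(p_values)
    # Sum of the natural logarithms of the reciprocals of the p values.
    log_sum = sum(log(1.0 / p) for p in p_values)
    chi_square = 2.0 * log_sum
    # Closed-form upper tail for chi-square with an even number (2n) of df:
    # exp(-x/2) * sum_{k=0}^{n-1} (x/2)^k / k!, where x/2 equals log_sum.
    tail = exp(-log_sum) * sum(log_sum**k / factorial(k) for k in range(n))
    return chi_square, tail


chi_square, tail = multiplicative_combined_p([0.10, 0.10, 0.20])
print(round(chi_square, 4), round(tail, 3))  # 12.4292 0.053
```

For the worked example the statistic reproduces the 12.4292 obtained by hand, and the tail probability comes out near .053, close to the .05 level read from the tables.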


* Received in the Editorial Office on January 4, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
C. The Additive Method


The additive method, which will now be described, is an alternative to the 
multiplicative method for combining probabilities. The two methods serve the 
same function, but the additive method will be shown to be more powerful. 

To illustrate the rationale for the additive method, suppose one experiment
gave a p of .30, and a second gave a p of .20. The sum of the ps is .50. When
the null hypothesis is true, what is the probability of getting such a small sum
of ps from two independent experiments? All pairs of ps from zero to 1 are
equally probable, and so the desired probability is the fraction of the pairs
which give a sum of .50 or less. Thus the area enclosed by the triangle in
Figure 1 represents the probability of such a small sum as .50, since the hypotenuse
consists of all points for which the sum of the X and Y coordinates is .50.
The area of this triangle is $(.50)^2/2 = .125$; that is, the area of the triangle
is 1/8th of the unit area representing all possible combinations of ps from
zero to 1. Thus, the overall probability of such a small sum as .50 is .125.
And it can be seen that for any sum no greater than 1.00, the overall probability
will be $S^2/2$, where S is the sum of the two ps.

The overall probability of three ps adding to .50 or less can be determined
in a similar fashion. Imagine a third axis of .50 units in length to extend
from the origin in Figure 1 at right angles to the vertical and horizontal
axes. Then connect the value .50 on the axis with the .50 values shown for
the X and Y axis in Figure 1, by means of straight lines. These two lines,
along with the hypotenuse of the triangle in Figure 1, form a plane consisting
of all points where the sum of the X, Y, and Z coordinates is .50. The
volume within the tetrahedron under the plane represents the probability of
a sum of three ps being .50 or less. Such a tetrahedron has a volume of $S^3/6$,
or, in this case, about .021. The general formula for the overall probability of
a sum of S or less, for any S no greater than 1, no matter how many ps are
added, is

$$\frac{S^n}{n!} \qquad [1]$$

where n is the number of ps being added.

Formula 1 will give an overall probability value that is too large when S
exceeds 1, because it will include combinations involving individual
probability values greater than 1. To take account of the unrealizability of a p
greater than 1, correction terms must be added to the formula. The general
formula with corrections appended is

$$\frac{S^n}{n!} - \binom{n}{1}\frac{(S-1)^n}{n!} + \binom{n}{2}\frac{(S-2)^n}{n!} - \binom{n}{3}\frac{(S-3)^n}{n!} + \cdots \qquad [2]$$

where the minus and plus signs preceding the terms alternate, and where
additional terms are used as long as the number subtracted from S in the
numerator is less than S. A mathematical derivation of Formula 2 is given in
detail in Section E of this paper.

Ordinarily one would not combine ps from as many as seven experiments,
but such an example will be used here to illustrate the application of Formula
2. Suppose that seven experiments provided ps of .20, .35, .40, .40, .35, .40,
and .40. The sum is 2.50. The overall probability of getting S ≤ 2.50 is

$$\frac{(2.50)^7}{7!} - 7\,\frac{(1.50)^7}{7!} + 21\,\frac{(.50)^7}{7!} = .097$$

The computation can of course be simplified by using logarithm tables for
raising numbers to the seventh power.
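
As a rough computational sketch of Formula 2 (the function name `additive_method` is an assumption introduced here, not part of the original paper), the alternating terms can be accumulated for as long as the integer subtracted from S remains smaller than S:

```python
from math import comb, factorial

def additive_method(ps):
    """Overall probability of a sum of n independent p values as small as S (Formula 2)."""
    n = len(ps)
    s = sum(ps)
    total = 0.0
    j = 0
    # Use terms only while the number subtracted from S is less than S.
    while j <= n and s - j > 0:
        total += (-1) ** j * comb(n, j) * (s - j) ** n
        j += 1
    return total / factorial(n)

# Two ps of .30 and .20: only the first term applies, S^2/2.
print(additive_method([0.30, 0.20]))
# The seven-experiment example, S = 2.50: three terms are needed.
print(additive_method([0.20, 0.35, 0.40, 0.40, 0.35, 0.40, 0.40]))
```

With S no greater than 1 the loop stops after the first term, so the function reduces to Formula 1.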

Formula 2 gives the exact probability for S when ps are taken from a
continuous distribution of probability values from zero to 1. The ps that
experimenters obtain, however, will be taken from tables for t, F, the Mann-
Whitney U, and other statistics where only certain probability values have
been tabulated. The experimenter therefore must take his p values from a
discrete distribution of a small number of ps in a table. For some values of
S this restriction on the available p values reduces the number of possible
combinations that provide a given sum or less, making the exact probability
less than the probability given by Formula 2. This is shown in Section E.
An example of the conservativeness of Formula 2 for such discrete
distributions of ps will now be given.

Refer again to Figure 1. The triangle is divided into sections to show the
various possible combinations of p values that are obtainable from a discrete
probability table providing ps at intervals of .10 (i.e., .10, .20, ..., 1.00) and
which total no more than .50. The corners of the squares in Figure 1 represent
those combinations, and the area of each square represents the probability of
the combination of ps whose coordinates form the upper right corner of the
square. To illustrate, the area of the top square represents the probability of
getting a p of .10 from the first experiment and a p of .40 from the second.
The probability table forces the experimenter to assign a p of .10 to all results
which with a more complete table would have been given values between zero
and .10, and a value of .40 to all results with exact ps between .30 and .40.
The probability of a p between zero and .10 is .10, and the probability of a p
between .30 and .40 is also .10, and so the probability of the joint occurrence
of a p of .10 for the first experiment and of .40 for the second experiment is
(.10) (.10) = .01, which is the area of the top square. There are 10 squares,
and so the total area representing ps totalling .50 or less is .10, which is the
probability, given a true null hypothesis and two independent experiments, of
getting a sum of ps of .50 or less, with use of the described probability table.

The application of Formula 2 to determine the overall probability of such
a small sum as .50 for two independent experiments involves the use of only
the first term of the formula, which is $S^2/2$ in this case, and which earlier
gave the value .125. This probability, .125, based on the assumption of
continuity of the probability distribution, is larger than the value for the
discrete case by .125 - .10 = .025, which is the area of the five small triangles
that fall along the hypotenuse in Figure 1. If one used a table with values .01,
.02, ..., 1.00, the area for the additional possible combinations totalling .50
or less would include the major portion of the five small triangles and provide
an overall probability of .1225, reducing the discrepancy between the
probability value based on the assumption of continuity and the exact probability
obtained by considering the discrete distribution. Thus Formula 2 provides a
conservative value, however discontinuous the distribution of probability values
in the probability tables.
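
The counting argument for discrete tables can be checked directly. The sketch below is illustrative only; the helper name is hypothetical, and sums are expressed as whole numbers of table steps so that no rounding enters the comparison. It counts the equally probable ordered pairs of tabled ps whose sum does not exceed .50, for a table with 10 entries and for one with 100 entries.

```python
def discrete_pair_probability(k, s_steps):
    """P(sum of two tabled ps <= s_steps/k) for a table listing 1/k, 2/k, ..., 1."""
    count = sum(
        1
        for i in range(1, k + 1)
        for j in range(1, k + 1)
        if i + j <= s_steps
    )
    return count / k ** 2  # each ordered pair has probability 1/k^2

continuous = 0.50 ** 2 / 2                    # first term of Formula 2
coarse = discrete_pair_probability(10, 5)     # table .10, .20, ..., 1.00
fine = discrete_pair_probability(100, 50)     # table .01, .02, ..., 1.00
print(coarse, fine, continuous)
```

The coarse table yields 10 qualifying pairs and the fine table 1225, so both discrete probabilities fall below the continuous value, and the finer table comes closer to it.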


D. THE RELATIVE EFFECTIVENESS OF THE ADDITIVE AND
THE MULTIPLICATIVE METHODS


When there is no treatment effect in an experiment, all ps from zero to
1 are equally probable, and consequently every pair of ps from two independent
experiments is equally probable when there are no treatment effects. With
treatment effects, however, there is a smaller probability of combinations of ps
involving large values, and a larger probability of combinations involving
small p values. It will now be shown that this consequence of treatment effects
makes the additive method more effective than the multiplicative method.

In Figure 2 the area under the slanted straight line (Y = .447 — X) 
consists of combinations of two ps whose sum is significant at the .10 level by
the additive method. The area under the parabola (Y = .021/X) consists of 
combinations of two ps whose product is significant at the .10 level by the 
multiplicative method. The p combinations that are significant for the additive 
but not the multiplicative method are those under the straight line but above 
the parabola; that is, the area enclosed between the two lines. Combinations 
significant by the multiplicative but not the additive method are represented 
by the two tails of the parabola outside the corner formed by the straight line. 
Now it can be seen that a treatment effect increases the probability of getting 
combinations in the corner, where the additive method is more sensitive, and 
decreases the probability of combinations in the tails of the parabola, where 
the multiplicative method is more sensitive. Graphs of the relationships be- 
tween the straight lines and parabolas for other levels of significance would 
show that for those levels also a treatment effect tends to provide more signifi- 
cant results for the additive than for the multiplicative method. 
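
The two regions of Figure 2 can be explored numerically. The following sketch is an illustration under stated assumptions: the function names are hypothetical, and the multiplicative probability for two ps is obtained from the closed-form chi-square tail with 4 degrees of freedom, which simplifies to P(1 - ln P) for a product P. One pair is taken from the corner region and one from a tail of the curve.

```python
import math

def additive_p(p1, p2):
    """Formula 2 for two ps: S^2/2, less (S - 1)^2 when S exceeds 1."""
    s = p1 + p2
    value = s * s / 2.0
    if s > 1:
        value -= (s - 1) ** 2
    return value

def multiplicative_p(p1, p2):
    """Chi-square (4 df) tail for -2 ln(p1 * p2), in closed form."""
    prod = p1 * p2
    return prod * (1.0 - math.log(prod))

for pair in [(0.20, 0.20), (0.01, 0.90)]:
    a, m = additive_p(*pair), multiplicative_p(*pair)
    print(pair, round(a, 4), round(m, 4), "additive" if a < m else "multiplicative")
```

The pair (.20, .20) is significant at the .10 level only by the additive method, whereas the pair (.01, .90) is significant at that level only by the multiplicative method.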

Given a treatment effect of any amount, then, the additive method will
lead to significance at a given level more often than the multiplicative method.
The greater the treatment effect the greater the superiority of the additive
method. This superiority, however, is of theoretical interest only, unless it can
be shown that with realistic amounts of treatment effect the advantage of
the additive method is large enough to have practical significance. Most
experimenters would regard as realistic the assumption that they will not get p
values larger than .50 and therefore would be interested in the relative
effectiveness of the two methods when no single p is larger than .50. Other persons
would regard a statistical comparison of the relative effectiveness of the methods
to be of practical interest when the maximum p is .40, or even less. In the
following analysis, therefore, the effect of various cutoffs for individual p
values will be considered. Within the range of the p values under consideration
(i.e., those below a specified cutoff) all combinations will be treated as equally
probable.
The first comparison will assume a continuous distribution of p values up
to a value of .343. (This value was selected to permit a comparison with the
effect of a cutoff of .343 for a particular discrete distribution.) The areas
under the straight line and the parabola between zero and .343 along both
axes of Figure 2 were determined by calculus to provide a comparison of
probabilities of getting combinations significant at the .10 level. Similar
comparisons were made for the .05, .025, .01, and .005 levels, with use of the
areas under the appropriate straight lines and parabolas. The left half of
Table 1, for a continuous distribution, shows the results of the analysis. The
column “A/M” shows that there are about one and a half times as many
results significant for the additive as for the multiplicative method.


TABLE 1
PROBABILITIES OF GETTING RESULTS SIGNIFICANT AT VARIOUS SIGNIFICANCE LEVELS
FOR THE ADDITIVE AND MULTIPLICATIVE METHODS, USING A PROBABILITY
DISTRIBUTION TRUNCATED AT .343

                      Probability of significant results

               Continuous probability distribution    Discrete probability distribution
Significance
level          Additive   Multiplicative   A/M^a      Additive   Multiplicative   A/M^a
.10            .089       .056             1.6        .072       .044             1.6
.05            .05        .031             1.6        .031       .019             1.6
.025           .025       .017             1.5        .014       .010             1.4
.01            .01        .007             1.4        .006       .002             3.0
.005           .005       .004             1.2        .002       .001             2.0

^a The probability for the additive method divided by the probability for the
multiplicative method.


Earlier, the distinction between the distribution of combinations of ps from
continuous and discrete distributions was stressed. Inasmuch as experimenters
must obtain ps from published probability tables which provide discrete
distributions, it is appropriate to compare the effectiveness of the two methods
when a typical probability table is used. The table selected for this purpose was
one that is used for determining the significance of the Mann-Whitney U
statistic, where there are four subjects in one group and four in the other
(5, p. 271). The p values in the table are these: .014, .029, .057, .100, .171,
.243, .343, .443, and .557. The probability values in the right half of Table 1
are based on ps in this distribution up to and including .343. The effect of the
discontinuity can be seen; there are fewer significant results at each level
than for the continuous probability distribution. The probability values in the
right half of the table were obtained by the means employed with the discrete
distribution in the example associated with Figure 1. In the present example,
however, the various possible combinations are not equally probable. For
instance, the probability of a p of .014 for the first experiment and .057 for
the second is (.014 - 0) (.057 - .029) = .000392, whereas the probability
of .343 on the first experiment and .243 on the second is (.343 - .243)
(.243 - .171) = .0072. The probabilities of occurrence of the remaining
47 combinations of ps were also computed in this way. Each combination of
ps was evaluated by both the additive and the multiplicative methods to
determine at what levels that combination was significant. Then, in
consideration of the probability of occurrence of each combination, the probability of
getting a combination significant at a particular significance level by
application of the additive method was determined, and the same was done for the
multiplicative method.

As expected, the probability of a significant combination by either method 
is less for the discrete than the continuous case for all significance levels, but 
the relative superiority of the additive method is maintained for the discrete 
case. This is shown by the size of the A/M ratio. 

The A/M ratio for significance at a given level is one measure of the
relative effectiveness of the additive and multiplicative methods, but an
alternative measure may be of interest. That measure is the probability that a
combination of ps will yield a higher level of significance when evaluated by the
additive method than when evaluated by the multiplicative method. A
comparison of the methods, with use of this measure of relative effectiveness, is
given in Table 2. The significance levels used are those typical of relatively
complete tables of chi square which are used for the multiplicative method:
.10, .05, .025, .01, .005, and .0005. To show the effect of the location of the
cutoff, the relative effectiveness was computed for the Mann-Whitney U
probability distribution specified earlier: .014, .029, .057, .100, .171, .243,
.343, .443, and .557, with successive cutoffs of .171, .243, .343, .443, and .557.
The superiority of the additive method diminishes as the cutoff point is raised
but is still considerable with a cutoff of .557.


TABLE 2
PROBABILITIES OF ONE METHOD GIVING MORE SIGNIFICANT RESULTS THAN THE OTHER,
USING A DISCRETE PROBABILITY DISTRIBUTION WITH VARIOUS CUTOFFS

              Probability of more significant results

Cutoff
values    Additive method    Multiplicative method    Neither method
.171      .89                0                        .11
.243      .87                .07                      .06
.343      .75                .09                      .16
.443      .70                .16                      .14
.557      .64                .22                      .14




Table 2 is based on one particular probability table, and other probability
tables would be expected to give somewhat different, although similar, values
for the same cutoff points. To illustrate, consider the distribution .001, .01,
.02, .05, .10, and .20, which are the probability values given for the t test in
many statistics books. With use of this distribution of ps, which has a cutoff
at .20, it is found that the probability of the additive method giving a more
significant result than the multiplicative method (with the levels of significance
used for the comparison shown in Table 2) is .90, the probability of the
multiplicative method giving a more significant result is .01, and the probability of
the two methods giving the same level of significance is .09. These values are
similar to those in Table 2 for cutoff values of .171 and .243.

Examples given by writers to illustrate the use of the multiplicative method
suggest that the power of the additive method relative to the multiplicative
method is not diminished when more than two ps are combined. Guilford (1,
p. 249) gave an example of the pooling of three probability values, .02, .06,
and .12, and in using the multiplicative method got a chi square significant at
the .01 level, but not at the .005 level. The additive method applied to those
three ps gives a probability of $(.20)^3/6 = .0013$, or significance at the .005
level. Winer (6, p. 44) gave an example for five p values, using .07, .11, .15,
.20, and .30, and got significance at the .05 level by application of the
multiplicative method. The additive method gives $(.83)^5/120 = .0033$, or
significance at the .005 level.
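
Both textbook examples can be rerun side by side. The sketch below is illustrative only: the function names are hypothetical, the additive value uses Formula 2, and the multiplicative value is computed from the closed-form chi-square tail for even degrees of freedom instead of being read from a table.

```python
import math
from math import comb, factorial

def additive_method(ps):
    """Formula 2: probability of a sum of n p values as small as S."""
    n, s = len(ps), sum(ps)
    total, j = 0.0, 0
    while j <= n and s - j > 0:
        total += (-1) ** j * comb(n, j) * (s - j) ** n
        j += 1
    return total / factorial(n)

def multiplicative_method(ps):
    """Tail probability of chi-square = 2 * sum ln(1/p) with 2n degrees of freedom."""
    half = sum(math.log(1.0 / p) for p in ps)  # chi-square divided by 2
    term, total = 1.0, 1.0
    for i in range(1, len(ps)):                # closed form for even df
        term *= half / i
        total += term
    return math.exp(-half) * total

examples = {
    "three ps": [0.02, 0.06, 0.12],
    "five ps": [0.07, 0.11, 0.15, 0.20, 0.30],
}
for label, ps in examples.items():
    print(label, round(additive_method(ps), 4), round(multiplicative_method(ps), 4))
```

In both cases the additive probability falls below .005, while the multiplicative probability lies between .005 and .01 for the three-value example and between .025 and .05 for the five-value example, in agreement with the significance levels quoted above.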


E. THE DERIVATION OF FORMULA 2


Under the null hypothesis each p in a uniform distribution of k different
ps (like .01, .02, ..., 1.00, where k = 100) has a probability of 1/k of
being obtained, and so any ordered combination of n such ps has a probability
of $1/k^n$. Thus the probability that n ps will provide a sum as small as S is
$1/k^n$ multiplied by the number of ordered combinations with a sum as small
as S. The problem then is to determine a formula showing the number of
ordered combinations providing a sum as small as S because that formula
multiplied by $1/k^n$ gives the desired probability.

Transform a uniform distribution of ps by multiplying each p by k (the
number of p values) to provide the distribution of integers 1, 2, ..., k. For
example, when k = 100, the distribution is 1, 2, ..., 100, and when k =
1000, the distribution is 1, 2, ..., 1000. The integers can be regarded as
frequencies, and so each p is associated with a frequency kp and each sum S is
associated with a total frequency kS. The problem of determining the number
of ordered combinations providing a sum as small as S can now be expressed
in a mathematically equivalent form which can be answered by application of
standard formulas from combinatorial mathematics: How many
distinguishable assignments of kS or fewer indistinguishable objects to n cells can be
made with no less than one and no more than k objects per cell? (One
assignment would be distinguishable from another solely on the basis of the
frequencies within the individual cells.) The lower and upper limits on the
number of objects per cell correspond to the empirical requirement that every
one of the n experiments must provide a p, and that no p can be larger
than 1.

Netto (3, p. 15) showed that the number of distinguishable assignments
of kS or fewer indistinguishable objects to n cells, with at least one object per
cell, given no upper limit on the number per cell, is

$$\binom{n-1}{n-1} + \binom{n}{n-1} + \cdots + \binom{kS-1}{n-1} = \binom{kS}{n}$$

For example, four or fewer indistinguishable objects can be assigned to three
cells in $\binom{4}{3} = 4$ distinguishable ways; namely, the assignments providing
these frequencies: {1, 1, 1}, {1, 1, 2}, {1, 2, 1}, and {2, 1, 1}. (In terms of
probability values this result implies that there are four ordered combinations
of three ps that sum to .04 or less when k = 100, and four ordered
combinations that sum to .004 or less when k = 1000.)
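
The counting result can be verified by brute force for small cases. In this sketch the function name is hypothetical; it enumerates every ordered assignment of positive frequencies to the n cells and compares the count with the binomial coefficient.

```python
from itertools import product
from math import comb

def count_assignments(total, n):
    """Count ordered n-tuples of positive integers whose sum is at most `total`."""
    return sum(
        1
        for cells in product(range(1, total + 1), repeat=n)
        if sum(cells) <= total
    )

# Four or fewer objects in three cells, as in the example above.
print(count_assignments(4, 3), comb(4, 3))
# A slightly larger case: six or fewer objects in three cells.
print(count_assignments(6, 3), comb(6, 3))
```

The enumeration grows as total^n, so it is suitable only as a check on small values, not as a substitute for the formula.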


The formula $\binom{kS}{n}$ gives the total number of distinguishable
assignments of kS or fewer objects to n cells, with at least one object per cell, with
no upper limit on the number of objects per cell. The lack of an upper limit
does not matter when S ≤ 1, because no individual p value could exceed 1.
Thus, when S ≤ 1 or, alternatively, when kS ≤ k, the total number of
assignments is given by the formula $\binom{kS}{n}$, which is

$$\frac{(kS)!}{(kS-n)!\,n!} \qquad [3]$$

When, however, S > 1, kS > k, and consequently the formula must be
modified to subtract those assignments where one or more of the cells contains
more than k objects because such assignments correspond to physically
unrealizable situations in which one or more experiments provide ps greater than
1. This can be done by affixing correction terms to the formula.
A correction for inappropriate inclusion of assignments with more than k
objects to a cell can be made in this manner. Fill the first cell with k objects.
Then substitute kS - k for kS in Formula 3 to determine the number of
assignments of kS - k or fewer objects to the n cells with at least one assigned
per cell. All of these assignments are inappropriate because the first cell has
a frequency exceeding k. Now if kS is no greater than 2k (that is, if S is no
greater than 2), then only one cell can contain more than k objects. This cell
could be any one of the n cells. Thus, the number of assignments where the
first cell frequency exceeds k is multiplied by $\binom{n}{1} = n$ to give the total
number of assignments where any one of the cells has a frequency greater
than k. This number of assignments is unintentionally included in Formula
3, and so the corrected formula for S ≤ 2 is

$$\frac{(kS)!}{(kS-n)!\,n!} - \binom{n}{1}\frac{(kS-k)!}{(kS-k-n)!\,n!} \qquad [4]$$

It is obvious from the preceding argument that the correction term is necessary
only when the sum S exceeds 1; that is, when kS is greater than k. This
restriction is built into the second term through the impossibility of computing
factorials of negative numbers.

When S > 2, assignments where two cells simultaneously contain more
than k objects must be considered because such assignments would be included
more than once in the second term of Formula 4, since the multiplier $\binom{n}{1}$
presupposes that the assignments where one cell contains more than k objects
are different from those where another cell contains more than k objects.
When two or more cells simultaneously contain more than k objects, the
second term provides too large a subtraction, and a correction must be made
by adding back assignments subtracted more than once.

Assume that S ≤ 3. Then no more than two cells can simultaneously
contain more than k objects. The term to be appended to Formula 4 is
determined in this manner. Fill the first two cells with k objects each. Then
substitute kS - 2k for kS in Formula 3 to determine the number of
assignments of kS - 2k or fewer objects to the n cells with at least one object
assigned per cell. Multiply this number of assignments by $\binom{n}{2}$ to
determine the total number of assignments with any two cells containing more than
k objects. Each of these assignments is included twice in the second term of
Formula 4, and consequently Formula 4 overcorrects. So the formula
applicable for S ≤ 3 is

$$\frac{(kS)!}{(kS-n)!\,n!} - \binom{n}{1}\frac{(kS-k)!}{(kS-k-n)!\,n!} + \binom{n}{2}\frac{(kS-2k)!}{(kS-2k-n)!\,n!} \qquad [5]$$


Additional terms can be added, with the signs alternating, to provide
correction terms for three, four, or more cells simultaneously having frequencies
greater than k. Inasmuch as the sum S cannot exceed n, kS cannot exceed
kn, and so the number of required terms in any situation cannot exceed n.

Formula 5 gives the number of combinations of n ps that give a sum of S
or less. To get the probability of a sum S or less, Formula 5 is multiplied by
$1/k^n$, which is equivalent to dividing by $k^n$. For any specified n and S, the value
provided by Formula 5 divided by $k^n$ increases as the value of k increases,
approaching Formula 2 as a limit. Thus Formula 2 provides a conservative
estimate of the exact probability of getting such a small sum as S, never being
less than the value given by Formula 5 divided by $k^n$.
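
The relation between the counting formula and Formula 2 can be illustrated numerically. The sketch below makes several assumptions: the function names are hypothetical, the alternating series of Formula 5 is extended to as many terms as are needed, and the sum is supplied as an integer number of table steps kS. The inclusion-exclusion count is checked against direct enumeration and then divided by $k^n$ for two table sizes.

```python
from itertools import product
from math import comb, factorial

def count_bounded(ks, n, k):
    """Inclusion-exclusion count of n-tuples from 1..k with sum <= ks (Formula 5, extended)."""
    total, j = 0, 0
    # Stop once fewer than n objects remain: no assignment with one per cell exists.
    while j <= n and ks - j * k >= n:
        total += (-1) ** j * comb(n, j) * comb(ks - j * k, n)
        j += 1
    return total

def count_by_enumeration(ks, n, k):
    """Direct enumeration of the same tuples, for small cases only."""
    return sum(1 for t in product(range(1, k + 1), repeat=n) if sum(t) <= ks)

def formula_2(s, n):
    """Continuous-case probability of a sum as small as s."""
    total, j = 0.0, 0
    while j <= n and s - j > 0:
        total += (-1) ** j * comb(n, j) * (s - j) ** n
        j += 1
    return total / factorial(n)

n = 2
print(count_bounded(15, n, 10), count_by_enumeration(15, n, 10))
print(count_bounded(15, n, 10) / 10 ** n)      # k = 10,  S = 1.5
print(count_bounded(150, n, 100) / 100 ** n)   # k = 100, S = 1.5
print(formula_2(1.5, n))                       # limiting value
```

For two ps with S = 1.5 the discrete probabilities rise with k and remain below the value given by Formula 2.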

The above derivation assumes a uniform distribution of ps, whereas
experimenters usually must obtain their p values from distributions with unequal
intervals between successive values. The effect of the nonuniformity of intervals

An example will illustrate the effect of nonuniformity of intervals in a
distribution of ps. For a uniform distribution of 100 ps, .01, .02, ..., 1.00,
list all equally probable ordered combinations of n ps, and put the sum of the
n ps beside each combination. Then in all combinations containing a p of .06,
.07, .08, or .09, change those ps to .10. This is equivalent to forming a list of
equally probable combinations given a probability table of the kind described

sum as small as S will be equal to or less than the probability would be for a
uniform distribution, p1, p2, ..., pk, where 1/k is the smallest interval between
successive ps in the nonuniform distribution.


F. SUMMARY


The procedure commonly employed for combining probability values from 
two or more independent experiments is that which involves reference to 
chi-square tables to determine the significance of the set of probability values. 
This method, which determines the significance of the product of the obtained 
probability values, may be called the multiplicative method. An alternative 
method, called the additive method, is developed for determining the signifi- 
cance of a sum of probability values. The significance of a sum is determined 
directly from a formula and requires no reference to tables. The additive 
method is shown to be more powerful than the multiplicative method, having 
a greater probability than the multiplicative method of yielding significant 
results when there actually are treatment effects. 
INTRINSIC MOTIVATION AND VOCATIONAL CHOICE*


York University, Canada 


H. I. Day


A. INTRODUCTION 


The relationship between personality measures that attempt to relate to
vocational choice, and interest inventories that also claim predictive validity
for vocational success, merits investigation. Some, like Holland, have argued
that interest inventories are, in fact, personality inventories (6) and that
vocational preferences are signs of various personality traits. Recently Siess
and Jackson (8) compared scores on the Strong Vocational Interest Blank
with scores on the Personality Research Form to demonstrate the closeness of
relationship between them.

Day (2, 3) and Day and Maynes (4) developed the Ontario Test of Intrinsic
Motivation (OTIM), a personality questionnaire designed to measure the
trait of specific curiosity in adults. The definition of specific curiosity that
served as the basis for construction of the test was outlined by Day (3) and
describes the highly specific curious person as one who

1. reacts positively to new, strange, incongruous, or mysterious elements 
in his environment by moving towards them, by exploring or investigating 
them and/or by expressing interest in them, and 

2. tends to persist in such positive reaction, gaining information in the
process.

The test is designed to measure a total score of interest in 90 stimulus
events or concepts, and the verbal response of “interesting” is used as
descriptive of a situation that arouses curiosity. To some extent the use of the term
“interesting” in this regard is different from the common usage and follows
Berlyne’s statement that “ratings of interestingness reflect internal processes
closely related to arousal-raising stimulus properties” (1, p. 288).

The great diversity of findings in the literature on curiosity suggests that
specific curiosity is a multifaceted concept often task specific as well as response
specific. The Ontario Test of Intrinsic Motivation (OTIM) was therefore
constructed on the model of a cube similar to Guilford’s conceptualization of
intelligence (5) with the dimensions of each face interacting to form a number
of small cubes. One of the faces of the cube was designed to reflect area of
interest, and the 10 vocational interest areas of the Kuder Preference Record,
Form “C,” (7) were adopted. This allowed the identification of interest
level in the same areas as does the Kuder Preference Record Vocational
(KPR).

The differences between the two tests lie both in their theoretical background
and in their construction. The KPR Form “C” is a forced choice test designed 
to identify the directions of interest and indirectly the degrees of differences 
among the 10 alternatives. It is usual to find two or three above average peaks 
and about the same number of troughs. On the other hand, the subscales of 
the OTIM are independent, and a respondent can score above or below the 
population norm on all areas. Thus, not only the directions of his interest are 
indicated, but also his overall level of curiosity, and the level of interest in 
each area is compared with his population group. 

Furthermore, the OTIM was designed to measure only intrinsic motiva- 
tion—i.e., the interest aroused by the complexity, novelty, and ambiguity of 
the situations described in the items representing the various vocational fields. 
The KPR does not seek to pinpoint the reasons for a respondent’s interest and 
so may reflect high interest because of intrinsic attractions of the field itself 
(as represented in some of the items of the test) or by extrinsic rewards that 
the field may yield (such as prestige, money, etc.).


B. PROCEDURE


Ninety-six students, both male and female, in a number of Community
Colleges of Applied Arts and Technology in Metropolitan ... volunteered
to complete the OTIM and the KPR Form “C” as part of a large testing
program. The two tests were completed in the same session.
to what extent this interest is intrinsic. Thus we find that in the persuasive
area, the correlation between the two tests fails to reach statistical significance, 
suggesting that intrinsic reasons for entering this field of occupation are
minimal. On the other hand, in the artistic as well as in the computational area,
people may seek a great deal of intrinsic satisfaction, and this may be reflected 
in the much higher correlations obtained. 

The results of this study suggest that using the KPR and OTIM together 
may not only identify the directions of interest, but may also give some indica- 
tion of overall level of interest in general, the level of interest in each voca- 
tional area compared to the general population, and the proportion of intrinsic 
component in any vocational interest. 


D. SUMMARY


Ninety-six Ss completed both the Kuder Preference Record (Vocational) 
and the Ontario Test of Intrinsic Motivation. Correlations among the sub- 
scales of the two tests suggest that certain vocational interests may be
intrinsically motivating, while others are attractive because of extrinsic gain.
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PROSE LEARNING: EFFECTS OF QUESTION POSITION 
AND INFORMATIONAL LOAD INTERACTIONS ON 
RETENTION OF LOW SIGNAL VALUE INFORMATION* 


The University of Wisconsin-Madison and Case Western Reserve University 1 
ROGER R. EISCHENS, A. J. H. GAITE, AND V. K. KUMAR 


A. INTRODUCTION 


Research on the learning and retention of meaningful material is now a 
well established area of investigation and study. Several studies (2, 3, 4, 5, 8, 
9) have shown that adjunct questions inserted in textual material facilitate 
the retention of both information specific to the questions and nonquestioned 
specific information. While the most recent of these studies (8) has shown 
that delivery of adjunct questions by a teacher is more effective in promoting 
learning from written materials than written adjunct questions imbedded in 
the text, it seems clear that students and others will continue for some time to 
be asked to learn from texts alone, and the admonition to “Read and learn” 
will continue to be part of many teachers’ instructional repertoires. Thus, 
further investigation into the facilitative effect of questions, and their use as 
a means of structuring written materials to improve learning, seems justified. 

This study was designed to throw light on the problem of where questions 
might most profitably be placed in the text to further learning and recall, for 
clearly this decision is of prime importance in the development of written 
instructional materials. More specifically, the study was designed to ascertain 
if adjunct questions would facilitate the recall of information that was known 
to have a low signal value. Signal value refers to the capacity of a unit of 
information to stimulate the necessary information-processing activities to be 
stored in long term memory. (Unit is here used to refer to any combination of 
words that contains an idea or fact, rather than to a specific linguistic unit.) 
Thus, when there are necessarily limits on the number of questions that can be 
placed in a piece of written material, it would seem wisest to select questions 
relating to ideas or facts judged to be important yet known to have low signal 
value. In addition to investigating the effect of questions on this kind of infor- 
mation, the study also investigated the differing merits of three question posi- 
tions relative to the specific low signal value information. 


* Received in the Editorial Office on January 6, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
1 The third author is at Case Western Reserve University. 

B. METHOD 
1. Subjects 


The subjects were 200 students enrolled in introductory educational psy- 
chology courses at the University of Wisconsin. All subjects were volunteers. 


2. Materials 


A 1354 word passage was used. The text concerned the historical develop- 
ment of the province of Quebec in Canada and in particular traced the growth 
and development of the nationalist movement within that province. The pas- 
sage consisted of nine pages with approximately eight sentences per page. 
Following the first part of the study (see Design and Procedure section) each 
set of questions was typed on a separate page, and these pages were inserted 
into the passage. 


3. Design and Procedure 


The subjects were randomly assigned to four groups. Pretesting of the 
materials, which was necessary to ascertain which information was not recalled 
by subjects and could thus be designated low signal value information, was 
done with subjects in Group 1. Subjects were given the passage and instructed 
to read and learn it to the best of their ability in the time allotted (30 min- 
utes). Subjects were told that they would be tested on the information in the 
passage. Forty-eight hours after reading the passage, subjects in Group 1 were 
tested with a 63-item recall measure. Following this test, the questions that 
fewer than 25 percent of the subjects in Group 1 had answered correctly and 
that referred to information found on page 1, 4, or 7 of the passage 
were identified. From this select group of questions, which related 
to low signal value information, a further random selection was made so that 
a total of nine questions was selected: three referring to information contained 
on page 1 of the passage, three referring to information on page 4, and 
three referring to information on page 7. 
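The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the dictionary keys, and the pretest proportions are hypothetical, and the even spread of the 63 items over nine pages is assumed, not reported.

```python
import random


def select_low_signal_items(pretest, pages=(1, 4, 7), cutoff=0.25,
                            per_page=3, seed=0):
    """Pick `per_page` random items per target page among those that fewer
    than `cutoff` of the pretest group answered correctly.

    `pretest` is a list of dicts with the hypothetical keys
    'item', 'page', and 'prop_correct'.
    """
    rng = random.Random(seed)
    chosen = {}
    for page in pages:
        pool = [q["item"] for q in pretest
                if q["page"] == page and q["prop_correct"] < cutoff]
        if len(pool) < per_page:
            raise ValueError(f"page {page}: only {len(pool)} eligible items")
        chosen[page] = sorted(rng.sample(pool, per_page))
    return chosen


# Made-up pretest data: 63 items, seven per page (an assumption).
pretest = [
    {"item": i, "page": (i - 1) // 7 + 1, "prop_correct": (i % 7) * 0.06}
    for i in range(1, 64)
]
chosen = select_low_signal_items(pretest)
```

The result maps each target page to three item numbers, giving the nine questions used for the experimental groups.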


Each set of three questions was then typed on a separate page, and these 


TABLE 1 
POSITION OF QUESTIONS IN PASSAGE FOR GROUPS 2, 3, AND 4 

Group 2              Group 3              Group 4 
Questions close      Questions medium     Questions distant 
Text page 1          Text page 1          Text page 1 
Question page 1      Text page 2          Text page 2 
Text page 2          Question page 1      Text page 3 
Text page 3          Text page 3          Question page 1 
Text page 4          Text page 4          Text page 4 
Question page 4      Text page 5          Text page 5 
Text page 5          Question page 4      Text page 6 
Text page 6          Text page 6          Question page 4 
Text page 7          Text page 7          Text page 7 
Question page 7      Text page 8          Text page 8 
Text page 8          Question page 7      Text page 9 
Text page 9          Text page 9          Question page 7 
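The three arrangements in Table 1 differ only in how many text pages separate a question page from the page it refers to. A minimal sketch, assuming a hypothetical helper `booklet_order` and a `distance` of 0, 1, or 2 intervening text pages for Groups 2, 3, and 4 respectively:

```python
def booklet_order(distance, n_text_pages=9, question_pages=(1, 4, 7)):
    """Return the page sequence of one booklet.

    `distance` is the number of text pages read between a target page and
    the question page that refers to it (0 = close, 1 = medium, 2 = distant).
    """
    # Map "text page after which a question page is inserted" -> target page.
    insert_after = {target + distance: target for target in question_pages}
    order = []
    for page in range(1, n_text_pages + 1):
        order.append(f"Text page {page}")
        if page in insert_after:
            order.append(f"Question page {insert_after[page]}")
    return order


group2 = booklet_order(0)
group3 = booklet_order(1)
group4 = booklet_order(2)
```

Each booklet has twelve pages, nine of text and three of questions; only the insertion points move.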


post questions to facilitate retention was affected by their distance from the 
question-specific information. 

Subjects in Groups 2, 3, and 4 were presented with the passage (each group 
receiving a version with the appropriate arrangement of questions) and 
instructed to read and learn the passage with a view to being tested upon it 
in the future. Subjects were allowed 30 minutes to read the passage. Forty- 
eight hours later subjects in each group returned and were given the nine-item 
recall test for which there was no time limit. All procedures were carried out 
with each group separately; thus each group came together twice—once to 
read and learn from the prose passage, and once to take the criterion test. 


C. RESULTS 


The results were analysed with a one-way analysis of variance to 
compare the mean scores of the groups on the retention test. The analysis 
of variance was followed by post hoc pair-wise comparison of means 
using Scheffé’s method. 

The mean scores of each group on the nine-item recall test were as follows: 
Group 1, .40; Group 2, 3.56; Group 3, 2.52; and Group 4, 2.38. 

Table 2 presents the results of the analysis of variance, which were significant 
beyond the .01 level. 

The post hoc comparison of means using Scheffé’s method showed that the 
experimental Groups 2, 3, and 4 were significantly different from the control, 
Group 1 (p < .01). Further, Group 2 was significantly different from Group 
4 (p < .01); Groups 3 and 4 were not significantly different; Group 2 was 
significantly different from Group 3 only at the .10 level. 
TABLE 2 
ANALYSIS OF VARIANCE TABLE FOR GROUPS 1 TO 4 

                   Sum of      Degrees of 
Source             squares     freedom      Mean square     F 
Between groups     302.46        3          100.82          49.73* 
Within groups      405.46      200            2.03 
Total              707.92      203 

* p < .01. 
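The entries of Table 2 are linked by simple arithmetic: each mean square is a sum of squares divided by its degrees of freedom, and F is the ratio of the two mean squares. The sketch below recomputes those quantities from the sums of squares and degrees of freedom exactly as printed; the function name is hypothetical.

```python
def anova_summary(ss_between, df_between, ss_within, df_within):
    """Recompute mean squares, F, and totals for a one-way ANOVA table."""
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "ms_between": ms_between,
        "ms_within": ms_within,
        "F": ms_between / ms_within,
        "ss_total": ss_between + ss_within,
        "df_total": df_between + df_within,
    }


# Values taken from Table 2 as printed.
table2 = anova_summary(302.46, 3, 405.46, 200)
for name, value in table2.items():
    print(f"{name}: {value:.2f}")
```

Rounded to two decimals, the recomputed values agree with the printed mean squares, total sum of squares, and F ratio.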


Table 3 presents the results in such a way as to allow the comparison of the 
responses of each group to each question in the criterion measure. 

The results make it clear that the presence of questions in the text can 
facilitate the learning and recall of low signal value material, i.e., material 
which, in a pretest situation using subjects randomly selected from the 
main experimental population, is demonstrated to be lacking the characteristics 
necessary to be stored in, or retrieved from, long term memory. Further, 
post questions have a greater facilitating effect upon learning when they are 
placed close to the information to which they refer than when they are 
removed from it. 


D. DISCUSSION 


The results may be explained in terms of the review effect of the questions 
(4). Thus the questions occurring after the information to which they refer 
stimulate subjects to review that information. This review process would seem 
to be inhibited if much additional material has to be processed between the 
reading of information and the occurrence of a question referring to it. The 
results then are in agreement with those found by Frase (5) who found that 


TABLE 3 
PERCENTAGE OF SUBJECTS RECALLING INDIVIDUAL ITEMS 
OF TEST BY GROUP 

Question    Group 1    Group 2            Group 3             Group 4 
number      Control    Questions close    Questions medium    Questions distant 
1           20.2       65.1               ...                 ... 
2           23.1       62.3               ...                 ... 
3            7.2       53.2               46.2                ... 
4           ...        37.1               20.4                ... 
5            0         13.4               91                  ... 
6           13         44.8               34.4                14.3 
7           13         37.1               13.4                ... 
8           13         27.4               25.3                ... 
9           13         49                  0                  ... 
post questions have a greater facilitating effect on learning when they come 
close to the material to which they relate. 

These results are in agreement with previous findings which have demon- 
strated the facilitating effect of post questions inserted in the text (1, 3, 4, 5, 
7, 8). However, the present study extends these findings by demonstrating that 
questions will facilitate the recall of material which it is known would not have been 
recalled had the questions been omitted. Further, as the responses of the con- 
trol and experimental groups to individual questions (Table 3) make quite 
clear, questions placed close to the information to which they refer have a 
greater facilitating effect than questions placed some distance from the infor- 
mation. Allen (1) in a study investigating, in part, the effect of questions 
found that questions given at the end of an extended passage affect learning 
positively. The results of this study do not contradict Allen’s findings. Indeed, 
the experimental condition that comes closest to matching that of Allen’s 
study (Group 4) resulted in questions having a facilitating effect on learning. 
However, the results for Group 2 reinforce the suggestion that while one may 
facilitate learning by placing questions at the end of a long passage, they are 
better placed in the text closely following the information to which they refer 
(5). 

Written texts and materials seem likely to be with us for some time. The 
results of this study emphasize and confirm the importance of pretesting writ- 
ten materials on the population for which they are intended, before attempts 
are made to manipulate the text to the learner’s advantage (6). Questions 
facilitate learning; but the person preparing the materials must know where 
to place the questions in the text. It would make little sense to scatter ques- 
tions at random, or to insert a question which refers to something that in all 
probability could be learned and recalled without its aid. The results of this 
study give some confidence to the suggestion that in the preparation of written 
materials, the way to proceed is to pretest the materials and thus learn which 
of the information and ideas, judged to be important, has such low signal 
value that it is not being recalled; questions referring to this category of 
information would then be inserted into the text close to the information. 
Thus, it would seem that the usual practice of placing questions at the end 
of a piece of text is not the best way to use questions. 


E. Summary 


One-way analysis of variance with post hoc comparisons was used to study 
the effects on the learning and retention of low signal value information of 
questions and question placement within text. Two hundred college students 
were randomly assigned to four groups. Pretesting the materials (a 1354-word 
passage) with the control group identified material that was of low signal 
value and thus was not being learned or remembered by Ss. The three experi- 
mental groups received passages containing questions that related to the low 
signal value information. The results indicate that questions facilitate learn- 
ing, but that this effect is greatest when the questions follow closely the infor- 
mation to which they refer. The probability that the extent to which questions 
can be used to facilitate learning and retention of written materials depends 
upon pretesting the materials is discussed. 
VERBAL OPERANT CONDITIONING AS A MEASURE OF 
VALUE STRENGTH: II. THE USE OF VALUES 
AS REINFORCED RESPONSE CLASSES* 1 


The University of Georgia 


EDITH WEISSKOPF-JOELSON AND WAYNE MATRURY 


A. INTRODUCTION 


The present study pertains to a research project designed to use verbal 
operant conditioning (VOC) as an objective and quantitative tool to measure 
the strength of values; thus, the project attempts to provide a rapprochement 
between humanistic and experimental psychology. Previous studies have sug- 
gested that verbal stimuli pertaining to personal values that are readily en- 
dorsed by subjects (high values) tend to increase the frequency of verbal 
responses more than verbal stimuli pertaining to personal values that are not 
readily endorsed (low values); in those studies verbal stimuli pertaining to 
values were used as reinforcers (e.g., 12). 

In contrast, the present study used verbal stimuli pertaining to values as 
reinforced response classes rather than as reinforcers. It was hypothesized that 
groups of subjects (Ss) reinforced for responses pertaining to high values 
would condition at a higher level than would groups of Ss reinforced for 
responses pertaining to low values, when the two groups were compared to 
each other and to control groups. 


B. METHOD 
1. Selection of Values 

The values used in this study were originally isolated by Weisskopf-Joelson, 
Alleman, Anderson, and Katkin (13). In that study over 500 autobiographies 
of undergraduate Ss were analyzed, and nine values obtained; a questionnaire 
was developed to assess the strength of these values. Katkin and Weisskopf- 
Joelson (5) improved this questionnaire; they devised a forced choice test 
composed of 72 pairs of statements pertaining to the nine values. Weisskopf- 
Joelson et al. (13) and Katkin and Weisskopf-Joelson (5) found that state- 
ments pertaining to Self-Interpretation and Self-Actualization were chosen 

* Received in the Editorial Office on January 12, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
1 This study is based on a Master’s thesis submitted to the University of Georgia by 
the junior author in partial fulfillment of the requirements for the M.S. degree. The 
study was directed by the senior author. 

with significantly greater frequency than statements pertaining to any of the 
other values. Statements pertaining to Conformity were chosen with signifi- 
cantly less frequency than statements pertaining to any of the other values. 
This was the case in several other unpublished investigations, using as Ss stu- 
dents from the same university as the Ss of the present study. Self-Interpreta- 
tion and Self-Actualization were higher values than Conformity not only for 
the total samples but for almost every individual S. Therefore, Self-Interpre- 
tation and Self-Actualization were used to represent the high values in the 
present study, and Conformity was used to represent a low value. Ten state- 
ments that embody Self-Interpretation or Self-Actualization, and 10 state- 
ments that embody Conformity were taken from the test described above. 
The emissions of these statements were used as reinforced response classes. 
Five judges rated the statements with an agreement of 96% as reflecting 
Self-Interpretation, Self-Actualization, and Conformity, respectively (13, p. 
302). 
Typical of the high value statements are the following: 


1. I developed many of my neglected assets (Self-Actualization). 


2. I solved many problems by understanding my feelings (Self-Interpre- 
tation). 


Typical of the low value statements are the following: 


1. I tried to be like other people my age (Conformity). 
2. I was influenced by the values of my associates (Conformity). 


2. Subjects 


Ss were 100 students enrolled in Introductory Psychology at The University 
of Georgia. They volunteered to participate in the experiment, but 
they received credit for participation. Fifty Ss were freshmen or sophomores, 
and over 90 percent were 21 years of age or younger. 


3. Materials 


The materials consisted of 120 5 by 8 inch filing cards. On each card there 
were two statements of the kind listed above, one reflecting a high value and 
one reflecting a low value. All statements were of approximately the same 
length. The statements were randomly selected for each card with the 
restrictions that there was one statement reflecting a high value and one 
statement reflecting a low value on each card, and that each statement was used 
an equal number of times. The personal pronoun “I” was replaced by a 
vertical column of three personal pronouns in the nominative case from which 
S was instructed to select one. Similarly, the possessive pronoun “my” was 
replaced by a vertical column of three personal pronouns in the possessive 
case from which S was instructed to select one. 

The upper part of a typical card carried, for example, the pronouns “he, I, 
they” in the left-hand column and the corresponding possessives, “his, my, 
their,” in the right-hand column, followed by the sentence: ____ developed 
many of ____ neglected assets. The lower part of the card carried, for example, 
“we, you, she” in the left-hand column and “our, your, her” in the right-hand 
column, followed by the sentence: ____ tried to be like other people ____ age. 

High value statements and low value statements appeared in the upper and 
lower location on the card an equal number of times. Pronouns in the left-hand 
column were randomly selected for each card from the list “I, we, you, 
he, she, they” and were arranged in random order. The pronouns in the possessive 
case corresponded to the pronouns in the nominative case on each card. 
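The balancing constraints on the card deck (equal use of each statement, equal use of the upper and lower positions, matching possessives) can be illustrated with a short sketch. Everything named here is hypothetical: the function `build_cards`, the placeholder statement texts, and the assumption that the three pronouns are drawn independently for each statement on a card.

```python
import random
from collections import Counter

# Nominative pronouns and their corresponding possessives.
PRONOUNS = {"I": "my", "we": "our", "you": "your",
            "he": "his", "she": "her", "they": "their"}


def build_cards(high, low, n_cards=120, seed=0):
    """Build a deck in which every statement is used equally often and
    high value statements appear in the upper position on half the cards."""
    if n_cards % 2 or n_cards % len(high) or n_cards % len(low):
        raise ValueError("n_cards must allow equal use of statements and positions")
    rng = random.Random(seed)
    highs = list(high) * (n_cards // len(high))
    lows = list(low) * (n_cards // len(low))
    rng.shuffle(highs)
    rng.shuffle(lows)
    high_on_top = [True] * (n_cards // 2) + [False] * (n_cards // 2)
    rng.shuffle(high_on_top)

    cards = []
    for h, l, top in zip(highs, lows, high_on_top):
        pair = (("high", h), ("low", l)) if top else (("low", l), ("high", h))
        halves = []
        for value, text in pair:
            nominative = rng.sample(list(PRONOUNS), 3)  # three pronouns, random order
            halves.append({
                "value": value,
                "statement": text,
                "nominative": nominative,
                "possessive": [PRONOUNS[p] for p in nominative],
            })
        cards.append(halves)
    return cards


high = [f"high value statement {i}" for i in range(1, 11)]  # placeholders
low = [f"low value statement {i}" for i in range(1, 11)]    # placeholders
cards = build_cards(high, low)
usage = Counter(half["statement"] for card in cards for half in card)
```

With 10 statements of each kind and 120 cards, each statement appears 12 times and each kind occupies the upper position on 60 cards.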


4. Procedure 


The oral instructions given to Ss were as follows: 

“I am going to show you some cards. On these cards you will find some 
words and some phrases. Read both of the phrases aloud to me and then 
choose one of the pronouns to form one complete sentence. When selecting 
a pronoun, always use the possessive pronoun of the same person as the 
pronoun originally selected.” (Two examples were then read by E.) 

Ss were divided into five groups of 20 Ss each, 10 males and 10 females: 
Experimental Group I, Experimental Group II, Control Group I, Control 
Group II, and Control Group III. Experimental Group I was reinforced 
for the selection of statements reflecting high values by “good,” “fine,” or 
“O.K.,” delivered orally in random order by E (the junior author). Experimental 
Group II was reinforced in the same manner for the selection of 
statements reflecting a low value. Control Group I was yoked to Experi- 
mental Group I; Ss in that group received the same reinforcement during 
the same trials as Ss in Experimental Group I: i.e., Ss in Experimental Group 
I were paired with Ss in Control Group I, and each member of each pair 
received the same reinforcements at the same trials as the other member of 
the same pair. Experimental Group I was contingently reinforced after the 
emission of a high value statement, while Control Group I was noncontin- 
gently reinforced after the emission of either a high value statement or a low 
value statement. Males were yoked to males only, and females were yoked 
to females only. Control Group II was yoked in the same way to Experi- 
mental Group II. Control Group III received no reinforcements. 
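The yoking logic can be made concrete with a small sketch. It assumes, hypothetically, that each S's choices are recorded as 'high' or 'low' per trial and that reinforced trials are given as 0-based indexes; the function names and the six-trial example are illustrative only.

```python
def yoke_reinforcement(exp_choices, reinforced_class, reinforced_trials):
    """Per-trial schedule earned by the experimental S.

    The yoked control S receives this same schedule, whatever
    statement the control S happens to emit on each trial.
    """
    return [
        trial in reinforced_trials and choice == reinforced_class
        for trial, choice in enumerate(exp_choices)
    ]


def contingency_rate(schedule, own_choices, reinforced_class):
    """Share of delivered reinforcements that followed the reinforced class."""
    delivered = [choice for got, choice in zip(schedule, own_choices) if got]
    if not delivered:
        return 0.0
    return sum(choice == reinforced_class for choice in delivered) / len(delivered)


# Made-up six-trial example; trials 2 to 5 belong to the reinforcement phase.
exp = ["high", "low", "high", "high", "low", "high"]
ctl = ["low", "low", "high", "low", "high", "high"]
schedule = yoke_reinforcement(exp, "high", reinforced_trials={2, 3, 4, 5})
exp_rate = contingency_rate(schedule, exp, "high")
ctl_rate = contingency_rate(schedule, ctl, "high")
```

For the experimental S every reinforcement follows a high value statement, whereas for the yoked S the same reinforcements fall on either kind of statement, which is what makes the control noncontingent.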

It is, of course, possible that E reinforced Experimental Group I with 
more “enthusiasm” than the other groups (7). However, E tried to avoid 
variations in tone when reinforcing, as well as other linguistic, paralinguistic, 
and kinesic cues, even though such variables were not systematically con- 
trolled (1). 

Reinforcement in the experimental groups was contingent on the selec- 
tion of high value statements or low value statements only. In no case was 
it contingent on the emission of specific pronouns; the pronoun selection task 
was used only to obfuscate awareness of the actual reinforcement contingencies. 
(No difference was detected between the pronouns selected for high value 
statements and the pronouns selected for low value statements.) 

During the first 24 trials Ss were not reinforced, in order to determine the 
operant response level. During the next 72 trials all groups except Control 


Group III were reinforced as described above. The last 24 trials were used 
for the purpose of extinction. 

Before the experiment ... lasting 10 minutes; Solley and Long ... 
this kind facilitates conditioning. This ..., as well as the experiment 
proper, was conducted in a 10 × 8 × 10 foot cubicle. S and 
E sat at a table facing each other. 
C. RESULTS 


Figure 1 shows that Experimental Group I, reinforced for the emission of 
high value statements was superior to Experimental Group II, reinforced for 
the emission of low value statements. 


[Figure 1: line graph. Vertical axis: mean number of reinforced value 
statements. Horizontal axis: trial blocks 1 to 10 (12 trials each). Curves: 
high value experimental group and low value experimental group.] 

FIGURE 1 
MEAN NUMBER OF REINFORCED VALUE STATEMENTS PER SUBJECT FOR 
EXPERIMENTAL GROUPS I AND II 
Figure 2 shows that Experimental Group I, reinforced for the emission of 
high value statements was superior to yoked Control Group I, reinforced non- 
contingently during the same trials as Experimental Group I, and to Control 
Group III which received no reinforcement. 

In contrast, Figure 3 shows that Experimental Group II, reinforced for 
the emission of low value statements, can hardly be distinguished from yoked 
Control Group II, reinforced noncontingently during the same trials as Ex- 
perimental Group II. Figure 3 also shows that Experimental Group II 
remains close to Control Group III which received no reinforcement. 

The data shown in the figures support the hypothesis on which this study 
is based: namely, that reinforced response classes pertaining to high values will 
lead to higher levels of conditioning than reinforced response classes pertaining 
to low values, when experimental groups are compared to each other and to 
control groups. It appears that the frequency of responses pertaining to a very 
low value cannot be increased by reinforcement through mild affirmatory 
words. 

The statistical analysis supports the impression given by the figures. All 
statistical analyses were performed twice. The variable analyzed for each of 
the reinforced groups was the number of value statements falling into the rein- 
forced category. For the nonreinforced group (Control Group III) one 
analysis was based on the number of high value statements, and the other 
analysis on the number of low value statements. 

Because homogeneity of covariance could not be assumed in this experimental 
design, a two-way fixed effects analysis of variance model (Sex 
× Groups) was used (4) for each of the 10 trial blocks. (A trial 
block consisted of 12 trials.) 
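The session structure described above (24 unreinforced trials, 72 reinforced trials, 24 extinction trials, scored in blocks of 12) can be laid out as a short sketch. The function names and the 1-based trial numbering are assumptions made for illustration.

```python
PHASES = (("baseline", 24), ("conditioning", 72), ("extinction", 24))
BLOCK_SIZE = 12


def phase_of(trial):
    """Phase of a 1-based trial number."""
    if trial < 1:
        raise ValueError("trials are numbered from 1")
    upper = 0
    for name, length in PHASES:
        upper += length
        if trial <= upper:
            return name
    raise ValueError("trial beyond end of session")


def block_of(trial):
    """Trial block (1-based) containing a 1-based trial number."""
    return (trial - 1) // BLOCK_SIZE + 1


def block_counts(choices, target):
    """Number of `target` statements emitted in each trial block."""
    counts = [0] * (block_of(len(choices)) if choices else 0)
    for trial, choice in enumerate(choices, start=1):
        if choice == target:
            counts[block_of(trial) - 1] += 1
    return counts


# Which blocks fall in which phase.
blocks_by_phase = {}
for trial in range(1, 121):
    blocks_by_phase.setdefault(phase_of(trial), set()).add(block_of(trial))
```

Under this layout Blocks 1 and 2 give the operant level, Blocks 3 to 8 cover reinforcement, and Blocks 9 and 10 cover extinction.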


With the exception of an interaction effect between Sex and Groups ob- 


I) and Control Group III (not reinforced) 
Block 10 (p < .05 to p < .01). In contrast, 
indicates that the responses of Experimental Group I (reinforced for the 
emission of high value statements) were resistant to extinction. Even during 
the extinction period Experimental Group I remained superior to Experimental 
Group II, Control Group I, and Control Group III (p < .05). 
Perhaps the results of this study are tangentially related to Cushing’s (2) 
results; for example, showing that the frequency of the response “like” (“I 
like the person on this photograph”) could be increased by reinforcement, but 
the frequency of the response “dislike” could not be increased. However, 
Friesen and Ekman (3) were unable to replicate Cushing’s results. That 
“congruence” between Ss’ personality and the reinforced response class affects 
the rate of conditioning is also supported by Mosher’s (6) results which 
demonstrate that Ss’ guilt feelings facilitate conditioning when emission of 
sentences with guilt content are reinforced, but inhibit the rate of conditioning 
when sentences of hostile content are reinforced. Another example of a per- 
tinent study is the investigation by Steinberg and Oakes (10) who showed 
that “good” and “hard” words (according to the Semantic Differential) can 
be conditioned at a higher level than “bad” and “soft” words. While the 
Steinberg and Oakes study, in agreement with the present study, shows 
different rates of conditioning for different reinforced response classes, Weide 
(11) demonstrated that there is very little evidence that such differences 
exist with regard to benevolent, malevolent, and neutral words as reinforced 
response classes. 

If further studies corroborate the hypothesis of the present study, as 
well as the hypothesis corroborated in previous studies (e.g., 12) that rein- 
forcers reflecting high values are more potent than reinforcers reflecting low 
values, then verbal operant conditioning could be used as a measure of value 
strength. Such measurement would combine the rigorous methods of experimental 
psychology with the relevance and significance of humanistic psychology. 


E. SUMMARY 


The hypothesis is that reinforced response classes pertaining to high values 
lead to a higher level of conditioning in VOC than reinforced response classes 
pertaining to low values. One hundred undergraduate college students 
were evenly divided into five groups. Experimental Group I was reinforced 
for the selection of high value statements, and Experimental Group II for 
the selection of low value statements; Control Group I was yoked to Experimental 
Group I, Control Group II was yoked to Experimental Group II, 
and Control Group III received no reinforcement. An ANOV and a Newman- 
Keuls analysis showed that Experimental Group I conditioned on a signifi- 
cantly higher level than Experimental Group II, Control Group I, and Con- 
trol Group III (p < .05 to p < .01). In contrast, there was no significant 
difference between the conditioning level of Experimental Group II and that 
of Control Groups II and III. Thus, the hypothesis was confirmed. The 
possibility was suggested that VOC could be used as a measurement of value 
strength. 
THE EFFECT OF NEED ACHIEVEMENT ON VERBAL 
OPERANT CONDITIONING USING ACHIEVEMENT- 
RELATED STATEMENTS AS REINFORCERS* * * 


The University of Georgia 


EDITH WEISSKOPF-JOELSON AND DOUGLAS O. McKEOWN 


A. INTRODUCTION 


Most previous studies on verbal operant conditioning (VOC) limit themselves 
to assessing the relative efficacy as reinforcers of different ways of expressing 
interest, approval, or agreement with regard to the subject’s (S’s) 
responses. Favorite reinforcers in the past have been “Mm-hm” (5); “Good” 
(6); “Yes, You’re right” (13); and the like. But there must be many other 
types of stimuli that have a reinforcing quality. 

Relatively complex reinforcers were employed in a few additional studies. 
For example, Adams, Noblin, Butler, and Timmons (2) were able to in- 
crease the emission of first person pronouns by psychoanalytic interpretations 
of the sentences that the Ss emitted. Adams and Frye (1) used clinical interpretations, 
reflection of feelings, and hostile statements as reinforcers. Clinical 
interpretations increased the number of personal references, while reflections 
of feelings and hostile statements decreased them.3 

Weisskopf-Joelson and Heiney (14) used statements pertaining to values of 
high and low strength with regard to their Ss, predicting that high values 
would be more potent reinforcers than low values. 

The present study is related in two ways to the study by Weisskopf-Joelson 
and Heiney (14) mentioned above: I. The reinforcers used were relatively 
complex. II. The authors predicted that the outcome of the reinforcement 
procedure would depend on the relationship between the reinforcers and Ss’ 
characteristics. A similar prediction was made by Golightly and Byrne (4). 
Their prediction, as well as the one by Weisskopf-Joelson and Heiney (14), 
was supported by the data. 


* Received in the Editorial Office on January 12, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 

1 This paper is based on a thesis by the junior author submitted to the Graduate 
3 See also Hekmat, H. Reinforcing values of interpretations and reflections in a 
quasi-therapeutic interview. J. Abn. Psychol., 71, 17, 25-31. 
The purpose of the present study was to test the hypothesis that there is a 
relationship between Ss’ needs and their responses to need-related reinforcers. 

Ss with high need Achievement (n Achievement) were hypothesized to perform 
at a higher level in a Taffel-type (11) VOC situation when Achievement 
statements are used as reinforcers than when Antiachievement statements are 
used as reinforcers. Achievement statements are statements that encourage Ss 
to strive for Achievement, while Antiachievement statements are statements 
that “play down” the value of Achievement in favor of a satisfactory life 
without regard for Achievement. 

In contrast, Ss with low n Achievement were hypothesized to perform at 
a higher level when Antiachievement statements are used as reinforcers than 
when Achievement statements are used. 

A random sample of Ss for whom n Achievement can be expected to be of 
intermediate mean strength, rather than of extremely high or extremely low 
strength, was hypothesized to perform at the same level with either Achievement 
statements or Antiachievement statements as reinforcers. 

All hypotheses are based on comparisons between experimental groups, between 
an experimental and a control group, and on intragroup comparisons. 

In the present investigation n Achievement was used instead of any other 
need because Atkinson (3) and McClelland, Atkinson, Clark, and Lowell (7) 
have developed relatively extensive methods to assess the strength of this need. 

If it could be shown that there is a relationship between the strength of a 
need and Ss’ responses to reinforcers related to this need, such findings could 
lead to an additional way of measuring the strength of needs. 


B. METHOD 
1. Subjects 


One hundred forty-one male undergraduate students served as Ss, fulfilling 
an Introductory Psychology course requirement to participate in an experiment. 
Their mean age was 19.3 years, and their ages ranged from 18 to 33. 
studies by McClelland et al. (7, pp. 100-101) and the Thematic Apperception 
Test (8) Pictures 7BM and 8BM. These pictures were selected because they 
are most frequently used to assess n Achievement and were employed in the 
original validation study of n Achievement measurement (3). 

The pictures were presented to Ss in groups of 25 or fewer by the experimenter 
(E, the junior author). Ss were instructed to write a story about each 
picture, telling what the depicted situation represented, what the characters 
were doing and thinking, and what the outcome would be. The written instructions, 
as well as the detailed procedure of presentation, are described in Atkinson 
(3, pp. 836-837). 

The stories were then scored for n Achievement by the junior author and 
another graduate student on the basis of the instructions given by Atkinson 
(3, Chapter 12). The interjudge reliability between the two judges was .91 
according to a Spearman rank order correlation coefficient corrected for ties. 
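
The interjudge reliability described above is a Spearman rank order correlation in which tied scores share an averaged rank. A minimal sketch of that computation follows. The two score lists are hypothetical and are not the study's data, and the function names are assumptions made for this illustration.

```python
from math import sqrt


def average_ranks(scores):
    """Rank scores from 1..n, giving tied scores the mean of their ranks."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of tied values that starts at position i.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation computed on average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Hypothetical n Achievement scores given by two judges to six stories.
judge_a = [4, 7, 7, 1, 9, 3]
judge_b = [5, 6, 8, 1, 9, 2]
rho = spearman_rho(judge_a, judge_b)
```

Computing the Pearson correlation on average ranks is one common way of handling ties; whether the authors used this form or an explicit tie-correction formula is not stated in the text.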

Of the 141 Ss only 111 were assessed for n Achievement. The 30 remaining 
Ss were considered random with regard to their mean n Achievement; they 
were randomly assigned to Control Groups 1, 2, and 3. Twenty Ss whose n
Achievement scores were highest were divided randomly into Experimental 
Groups 1 and 2. Twenty Ss whose n Achievement scores were lowest were 
divided randomly into Experimental Groups 3 and 4. Each group consisted 
of 10 Ss. After n Achievement had been assessed, the remaining 71 Ss with
intermediate Achievement scores were not used for the purpose of this experiment.
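
The allocation just described (20 highest and 20 lowest scorers split at random into pairs of experimental groups, 30 unassessed Ss split at random into three control groups) can be sketched as follows. This is an illustration under stated assumptions: the scores are randomly generated stand-ins, and the function name, the seed, and the group labels used as dictionary keys are hypothetical.

```python
import random


def assign_groups(scores, unassessed_ids, seed=0):
    """Split Ss into four experimental and three control groups of 10.

    scores: dict mapping S id -> n Achievement score (the assessed Ss).
    unassessed_ids: ids of the 30 Ss who were not assessed.
    """
    rng = random.Random(seed)
    ranked = sorted(scores, key=scores.get, reverse=True)
    high, low = ranked[:20], ranked[-20:]   # extreme scorers only
    controls = list(unassessed_ids)
    for pool in (high, low, controls):
        rng.shuffle(pool)                   # random division within each pool
    return {
        "Experimental 1": high[:10],
        "Experimental 2": high[10:],
        "Experimental 3": low[:10],
        "Experimental 4": low[10:],
        "Control 1": controls[:10],
        "Control 2": controls[10:20],
        "Control 3": controls[20:30],
    }


# Hypothetical scores for 111 assessed Ss (ids 0-110); ids 111-140 are unassessed.
demo_rng = random.Random(1)
demo_scores = {s: demo_rng.randint(-6, 30) for s in range(111)}
groups = assign_groups(demo_scores, range(111, 141))
```

The 71 Ss with intermediate scores simply never enter any pool, which matches their exclusion from the experiment.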


3. Materials and Procedure II 


The room in which the experiment described below was conducted measured
10 × 8 × 10 feet. S and E sat across from each other, separated by a table.
Before the experiment began, S and E engaged in a casual conversation. This
part of the experiment was included because Solley and Long (9) found
that a procedure of this kind facilitates conditioning.

Ss were conditioned by a VOC Taffel-type (11) sentence construction 
procedure. Each S was presented with 120 cards, one at a time, each of which
showed three personal pronouns and three verbs. The procedures by which 
these cards were made up are the same as those described in detail elsewhere 
(14, p. 47), except that the verbs were randomly selected from the list of 
words developed by Thorndike and Lorge (12) and that deceptive symbols 
(++, —, =, 0) were typed in random order in the upper right corner of each 
card with the purpose of obfuscating Ss’ awareness of the reinforcement con- 
tingencies. The order in which the cards and the reinforcers were presented 
was random and different for each S. 
The following instructions were given:

“We are attempting to develop a new method which is to be used to
diagnose one’s personality. You are going to look at some cards one at a time. 
For each card I would like you to make up a sentence using one of the three 
upper words as the first word in the sentence and one of the three lower 
words as the second word in the sentence. You may use any one of the upper 
words and any one of the lower words. It is important that you respond with 
the first sentence that occurs to you. We feel that your sentences can be used 
to understand your personality. I may interrupt you whenever I have reached 
a diagnostic conclusion and communicate the conclusion to you. Go ahead.” 

Immediately after the termination of the experiment Ss were told that the 
diagnostic conclusions were bogus. 

The “diagnostic conclusions” mentioned in the instructions were actually
reinforcers contingent upon the use of either “I” or “we.” Experimental
Group 1 (high n Achievement) was reinforced with Achievement statements
for the selection of first person pronouns: whenever Ss constructed a sentence
starting with “I” or “we,” E delivered an Achievement statement. Experimental
Group 2 (high n Achievement) was reinforced with Antiachievement
statements. Experimental Group 3 (low n Achievement) was reinforced
with Achievement statements, and Experimental Group 4 (low n Achievement)
was reinforced with Antiachievement statements. Control Group 1
(random n Achievement) was reinforced with Achievement statements, and
Control Group 2 (random n Achievement) was reinforced with Antiachievement
statements. Control Group 3 (random n Achievement) received no
reinforcement. The order in which the reinforcers were presented was random
with the following restriction: When one statement was selected, it was
removed from E’s choices until all other statements had been used.
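
The restriction on reinforcer order amounts to sampling without replacement in repeated cycles: each statement is used once, in random order, before any statement can recur. A minimal sketch follows. The function name and the placeholder statement labels are hypothetical, and the number of reinforcers needed (25) is an arbitrary illustration.

```python
import random


def reinforcer_sequence(statements, n_needed, rng):
    """Return n_needed statements in random order, reusing none
    until every statement has been delivered once."""
    sequence = []
    while len(sequence) < n_needed:
        cycle = list(statements)
        rng.shuffle(cycle)      # fresh random order for each full pass
        sequence.extend(cycle)
    return sequence[:n_needed]


rng = random.Random(0)
statements = [f"Achievement statement {i}" for i in range(1, 12)]  # 11 placeholders
order = reinforcer_sequence(statements, 25, rng)
```

In the experiment the number of reinforcers delivered depended on how often S began a sentence with “I” or “we,” so in practice the sequence would be consumed one statement at a time rather than generated to a fixed length.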


The following are examples of the 11 Achievement statements used:

1. You will benefit from making an effort to be the best at whatever task
you may undertake.

2. If you learn to work hard for the goals that you have set for yourself,
you will be successful.

3. When your work is your primary concern, you will be proud of your
accomplishments.

The following are examples of the 11 Antiachievement statements:

1. You should have a good time and not work [illegible] experience some
setbacks in the tasks you [illegible].

2. Hard work and making proper sacrifices are not always necessary in
order to lead a satisfactory life.

3. You may be less happy if you concentrate on being good at everything
you do.

Achievement statements and Antiachievement statements were matched
with regard to content and length. 

Five judges, who were graduate students in psychology, were presented with 
the 22 statements and were asked to classify them as either Achievement state- 
ments or as Antiachievement statements. By a successive trial and error 
procedure the statements were altered or replaced by other statements until 
an agreement of 100% was reached for the total of 110 judgments made.

In order to prevent bias when conditioning Ss, E did not know whether they 
had been classified as having high, low, or random n Achievement. (Ss were 
instructed not to put their names on their protocols. The numbers that 
identified Ss were covered up when E scored the stories.) 

The operant level of first person pronoun usage was determined during 
the first two of the 10 trial blocks (a trial block consisted of 12 trials) during 
which no statements were delivered. Similarly, no statements were delivered 
during the extinction period, which consisted of the last two blocks of trials, 
Trial Blocks 9 and 10. 
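
The block structure above (120 trials in 10 blocks of 12, with the first two blocks as the operant period and the last two as extinction) can be made concrete with a short sketch. Treating Blocks 3 through 8 as the treatment period is an inference from the text, and the response record, the names, and the lowercase matching of pronouns are assumptions made for illustration.

```python
TRIALS_PER_BLOCK = 12
N_BLOCKS = 10
# Operant and extinction blocks are stated in the text; treating Blocks 3-8
# as the treatment period is an inference from those two statements.
PERIODS = {"operant": (1, 2), "treatment": (3, 8), "extinction": (9, 10)}
FIRST_PERSON = {"i", "we"}


def block_counts(pronouns):
    """Count first person pronouns in each of the 10 trial blocks."""
    if len(pronouns) != TRIALS_PER_BLOCK * N_BLOCKS:
        raise ValueError("expected one pronoun per trial (120 trials)")
    counts = []
    for b in range(N_BLOCKS):
        block = pronouns[b * TRIALS_PER_BLOCK:(b + 1) * TRIALS_PER_BLOCK]
        counts.append(sum(1 for p in block if p.lower() in FIRST_PERSON))
    return counts


def period_mean(counts, period):
    """Mean number of first person pronouns per block within one period."""
    first, last = PERIODS[period]
    blocks = counts[first - 1:last]
    return sum(blocks) / len(blocks)


# Hypothetical record for one S: the pronoun that opened each sentence.
record = ["I", "he", "we", "they", "you", "she"] * 20
counts = block_counts(record)
operant_level = period_mean(counts, "operant")
```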

After the treatment period and before the extinction period, Ss were pre- 
sented with a written questionnaire to determine whether they were aware of 
the reinforcement contingencies. The questionnaire consisted of 14 questions
and was a modification of the one suggested by Spielberger (10, p. 86). Four 
graduate students judged the written answers, classifying Ss as either aware 
or unaware. The judges were instructed to classify as aware only Ss whose 
hypothesis regarding the experiment would enable them to obtain 100% rein- 
forcement. 


C. Results


A repeated measures analysis of variance (15) showed that there were no
differences among the numbers of first person pronouns emitted by the seven 
groups during the operant period of the experiment. A second repeated 
measures analysis of variance showed that there were significant differences 
among the seven groups with regard to the numbers of first person pronouns 
emitted during the treatment period (F = 4.18, df = 6/63, p < .01). 
According to a third repeated measures analysis of variance there were no 
differences among the groups during the extinction period. For this reason 
the data for this period will not be discussed further. 

Figure 1 shows the mean numbers of first person pronouns emitted during
the operant period and the treatment period by Experimental Group 1 (Ss
with high n Achievement reinforced with Achievement statements), by Experimental
Group 2 (Ss with high n Achievement reinforced with Antiachievement
statements), and by Control Group 3 (Ss with random n Achievement
receiving no statements). Newman-Keuls procedures indicated that the high
n Achievement group reinforced with Achievement statements emitted significantly
more first person pronouns overall (p < .01) than the no statements
group, while the high n Achievement group reinforced with Antiachievement
statements did not differ significantly overall from the no statements group.
The group reinforced with Achievement statements was significantly above
the no statements group during each of the last four blocks of trials in the
treatment period (p < .05 to p < .01), while the group reinforced with
Antiachievement statements was above the no statements group during the
last two treatment blocks (p < .05 to p < .01). There was no overall difference
between the two reinforced groups, but during the last block of the
treatment period the group reinforced with Achievement statements was
significantly above the group reinforced with Antiachievement statements with
regard to emission of first person pronouns (p < .05). However, t statistics
for correlated observation data comparing the operant period with the last two
blocks of trials in the treatment period showed that both groups conditioned:
the group reinforced with Achievement statements (p < .01), as well as
the group reinforced with Antiachievement statements (p < .01).

Figure 2 shows the mean numbers of first person pronouns emitted during 
the operant period and the treatment period by Experimental Group 3 (Ss 
with low n Achievement reinforced with Achievement statements), by Experi- 
mental Group 4 (Ss with low n Achievement reinforced with Antiachieve- 
ment statements), and by Control Group 3 (Ss with random n Achievement 
receiving no statements). Newman-Keuls procedures indicated that the low 
n Achievement group reinforced with Antiachievement statements emitted 
overall (p < .01) and during each of the last three trial blocks of the treat- 
ment period significantly more first person pronouns than the no statements 
group (p < .05 to p < .01), while no differences could be detected between 
the low n Achievement group reinforced with Achievement statements and the 
no statements group. There was no overall difference between the two rein- 
forced groups, but the group reinforced by Antiachievement statements was 
significantly above the group reinforced by Achievement statements during 
the last two blocks of trials in the treatment period (p < .01). Moreover, t
statistics equivalent to those reported above showed that only the group reinforced
with Antiachievement statements conditioned (p < .05).

Figure 3 shows the mean numbers of first person pronouns emitted by Control
Group 1 (Ss with random n Achievement reinforced with Achievement
statements), by Control Group 2 (Ss with random n Achievement reinforced
with Antiachievement statements), and by Control Group 3 (Ss with random
n Achievement receiving no statements). According to Newman-Keuls
procedures the group reinforced with Achievement statements emitted overall
(p < .05) and over each of the last three blocks of the treatment period
(p < .01) significantly more first person pronouns than the no statements
group; also, it was significantly (p < .01) above the group reinforced by Antiachievement
statements during the last treatment block. The difference between
the group reinforced with Antiachievement statements and the no statements
group was not significant. There was no significant overall difference between
the two reinforced groups. Both reinforced groups conditioned as shown by t
statistics (Achievement statements group: p < .01; Antiachievement statements
group: p < .05).

The data of Ss judged to be aware of the reinforcement contingencies on
the basis of their responses to the questionnaire were not analyzed separately,
since only four Ss were aware. The agreement between the judges was unanimous
for all judgments made. One of the aware Ss was in Experimental
Group 1, two were in Experimental Group 2, and one was in Control Group 2.


D. Discussion


The first hypothesis predicted that, for Ss with high n Achievement, Achievement
statements would be more effective reinforcers than Antiachievement
statements. The results shown in Figure 1 indicate that this hypothesis was
confirmed. However, the confirmation of the hypothesis is marginal, since
there is relatively little difference between the two reinforced groups except
during the last block of the treatment period.

The second hypothesis predicted that, for Ss with low n Achievement, Anti- 
achievement statements would be more effective reinforcers than Achievement 
statements. The results shown in Figure 2 indicate that this prediction was
definitely confirmed. 

The third hypothesis predicted that, for a random sample of Ss, Achievement
statements would be as effective reinforcers as Antiachievement statements.
The results shown in Figure 3 indicate that this prediction was not
confirmed. Achievement statements appeared to be more potent reinforcers for
the group under discussion.

It is interesting to note that the only reinforced group that did not perform
at a higher level than the no statements group, and that did not condition
according to a t statistic, was the low n Achievement group reinforced with
Achievement statements. In contrast, the high n Achievement group reinforced
with Antiachievement statements performed at a higher level than the
no statements group to a marked degree, and conditioned according to a t
statistic. The combination of these two results suggests that high n Achievement
Ss are “attracted” by Antiachievement statements, while low n Achievement
Ss are not “attracted” by Achievement statements. Previous research
suggests a relationship between the potency of the reinforcer and its “attractiveness”
to S (e.g., 14). The possibility that Ss with high n Achievement
may be “attracted” by Antiachievement statements while Ss with low n
Achievement may not be “attracted” by Achievement statements could be
explained as follows: the former have to control their desire to “enjoy themselves,”
to “loaf,” etc. in order to satisfy their high need for Achievement;
thus, Antiachievement statements can be expected to release pressure and
create relaxation. It is less probable that Ss with low n Achievement have to
exert an equally strong control over their conflicting tendencies. However, it
is quite possible that high n Achievement Ss are easier to condition than low n
Achievement Ss, no matter what the reinforcement is.

The fact that randomly selected Ss can be more easily reinforced with 
Achievement statements than with Antiachievement statements might be the 
result of a society that is Achievement-oriented, especially among college stu- 
dents. This orientation might result in a higher level of performance (as 
compared to the no statements group) when Achievement statements are 
used as reinforcers, but not when Antiachievement statements are used, for 
all Ss with the exception of a minority with extremely low n Achievement. In
informal discussions after the conclusion of the experiment most Ss expressed
agreement with the Achievement statements and aversion to the Antiachievement
statements.

It is recognized that it would have been preferable to use three control
groups each of high, low, and random n Achievement. However, only 10 Ss’
n Achievement was extremely high, or low, respectively, and it was not
possible to find enough Ss to double the total sample.

Relatively few Ss (four) became aware of the reinforcement contingencies, 
maybe because the simulation of the experiment as a diagnostic procedure
focused Ss’ attention on how they were going to be “diagnosed,” and this 
focused their attention away from the contingencies. Also, Ss probably believed 
that the “diagnostic conclusions” were made on the basis of a large number of 
their sentences, rather than of the emitted pronouns immediately preceding 
the reinforcers; this, again, may have focused their attention away from the 
contingencies. 
The present study, as well as the studies cited in the Introduction (1, 2, 4,
14), emphasizes the fact that verbal reinforcers in VOC procedures need not
be limited mostly to different ways of expressing interest (“Mm-hm”),
approval (“Good,” “Fine”), or agreement (“Yes, you’re right”). The studies
open a wide horizon of new research possibilities: Values, needs, attitudes,
likes and dislikes, personal preferences, and tastes of all kinds could be investigated
with the help of VOC.

However, it may well be that, for example, Antiachievement statements 
could have encouraged Ss with low n Achievement to speak about themselves 
(emit “I”), and Achievement statements could have encouraged them to 
speak about others (emit second and third person pronouns) independent of 
the potency of the statements as reinforcers. Therefore, additional research is 
suggested in which other reinforced response classes are used, rather than first 
person pronouns. The same argument could be applied to all VOC studies 
using the Taffel paradigm and reinforcing “I” by stimuli that S may perceive
as an expression of interest (“Mm-hm”), approval (“Good,” “Fine”), or
agreement (“That’s right”).

The confounding mentioned above would lead to a more frequent use of
“I” than of “we” when a need that is of high strength for a particular group
of Ss is used as reinforcer. However, the frequency of “I” did not differ
significantly from the frequency of “we” in any of the seven groups of Ss. If
further studies yield similar results, VOC could be used as a tool for the measurement
of needs.


E. Summary 

The hypotheses are: ... a higher level when reinforced ... than when reinforced
with Antiachievement statements ... concluded that there is a relationship
between S’s needs and his responses to need-related reinforcers. If further
studies yield similar results, VOC could be used as a tool for the measurement
of needs.
THE EFFECT OF AGGRESSIVE CARTOONS ON THE
BEHAVIOR OF FIRST GRADE CHILDREN* 


State Department of Mental Health, Fairbanks, Alaska; and 
Mississippi State University 


Glenn Thomas Ellis and Francis Sekyra, III


A. Introduction


Many parents and educators have expressed concern over the possibility 
that the content of mass media, particularly films, might have an influence on 
children’s rate and expression of aggression. The results of many studies
suggest that the incidence of aggressive behavior is likely to increase after 
viewing film-mediated aggression. The individual tends to “model” or behave 
in a manner consistent with the actions of the stimulus character. Bryan and 
Schwartz (4) provide an extensive review of investigations ascertaining sym- 
bolic modeling influences on human behavior. Their conclusions definitely 
favor the theory that imitative aggressive behaviors can result from the 
observation of a hostile model. 

Lovaas (8) conducted a study in which he permitted children to play 
with either of two toys, each of which could be operated by depressing a lever. 
One lever operated a hitting doll, and the other lever caused a ball to rise 
within a cage-like structure. Children who had viewed an aggressive cartoon 
directed a greater percentage of responses to the lever that operated the 
hitting doll than did children who had watched a nonaggressive film sequence. 

Bandura, Ross, and Ross (3) compared the effects of portrayed real-life, 
filmed human, and filmed cartoon aggression on the behavior of preschool 
subjects (Ss). Portrayed aggression was directed toward an inflated doll. 
After exposure to the models, the Ss were mildly frustrated; and imitative 
and nonimitative aggressive behaviors were measured. The three treatment 
groups that observed aggressive models exhibited a great number of precisely 
imitative aggressions, whereas similar responses rarely occurred in the non- 
aggressive model group or the control group. The results also indicated that 
film-mediated models are as effective as real-life models in eliciting aggressive
patterns of behavior. 
The effects of a two and one-half minute aggression modeling film upon
the behavior of 20 four- and five-year-old boys indicated that, even in the
absence of frustration, the Ss were more willing to attack a human clown or an 
inflated plastic doll. Hanratty, Liebert, Morris, and Fernandez (7) found, as 
predicted, that the incidence of aggression was higher if the modeling film had 
been viewed, and that the Ss exhibited imitative aggressions. It should be noted 
that the plastic doll was more frequently and more violently attacked than the 
human victim. 

Most research in the area of film-mediated aggression has been conducted
in laboratory settings. Often Ss have been removed from their natural environ- 
ment for the research, and this occurrence may have altered their behavior. 
This raises the question of to what extent the results can be extrapolated. The 
purpose of the present investigation was to ascertain the effects of viewing an 
aggressive cartoon sequence in a setting to which children are accustomed. It 
was hypothesized that viewing a cartoon sequence containing portrayed aggression
would stimulate increased overt aggression displays by the Ss.


B. Method
1. Subjects 


Fifty-one pupils were randomly selected from three first grade classes of a
Georgia elementary school to serve as Ss. The Ss were not segregated according
to intelligence, sex, race, or socioeconomic status. Selection of first grade
pupils was based on the belief that the Ss would exhibit interest in cartoon
... also assumed that first grade children were less inhibited ...

After pretreatment observation ... groups were labeled the “aggressive group,”
the “neutral group,” and the “control group.”
through the process of hitting, tackling, fighting, kicking, shouting, and shooting
the referee.

The neutral film was selected on the basis of a complete absence of aggres- 
sive content. The film depicted a musical variety show featuring animated 
characters engaging in singing and dancing behaviors. 

Aggressive S behaviors were recorded on mimeographed tabulation sheets
dichotomized to distinguish between physical and verbal activities. Physical
aggression was further categorized into striking, pushing, pulling, kicking,
tripping, butting, or throwing classmates or objects. These were classed as
contact (attempted) behaviors. Noncontact physical behaviors consisted of
facial and body gestures which were directed at a classmate, the investigators,
or the teacher. Verbal aggression included name calling and shouting.


3. Procedure 


The experimental procedure consisted of (a) pretreatment observation, 
(b) treatment per se, and (c) posttreatment observation. During the experi- 
ment, school officials cooperated fully, and teachers were instructed not to 
alter their daily activities. 

Prior to pretreatment observation, recording the behavior of each S was
assigned to two assistants. During the pretreatment period, each observer
(O) independently recorded each overt aggressive behavior emitted by S on
the mimeographed checklist. Each S was observed for a 15 minute period
once a day for three consecutive days. All observations occurred at the same
time each day. The Os sat on opposite sides of the room to prevent collaboration
while collecting the data. A pilot study was conducted prior to the investigation
in order to determine interrater reliability. An interrater reliability
coefficient of 1.00 resulted, indicating that the Os reported apparently similar
observations of aggressive behaviors.

Each S was observed while he functioned during a relatively relaxed period,
such as an art period. It was assumed that a relaxed period would minimize
the inhibition of spontaneous behaviors.

Following the three day pretreatment observation period, an arithmetical
mean of aggressive behaviors was calculated for each S. If the two Os disagreed
on the incidence of aggressive behavior for any particular child on any
given day, the highest frequency reported was employed by the experimenter
(E). It was assumed that the aggressive response occurred, but it went unnoticed
by one of the Os. The same technique was applied to the posttreatment
data, and thus a form of consistency was maintained. It should be noted that
interrater discrepancies occurred very rarely.
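
The scoring rule just described can be sketched in a few lines: for each day the higher of the two observers' counts is kept, and the daily values are then averaged. The function name and the example counts are hypothetical.

```python
def pretreatment_mean(observer_1, observer_2):
    """Mean daily aggression count for one S, using the higher count
    whenever the two observers disagree on a given day."""
    daily = [max(a, b) for a, b in zip(observer_1, observer_2)]
    return sum(daily) / len(daily)


# Hypothetical counts for one S over the three pretreatment days.
# The observers disagree only on the second day (0 versus 1).
mean_rate = pretreatment_mean([2, 0, 3], [2, 1, 3])
```

Because the maximum is taken, a disagreement can only raise a score, which is consistent with the stated assumption that the behavior occurred but was missed by one observer.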
Calculation of the average rate of aggression for each S was followed by
hierarchically sequencing all Ss, blocking the Ss into clusters of three, and
randomly assigning each to either the aggressive group, the neutral group, or
the control group. It was assumed that the randomized blocks design would
eliminate the possibility of overloading any group with extremely passive or
aggressive Ss. This technique would also control for such extraneous variables
as teacher influence.
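
The randomized blocks assignment described above can be illustrated as follows: Ss are ordered by pretreatment rate, cut into consecutive clusters of three, and the three treatments are shuffled within each cluster. This is a sketch under stated assumptions; the rates are made-up values, and the function name, seed, and S labels are hypothetical.

```python
import random

TREATMENTS = ("aggressive", "neutral", "control")


def randomized_blocks(mean_rates, treatments=TREATMENTS, seed=0):
    """Rank Ss by pretreatment rate, cut the ranking into blocks of
    len(treatments), and assign treatments at random within each block."""
    rng = random.Random(seed)
    ranked = sorted(mean_rates, key=mean_rates.get, reverse=True)
    k = len(treatments)
    assignment = {}
    for start in range(0, len(ranked), k):
        block = ranked[start:start + k]
        labels = list(treatments)
        rng.shuffle(labels)     # independent random order for every block
        for s, label in zip(block, labels):
            assignment[s] = label
    return assignment


# Hypothetical pretreatment mean rates for 51 Ss.
rates = {f"S{i:02d}": (i * 7 % 13) / 3 for i in range(51)}
assignment = randomized_blocks(rates)
```

With 51 Ss this yields 17 blocks and therefore 17 Ss per group, which agrees with the 16 degrees of freedom reported for the t tests.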

Experimental treatment occurred on the fourth day, and it consisted of
allowing S either to view one of the two cartoon sequences or to engage in
five minutes of informal activity. All three treatment experiences occurred
within the school five minutes before the time set for the beginning of the
pre- and posttreatment observation periods. Treatments were administered
by three assistants who were unaware of the Ss’ pretreatment incidence of
aggression.

The Ss in the aggressive group and the neutral group were escorted to separate
viewing rooms isolated from normal school activity and other extraneous
stimuli. The stimulus films were projected in their entirety, and the Ss were
conducted back to their classrooms.

The Ss in the control group were conducted from the classrooms to an
empty room. Seated in a semi-circle facing the assistant, each S was engaged in
a casual conversation and asked his name, his favorite sport, and his opinion of
school. After a five minute absence, the control Ss were escorted back to their
respective classrooms.

The return of all treatment groups coincided, and the Os immediately
commenced the 15 minute posttreatment observation period. In all cases, O
was unaware of the type of treatment that S received. This control prevented
bias.


C. Results and Discussion

The record of each S was analyzed by comparison of the pretreatment average
rate of aggression with the posttreatment rate of aggression in order to
obtain a change score. An analysis of variance for repeated measures (5) was
performed on the change scores, and the results are summarized in Table 1.
Comparisons of pretest data were unnecessary because the randomized blocks
design controlled for between group differences.


Statistical analysis of the change scores yielded an F (2, 32) = 3.80, p <
.05 for the treatment variance ...
TABLE 1
COMPARISON OF INCREASE IN AGGRESSIVE BEHAVIOR AMONG THE
AGGRESSIVE, NEUTRAL, AND CONTROL GROUPS

Source          SS      df     MS      F
Treatments     14.98     2    7.49    3.80*
Subjects       24.74    16    1.55     .79
Error          62.99    32    1.97
Total         102.74    50

* p < .05.
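
The entries of Table 1 are related by simple arithmetic: each mean square is a sum of squares divided by its degrees of freedom, and each F is a mean square divided by the error mean square. The sketch below recomputes those quantities from the tabled sums of squares; the variable names are illustrative. Small differences in the last digit are expected because the table reports rounded values (for example, the three component sums of squares add to 102.71, against a tabled total of 102.74).

```python
# Sums of squares and degrees of freedom as reported in Table 1.
sums_of_squares = {"Treatments": 14.98, "Subjects": 24.74, "Error": 62.99}
degrees_of_freedom = {"Treatments": 2, "Subjects": 16, "Error": 32}

# MS = SS / df for every source of variance.
mean_squares = {
    source: sums_of_squares[source] / degrees_of_freedom[source]
    for source in sums_of_squares
}

# F = MS(source) / MS(error).
f_treatments = mean_squares["Treatments"] / mean_squares["Error"]
f_subjects = mean_squares["Subjects"] / mean_squares["Error"]

total_ss = sum(sums_of_squares.values())
total_df = sum(degrees_of_freedom.values())
```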


The change scores of the Ss that viewed the aggressive cartoon were com- 
pared with the change scores of the Ss that viewed the neutral cartoon and the 
control Ss by means of an orthogonal analysis. The results are reported in 
Table 2.


TABLE 2
ORTHOGONAL ANALYSIS COMPARING THE MEAN AGGRESSIVE BEHAVIOR
OF THE TREATMENT GROUPS

Source          SS      df     MS      F
A × (N + C)    14.14     1   14.14    7.18*
N × C            .84     1     .84     .43

Note: A = aggressive treatment group; N = neutral; C = control.
* p < .05.


Orthogonal analysis yielded an F (1, 32) = 7.18, p < .05, and it was
concluded that the Ss who viewed the aggressive cartoon emitted a significantly
greater number of aggressive behaviors than the Ss in the neutral or control
groups. Orthogonal comparison of the change scores of the Ss in the neutral
group with the change scores of the Ss in the control group did not yield significant
results. Thus, it appeared that viewing the aggressive cartoon accounted
for the significant treatment effect, and viewing a film per se did not contribute
significantly to the obtained between group variance.
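
The two comparisons above form an orthogonal set, so their sums of squares add up to the treatment sum of squares (in the tables, 14.14 + .84 = 14.98). A minimal sketch of that partition follows. The group means are hypothetical values chosen for illustration and are not the study's means; the contrast weights (2, -1, -1) and (0, 1, -1) are the conventional choice for these two comparisons and are an assumption about how the analysis was set up.

```python
def contrast_ss(means, weights, n_per_group):
    """Sum of squares for one contrast among equally sized groups."""
    value = sum(w * m for w, m in zip(weights, means))
    return n_per_group * value ** 2 / sum(w * w for w in weights)


def between_groups_ss(means, n_per_group):
    """Treatment sum of squares computed directly from the group means."""
    grand = sum(means) / len(means)
    return n_per_group * sum((m - grand) ** 2 for m in means)


# Hypothetical mean change scores for the A, N, and C groups (17 Ss each).
means = [1.5, 0.2, 0.5]
a_vs_rest = (2, -1, -1)   # aggressive versus neutral and control combined
n_vs_c = (0, 1, -1)       # neutral versus control

# Two contrasts are orthogonal when the products of their weights sum to zero.
dot_product = sum(p * q for p, q in zip(a_vs_rest, n_vs_c))

ss_a = contrast_ss(means, a_vs_rest, 17)
ss_n = contrast_ss(means, n_vs_c, 17)
ss_treatments = between_groups_ss(means, 17)
```

Each contrast has one degree of freedom, so its F is its sum of squares divided by the error mean square from the overall analysis.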

To ascertain whether frequency of aggression increased simply as a result of
treatment, separate t tests for correlated data (6) were performed to compare
the pre- and posttreatment aggression scores of each group. Analysis of pre-
and posttreatment scores of the aggressive group yielded a t (16) = 2.84,
p < .05. The hypothesis of no behavioral change was rejected, and it was assumed
that Ss who viewed the aggressive cartoon film increased their rate of
aggression when observed immediately after the treatment. The t tests comparing
pre- and posttreatment scores for the neutral and control groups both failed
to yield significance.
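
A t test for correlated data works on the difference between each S's two scores: the mean difference is divided by its standard error, with n - 1 degrees of freedom. The sketch below shows the computation on hypothetical scores for four Ss; the function name and data are illustrative only.

```python
from math import sqrt


def correlated_t(pre, post):
    """t statistic and degrees of freedom for paired (correlated) scores."""
    diffs = [b - a for a, b in zip(pre, post)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)  # sample variance
    return mean_d / sqrt(var_d / n), n - 1


# Hypothetical pre- and posttreatment aggression scores for four Ss.
t_value, df = correlated_t([1, 2, 3, 4], [2, 4, 5, 4])
```

With 17 Ss in a group the same function would return 16 degrees of freedom, matching the t (16) reported for the aggressive group.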

Thus, film-mediated aggression did indeed facilitate increased aggressiveness 
in first grade children in their natural school environment. Adherents of the
modeling theory might explain the increase of aggression in terms of behaviors 
copied from the cartoon stimulus figures. Bandura et al. (3) pointed out that 
cartoons not only condone, but also facilitate socially unacceptable behavior 
because cartoon models frequently are not subject to punishment for deviant 
aggressive behavior. Bandura and Walters (2) argue that learning is fostered 
through the observation of other people, and televised cartoons provide pictorially
presented models without the accompaniment of any direct instructions
to the observer. The models are of human and fantasy variety. The major 
networks have reportedly eliminated many of the violent scenes occurring in 
television programs directed primarily at adults, but little noticeable revision 
has taken place in children’s programs (1). This seems especially true with 
animated cartoon films. Thus, within the limits of this investigation, the effect 
of filmed cartoon aggression appears to be a very relevant influence on chil- 
dren’s behavior. 

Overall, it can be concluded that viewing film-mediated aggression did have
an influence on the Ss’ behavior. It appeared that the effect of viewing an aggressive
cartoon sequence was an increased rate of emitted hostile behaviors
by the Ss when observed in their natural environment. These data suggest that
serious consideration should be given to the content of children’s cartoons.


D. Summary 


Fifty-one first grade Ss were ... randomized blocks technique based upon data
... who viewed the aggressive cartoon emitted an increased ... immediately after
the treatment.
A COMPARISON OF PREMISE TYPES IN HYPOTHETICO-
DEDUCTIVE THINKING AT THE STAGE OF
FORMAL OPERATIONS*


University of Minnesota 


William M. Bart


A. Introduction


One of the characteristics of the stage of formal operations to Jean Piaget 
(8) is the capacity to think in a hypothetico-deductive manner. Commentators 
on formal thought, such as David Elkind (4) and John Flavell (5), have 
aided in the further refinement of the psychological process of hypothetico- 
deductive thinking. Piaget (8) and Elkind (4) have both indicated that a 
subject at the formal thought stage can accept an absurd statement that is 
contrary to a fact that he knows to be a fact, and deduce valid logical statements
from it. Flavell (5) has indicated that a subject at this stage can accept
an abstruse statement, which has no concrete referents and is beyond the ex- 
perience of the subject, and deduce valid logical statements from it. Un- 
fortunately, in none of the literature on formal reasoning is there any com- 
mentary as to whether deduction from absurd premises is in a psychological 
sense more difficult than, as difficult as, or easier than deduction from abstruse 
premises. 

In two recent reports Bart (1, 2) used the observations on
hypothetico-deductive thinking discussed above to develop item specifications
for tests on formal reasoning. However, no attempts were made in those two
studies to compare deduction from absurd premises with deduction from abstruse
premises. The research reported here deals primarily with the testing of the
following hypothesis related to that issue: valid deduction from absurd
premises is as difficult as valid deduction from abstruse premises with
respect to each valid rule of inference. In other words, it is hypothesized
that a subject will deduce statements from absurd premises with as much ease
as from abstruse premises. The result of the testing of that hypothesis will
reveal the relationship between two types of premises within the realm of
hypothetico-deductive reasoning.


B. METHOD


1. Subjects


Ninety adolescents from the Minneapolis area were used in this study: 30
scholastically above-average students from each of three age levels, 13, 16,
and 19. (It was assumed that most pupils with above-average scholastic
achievement and over 13 years of age would have formal operational
capabilities according to Piagetian findings.) Thirty 13-year-olds were from
accelerated eighth-grade classes in a Minneapolis elementary school; 30
16-year-olds were from accelerated classes in a Minneapolis high school; and
30 19-year-olds were undergraduates at the University of Minnesota. All 90
students were volunteers, as they expressed a willingness to participate in
research on adolescent thought processes.

There were 40 males and 50 females. There is substantial evidence that
there are no sex effects with respect to formal operational skills (9, 10);
thus, the condition of disproportionate sex sampling should not detract from
the findings of the study.

2. Instruments 


a. Information tests. Two tests with 40 free-response items in each test 
were constructed and then administered to each subject. The items in both 
tests related to cultural anthropological terms and topics. The items in one 
test, labelled the absurd information 


be easily answered by the subjects (e.g., “Can a tribe be a blade?”). The items 
in the other test, labelled the abstruse 


of the tests had all absurd 
premises,




The “abstruse” formal reasoning test was constructed with the premises in
the items being derived from the abstruse premise bases. The “absurd” formal
reasoning test was constructed with the premises in the items being derived
from the absurd premise bases; in this case, the item premises were
contrary-to-fact statements based on the factual statements cited in the
absurd premise bases. Both of the tests were set in the content area of
cultural anthropology. The test items were multiple-choice with six
alternatives, and the 30-item logical structures and their order were
maintained in the two formal reasoning tests.²

Example items used in the absurd and abstruse formal reasoning tests,
respectively, with the same logical structure and with the number of the
correct answer choice underlined are the following:

(a) Ears are smaller than toes. Ears are larger than Indians. Therefore . . . 
1) ears are the smallest of the three; 2) toes are smaller than Indians; 3) toes 
are the largest of the three; 4) toes are the smallest of the three; 5) Indians 
are the largest of the three; 6) ears are the largest of the three. 

(b) Aurignacian culture precedes Magdalenian culture. Aurignacian culture
follows Mousterian culture. Therefore . . . 1) Aurignacian culture was
the earliest of the three cultures; 2) Magdalenian culture precedes Mousterian
culture; 3) Magdalenian culture was the latest of the three cultures; 4)
Magdalenian culture was the earliest of the three cultures; 5) Mousterian
culture was the latest of the three cultures; 6) Aurignacian culture was the
latest of the three cultures.
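
The shared logical structure of the two example items can be made explicit with a short sketch. It is an illustration only, not part of the original test materials: the function name `linear_order` and the encoding of each premise as a (lower, higher) pair are assumptions introduced here. Under that encoding, both items reduce to the same three-term ordering problem, and the term ranked last corresponds to the alternative "largest of the three" or "latest of the three."

```python
def linear_order(premises):
    """Return the terms ordered from lowest to highest.

    Each premise is a (lower, higher) pair, e.g. ("ears", "toes") encodes
    "ears are smaller than toes". Raises ValueError if the premises do not
    fix exactly one linear order. (Hypothetical helper for illustration.)
    """
    terms = {term for pair in premises for term in pair}
    remaining = set(premises)
    ordered = []
    while len(ordered) < len(terms):
        # A term may come next only if no remaining premise ranks another
        # term below it.
        candidates = [
            term for term in terms
            if term not in ordered
            and all(higher != term for _, higher in remaining)
        ]
        if len(candidates) != 1:
            raise ValueError("premises do not determine a single linear order")
        lowest = candidates[0]
        ordered.append(lowest)
        # Premises that started from the placed term are now satisfied.
        remaining = {(lo, hi) for lo, hi in remaining if lo != lowest}
    return ordered


# Item (a), absurd premises: ears < toes, Indians < ears (by size).
absurd_item = [("ears", "toes"), ("Indians", "ears")]
# Item (b), abstruse premises: Aurignacian before Magdalenian,
# Mousterian before Aurignacian (by time).
abstruse_item = [("Aurignacian", "Magdalenian"), ("Mousterian", "Aurignacian")]

absurd_order = linear_order(absurd_item)
abstruse_order = linear_order(abstruse_item)
print(absurd_order[-1])    # the term that is "largest of the three"
print(abstruse_order[-1])  # the term that is "latest of the three"
```

In both items the valid conclusion follows from position in the derived order alone, which is the sense in which the logical structure, and not the content of the terms, determines the answer.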

Though these formal reasoning tests were empirically determined with the 
use of the two information tests to be composed of absurd and abstruse 
premises for the sample of 90 subjects, it is not contended that the tests would 
have with certainty absurd and abstruse premises for other samples of subjects. 
What is an absurd premise to one subject may be an abstruse premise to another
subject, and what is an abstruse premise to one subject may be a commonplace 
premise to another subject. Thus, it is suggested that formal reasoning tests 
must be somewhat custom-made for the subjects being tested. 

The tests were intended to determine hypothetico-deductive reasoning
capabilities of the individual. Subjects were asked to accept either absurd or
abstruse premises and then to determine valid conclusions from them. Though
many factual terms, such as “ocher” and “Ordovicians,” were used, these
tests should not be viewed as measures of information, for the informative
terms were primarily sources of misdirection and confusion. In order for
correct resolution to occur, the logical structures and not the semantic
features of the items must have been apperceived and considered.


² Details of these tests may be obtained from the author.

For each of the four tests, subjects were given directions for completing
the test and were told to be frank and honest in their responses. The
information tests were administered to the subjects first, with no time limit
being indicated, as they were power tests, and in the following order: absurd
and then abstruse. Both of the tests were completed in less than one hour in
each of the group-testing sessions. About one month later, the formal
reasoning tests had been constructed and were administered to the subjects in
group-testing sessions in the following order: absurd and then abstruse. The
two reasoning tests were also power tests with no time limits; however, every
subject completed each test in less than 35 minutes.


C. RESULTS


To test the hypothesis relating deduction from absurd premises and
deduction from abstruse premises, a correlational procedure and a procedure
involving the Goodman-Kruskal γ statistic, which is a measure of association
between two ordinal categorizations, were used. The γ statistic and the
correlation coefficient have many similar
are from −1 to 1. Also,


variables is indicated. If γ is near −1

For a more exacting examination of this hypothesis, the computed γ
statistics noting the relation between item scores for each of the pairs of
items with the same logical structures for each of the three groups and for
the total group were calculated. (In Table 2 the item pairs are indexed by
their item numbers so that, for example, item pair 4 relates item 4 of one
test with item 4 of the other test.)

The prominent feature of Table 2 is that only eight of 120 computed γ
statistics are not significant. This indicates that the deduction hypothesis
is confirmed, as variation in premise type is found not to have much of an
effect on the difficulty of a formal reasoning item. In general, the results
of the two procedures tend to affirm the deduction hypothesis.
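
As an illustration of the second procedure, a minimal sketch of the Goodman-Kruskal γ computation is given below. It uses the standard definition, γ = (C − D) / (C + D), where C and D are the numbers of concordant and discordant pairs of subjects and tied pairs are ignored. The function name and the item scores are hypothetical and are not data from this study; they show only how γ would be obtained for one pair of items scored 1 (correct) or 0 (incorrect).

```python
from itertools import combinations


def goodman_kruskal_gamma(x, y):
    """Return gamma = (C - D) / (C + D) for two equal-length ordinal sequences.

    C counts pairs of subjects ordered the same way on both variables,
    D counts pairs ordered in opposite ways; pairs tied on either
    variable are ignored.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    if concordant + discordant == 0:
        raise ValueError("gamma is undefined when every pair is tied")
    return (concordant - discordant) / (concordant + discordant)


# Hypothetical scores of eight subjects on one absurd item and on the
# abstruse item with the same logical structure (1 = correct, 0 = incorrect).
absurd_scores = [1, 1, 0, 1, 0, 0, 1, 1]
abstruse_scores = [1, 1, 0, 0, 0, 1, 1, 1]

gamma = goodman_kruskal_gamma(absurd_scores, abstruse_scores)
print(round(gamma, 3))
```

A value of γ near 1 for an item pair means that subjects who pass one item of the pair tend to pass the other, which is the pattern the deduction hypothesis predicts.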

To understand more fully the implications of the results, it is relevant to 


TABLE 2
COMPUTED γ STATISTICS NOTING RELATION BETWEEN ITEM SCORES FOR EACH PAIR
OF ITEMS HAVING THE SAME LOGICAL STRUCTURES FOR EACH OF THREE AGE
GROUPS AND THE TOTAL GROUP


                   Age group
Item pair     13      16      19     Total group
    1        .55     .61     .59       .56
    2        .64     .69     .73       .68
    3        .72     .66     .75       .70
    4        .45     .61     .60       .53
    5        .57     .59     .62       .58
    6        .74     .67     .65       .68
    7        .22a    .18a    .45       .28
    8        .73     .64     .81       .71
    9        .66     .81     .79       .74
   10        .71     .57     .66       .64
   11        .49     .19a    .56       .40
   12        .59     .68     .82       .68
   13        .54     wal     .66       .62
   14        .19a    AT      .17a      .26
   15        .48     .39     .55       .42
   16        .59     .47     .67       .56
   17        .64     .57     .61       .59
   18        .67     .71     .77       .70
   19        .57     .83     .76       .71
   20        .67     .64     .57       .62
   21        .21a    .53     .20a      .30
   22        .71     .57     .33       .69
   23        .66     .59     .47       .56
   24        .69     .82     .74       .73
   25        .64     .77     .81       .72
   26        .59     .69     .73       .66
   27        .58     .77     .75       .69
   28        A7#     .55     .48       .38
   29        .61     .64     .60       .61
   30        .59     .55     7A        .60


a Not significant at .05 level; all other values are significant at .05 level.


From an examination of Table 3, one can see, as was reported by Bart (2),
that formal thought test means increase from the 13-year age level to the
16-year age level, but then level off and indicate no significant increase from the


period of time after its emergence (8). One can also see that the means are
quite similar at each age level; thus, the pattern of development for deduction
from absurd premises is viewed as similar to the pattern of development for
deduction from abstruse premises.


D. Discussion 


of the present study permit somewhat specific conclusions as to 
thinking at the stage of formal operations. 
was provided to indicate that for various levels in the stage of formal 
deductive thinking from absurd premises is in a psychological sense 
” deductive thinking from abstruse premises. Thus, one could 
absurd premises and abstruse premises having the same logical 
a mar > en deduction stimuli. 
Premise type is not as important
determining formal reasoning item difficulty as item logical struc-
pends to be determined is the psychological order among rules of
Thus, it seems that further research in formal reasoning will be
the Piagetian realm of genetic logic (3).
directed at exploring the developmental patterns of hypothetico-
thinking would provide a more complete explanation of the developmental
process of formal reasoning, as well as indicate to educators the types
of deduction and argumentation processes various adolescents at different
levels of formal thought will be capable of using.


E. SUMMARY


Two information tests and two formal reasoning tests were administered to
90 scholastically above-average 13-, 16-, and 19-year-olds to determine each
subject’s proficiency at deduction from absurd premises and his proficiency at
deduction from abstruse premises. From correlational and Goodman-Kruskal
γ statistical procedures it was determined that valid deduction from absurd
premises is as difficult as valid deduction from abstruse premises for various
levels of formal thought. Logical structures of premises tend to be a more
salient factor in determining hypothetico-deductive thinking difficulty than
types of premises.


ORDINAL POSITION, FAMILY SIZE, AND
PERSONAL ADJUSTMENT*


Brigham Young University 


BLAIR R. SWANSON, RANDY H. MASSEY, AND I. REED PAYNE


A. INTRODUCTION 


This study was concerned with the influence of ordinal position and family 
size as determinants in adjustment as measured in a large private university. 
Few variables have generated such diffuse and often disjointed results as 
studies on ordinal position. Nevertheless, social researchers seem to be willing 
to attest to the importance of birth order as a contributory factor to personality 
and behavior development (3, 5, 18, 19, 21, 23). 

The research on birth order dates back half a century; in fact, the earliest 
studies were done by Galton in 1889 (10). Experimentation conducted since 
then, however, has disappointingly resulted in limited delineation of the specific
factors that operate to make birth order a meaningful psychological variable. 

Attention has also focused on the size of the family as an influence on per- 
sonality growth (9, 17, 18). There seems to be a popular idea that family size is 
inversely related to such desirable influences as economic power, status striving, 
and adjustment (11, 15); but at present the trend is tenuous.


B. REVIEW OF THE LITERATURE


In 1918, Alfred Adler (1) first presented a testable theory of birth order
as an influence on life style. He stated that first-borns usually were more con- 
servative than later-borns. Altus (4) supported this finding with observations 
of a large college population. He measured status quo as derived from a scale 
on the Minnesota Multiphasic Inventory and found that first-borns scored 
higher on conservatism. Similarly, Kammeyer (13) found first-born girls to 
be more traditional in their orientation than later-born girls. 

Studies investigating birth order and general adjustment have shown varied
outcomes. Sampson (20) reviewed a total of 12 analyses based on responses of
nonneurotic, nonpsychotic samples. Six authors reported first-borns with
greater adjustment problems; four identified the second or later-born as more
maladjusted; and two noted no difference between groups. Lester (14) cited
three studies on suicide and suicide attempts. Although among actual suicides
no birth order relationship existed, the oldest and youngest siblings were
overrepresented in adolescent suicidal attempts. The author proposed an
Adlerian interpretation. Cushna et al. (7), in a clinical setting, found a
greater tendency for children with behavioral disorders to be first-born
rather than later-born (34 percent versus 17 percent). In terms of a theory,
Palmer (16) stated the position that the first-born becomes more
adult-centered sooner in his development than the later-born. While the data
do not consistently support a clear relationship between general adjustment
and birth order, there is a slight trend in favor of the second or later-born
as being better adjusted. Before definite conclusions are reached, however,
differences in family size and other environmental variables should be
examined as possible confounding factors.

In an earlier study, Bossard and Boll (6) found a relationship between a
large family size and better adjustment scores for children. The authors,
however, have been criticized for their sampling technique (11). For example,
in the original study, very general open-ended questionnaires were used.
Nevertheless, further support for better adjustment in large families has come
from Elder (9) and Henry (12). Hawkes et al. (11) reported on the relationship
between family size and adjustment, as measured by the Rogers test. Contrary
to Bossard and Boll, the authors found that children from smaller families had
better adjustment scores. Smart (22) presented data to support Hawkes’
findings. Examining the ordinal position of 242 alcoholics in a clinical
setting, he found no predominance of position but noticed an increase in
alcoholism with increasing family size. However, Damrin (8) found no relation
between family size and general adjustment. Like studies on birth order,
research findings on family size and general adjustment have produced
apparently contradictory results.

The present study, then, attempted to separate birth order from family size,
an apparent confounding variable, and to examine the correlation of birth
order and adjustment in a college population. From the trends reported in the
literature, it was hypothesized that (a) first-born students are more
maladjusted than later-borns; (b) there is a relationship between family size,
birth order, and maladjustment.


C. METHOD
1. Subjects 


The Ss were drawn from a sample of 704 students enrolled in a required
health course at Brigham Young University. Forty-three Ss were excluded
because they failed to complete the questionnaire. In all, 354 males and 350
females participated. Sex differences were examined. To test the second
hypothesis, Ss were divided into two groups, those with five or fewer siblings
(Group I) and those with six or more siblings in their families (Group II).
“Only” children were excluded from Group I. Ss were characterized as being
middle class (83 percent), freshmen and sophomores (88 percent), and single
(92 percent).


2. Instrument 


The instrument used was a questionnaire booklet and a special IBM answer 
sheet, both specifically designed for this study. The questionnaire booklet con- 
sisted basically of two sections: the Cornell Index, Form N2 (24) and a per- 
sonal information section. 


D. RESULTS


Regarding birth order and maladjustment, an analysis of variance using
proportional cell frequencies was calculated for the total sample. The
hypothesis that there is a relationship between birth order and maladjustment,
regardless of family size, was not supported by the data (F = .84, df = 1/698,
p > .05). There was, however, a significant difference in maladjustment
between sexes, with females scoring higher (more maladjusted) than males
(F = 24.46, df = 1/698, p < .001).

An analysis showed that the ordinal position was significant only for Ss
with six or more siblings (F = 3.22, df = 2/226, one-tailed p < .05). No
significant difference in birth order was found for Ss from smaller families.
The distinction is interesting when group means and standard deviations are
compared (see Table 1). First-born males in Group II (larger families) had
a mean adjustment score of 6.1, and their counterparts in Group I (smaller
families) of 9.1 (t = 2.40, p < .05).


TABLE 1
MALADJUSTMENT ON THE CORNELL INDEX AND FAMILY SIZE: GROUP MEANS AND SDs

                         First-born        Middle-born        Last-born
Group                   N     X    SD     N     X    SD     N     X    SD
Group I
(5 or fewer siblings)
  Males                74   9.1   8.4   110   9.1   7.9    54   9.0   7.0
  Females              66  12.6   9.7   118  11.8   9.9    50  11.2   6.8
Group II
(6 or more siblings)
76 9.9 77 9 84 7.0
Pemal ge 92 9 13.7 8.6


Females 36 104 8.0 71 13.0

In an effort to ascertain reasons leading to the low score for first-born
males in larger families, responses by first-borns and later-borns on the
personal questionnaire were compared. There were no significant differences
between groups.


E. DISCUSSION AND CONCLUSIONS


Overall, these results indicate the importance of family size as a possible
confounding variable in research involving ordinal position. The lack of
delineation of family size may account for the conflicting results noted
earlier. Concerning ordinal position and adjustment, Sampson (20) stated that
“in the absence of any consistency of results in this survey of general
adjustment, to speculate on their possible meaning would be totally
unwarranted” (p. 207). The present study suggests that these inconsistencies
in results may result from failure to control for family size. The present
data illustrate and confirm the importance of considering not only one’s birth
order, but also the size of the family.

In a rare study in which family size was controlled for a comparison of 
birth order and adjustment, Altus (2) found adjustment and the birth order 
correlation varied according to family size, with the last-born in larger families 
and the first-born in families of two siblings being better adjusted. Table 1 
also confirms the fact that birth order and adjustment are related to family 
size. However, present results show the male first-born in larger families to 
be better adjusted than the male first-born in smaller families, where the 
dichotomy of family size is five and fewer, or six and more siblings. No 
significant differences were noted among females. 

Biographical data for first-born males were compared for the two family
sizes. No significant differences were found for these males. Therefore, no
explanation is offered for the differences in adjustment between first-born
males of large and small families.
* : iia me hie ae ue birth order differences now 
control of sex antes amet can Tols exercised. Methodologically, 
é an essential consideration. 


F. Summary 


The relationship between birth order effects on maladjustment and family
size of 704 college students was studied. Sex differences in adjustment were
noted. Females scored higher on maladjustment than males. No differences
in maladjustment were noted when family size was held constant. However,
maladjustment varied according to ordinal position with larger families.
First-born males appeared better adjusted in larger families than later-born
males. No differences were found for females or smaller families. First-born
males of larger families appeared better adjusted than first-born males of
smaller families. It was concluded that control of sex and family size appears
essential in birth order studies.


WE: SYSTEM*


DAVID P. ARMENTROUT


In a closed feedback loop having two respondents, the net loop gain is readily
known, Equation 1, so that the universe of potential interactions between
respondents over the cumulative bandwidth of all channels can be divided
into those interactions for which loop gain will maintain the interaction, and
those interactions in which it will not. Transition between the two states
occurs by lengthening of the decay time of a random throughput; hence it is a
smooth process, presenting a limitation to a simple dichotomous model,
Equations 2 and 3. In addition, treatment here assumes synchronicity of
throughputs for simplicity.


In the space of n loop sustained throughputs of a group of respondents,
the complexity (enumeration of the “ways of relating”) of the group is
bounded by linear minimum and hypergeometric maximum functions of the
number of throughputs, Equation 4. As the set of throughputs defines a set
of systemic states, each of which is subject to interaction with systemic
throughputs (at least potentially), the loop throughputs may interact,
compounding systemic complexity, Equation 5.


Choice may conveniently be viewed as the selection, according to some
program of “optima,” of a succession of matrix points seemingly most to
be desired, the matrix arising from treatment of interactional attributes as
dimensions, and selection being limited to the space of coincident or
transformably equivalent four-space parameters. Recycling a potential matrix
point as a factual datum provides a loop outputting the matrix of potential
interactions based on known data. The linear case arises for “perfect” digital
interactions, while permitting self-generated parametric variations in mixing
produces the analogue surface arising from potentially infinite compounding of
the hypergeometric term in Equation 4. Choice would then follow a goodness of
fit comparison between projected alternatives and selected criteria.


By assumption of random noise inputs for a constant stimulation of
projected alternatives, a value system or similar choice programming
structure, and interaction under feedback, the primary requirements for a
constellar model of ideation appear to be satisfied. There seems to be no
reason to exclude any such system from the assumption of programmable
cooperation. This would bear on sponge cells or computers as well as swarming
gnats or crowds.

Data from Markey (3) and Parten (4, 5) show that the pronominal usage
“we” is almost the last to develop, and that cooperative play is one of the
last forms to appear. Eisen reports (2) that sensory deprivation during this 
period, about 3.5 to 4.5 years, can result in impairment of interpersonal ad- 
justments later. It seems reasonable that this is indeed the period in which the 
concept of “we” develops. 

An emergent intelligence might be said to pass through four successive
stages which can be readily treated: 1) tabula rasa, anomie, etc., the
nondifferentiation of relationships; 2) a stage of nonspecific differentiation
of the “self” in an amorphous contextual “field”; 3) specific differentiations
(comparisons) between “self” and “field”; 4) limited identification with
certain aspects of the field. A mystic would probably insist on a fifth stage,
the total identification of self and field, which is also the equivalent of
the first stage. A hierarchy of structures is then implied. The feedback model
would be subdivided into the idea of the system without throughput, the
development of feedback supported relationships, compounding of the
relationships into complex interrelationships, projection of the matrix of
interrelationships into partial identification with the source of system
inputs (context), and the fifth stage again of total identification with
context from which the hierarchy might be extended. There is a strong
temptation to speculate on the formation of formal social structure as an
example.

For an interaction to be stable with respect to content as well as magnitude, 
which is gain dependent, any disturbance must be subject to a stabilizing trans- 
Ra ae ey He original systemic state. For a universe 
one of the possible available s i RG ahaa eta ction pony 

ystem states, the probability of which, as 4 
range fee may decrease as the inverse of the hypergeometric summation 
Rey ee 
of individual interaction has Se ae i: ‘th caries on er 
less than that of noise. The actual point i h ; eel peer Ton canal ee 
istics of loop gains, probable systemic i ape Vy pepereptic Saracter: 

» : ; €, etc., and is of interest here only in 
support of the notion of hierarchical group formations. There is no reason to
limit the model to cases in which the elements treated are individuals in other
than group operational terms; hence, the model can be expected to cover all
cases except the hierarchical discontinuity regions in which the
characteristics of the elements obscure the characteristics of the group,
cases of very small or very large population.

Subjective support for the “we” relationship is derived, at least in part, from 
the greater scope of the aggregate conceptualization of the individual, a sort 
of “extension of self.” A greater variety of experience, greater probability 
for creative interaction, and sense of preservation of some aspect of “self” 
despite corporeal demise, etc., all provide such support for the group con- 
ceptualization of “self,” the idea of “we,” that extreme actions (e.g., fanatics’, 
patriots’) have been observed. 

Should one member of a dyad abruptly vary his contribution to system
gain, he may disturb the system of which he is part. Disturbance will require
that the second member either relinquish participation in systemic
determination, thus losing systemic extension of himself and suffering some
degree of “role death,” the idealized passive respondent, or that he respond
with a vigorous righting response, the idealized active respondent. Should he
choose the active response, the first member will then be placed in the choice
role, further altering his systemic contribution or losing any possible
extensional properties that might have been predicated on control of the
system. Clearly, a cyclic situation may develop in which the interactants are
continuously drawn into mutual maintenance of an essentially conflict
situation by virtue of the nature and extent of investments. Whether
postulated as verbal interchange between individuals, or economic interchange
between nations, the model agrees with Coser’s (1) careful analysis of
conflict.


(1) A= = A = loop gain
¢ «p = respondent gains
(2) 2 (usu) € vy, So
MaMa’ € No fa 22


u = set of throughputs
w = subsets of u, by gain
A = subset gains


(3) ae aa i = initial loop impulse
i = impulse after time t
A = loop gain


t = time in cycles


n
(4) Snax (Yn) =H Q) = arte
nin) aimed wha set of population n
e(u,)
(5) facto) oe ()
fain) sigs vee CtCe
FIGURE 1
EQUATIONS


Numbers within parentheses are the equation numbers referred to in the text.

A systems approach to interpretation of interaction appears to offer fair 
leverage of explanation of various group characteristics as feedback derived, 
as well as some of the structural and dynamic properties. In particular, the 
implications of a structural hierarchy may prove interesting in viewing society. 


SUMMARY


The subjective grouping conceptualized as “we” is shown derivable from
feedback within a system. A matrix of possible states for the system model is
proposed and related to the functional stability, generation, and logical
operations of the structure. A hierarchical structure is proposed on the basis
of the nature and definition of group elements. Conflict is shown compatible
with the model as a stable internal process.


SELF-RATINGS OF CREATIVITY, SEMANTIC DIFFERENTIAL
RATINGS, AND PREFERENCES FOR POLYGONS VARYING
IN COMPLEXITY, SIMPLICITY, AND SYMMETRY*


Temple University 


RUSSELL EISENMAN AND MICHAEL S. GROVE


A. INTRODUCTION


Recently, there has been increased interest in the validity of personality 
assessment via self-ratings. Those in favor of self-ratings have argued for the 
simplicity of administration—cost being low and validity being high—relative 
to more indirect methods, such as projective tests. Mischel (8) has reviewed 
a number of studies in which Ss’ simple self-rating proved more valid than 
more complicated procedures, such as interviews, pooled assessment board 
ratings, multiple-regression formulas, factor analysis, etc. Such findings are 
somewhat startling, given the great amount of time and effort expended in 
the employment of more cumbersome methods of assessment. However, such 
findings are not new. Lindzey and Tejessy (7) found that Ss’ self-ratings on
aggressiveness were about as valid as Thematic Apperception Test signs of
aggression. Perhaps the major spokesman for the utility of self-ratings has
been Allport (1), who stated that with normal Ss self-report was an adequate
means of assessment and often superior to the more costly indirect methods.

The purpose of the present research was to study two self-report methods
for the assessment of creativity. The most direct method is to ask Ss to rate
themselves on creativity.


B. METHOD


1. Subjects


The Ss were 68 males and females. Of the 32 males, 22 were in college,
five in high school, and five were employed and above college age. Of the 36 
females, 26 were in college, five in high school, and five were employed, mostly 
above high school age. 


* Received in the Editorial Office on January 24, chan ue published immediately


2. Procedure


All Ss were presented with a photograph of 12 polygons, including nine 
random shapes taken from Vanderplas and Garvin (9) and three symmetrical 
shapes from Birkhoff (2). Ss were asked to choose their three most preferred 
and three least preferred polygons, and to rate each polygon on a 1-to-7 scale 
for beautiful-ugly, strong-weak, and fast-slow, representing the semantic 
differential scales of evaluation, potency, and activity, respectively. Preference 
for complexity in polygons has been related to creativity (4, 6). Further in- 
formation on these semantic differential scales can be found in Eisenman (3). 
After the semantic differential ratings, Ss rated themselves on a 1-to-7 scale
in response to the question “How creative are you?” with 1 representing the
highest creativity and 7 representing the least creativity. Next, Ss were asked
to list all the things that they had done that were creative. Finally, the Ss
were questioned about their ratings and asked to introspect about the basis of
their creativity ratings.


C. RESULTS 


In order to have confidence that Ss are responding to the complexity, sim- 
plicity, and symmetry dimensions in accord with previous findings, Pearson rs 
were run across preferences and semantic differential ratings. Table 1 presents 
the significant rs, which are generally consistent with expectations. This 
confirmation allows for greater confidence in the findings to be reported, as well 
as supporting previous findings of Eisenman (3). 

The correlations of the two self-report creativity measures with complexity, 
simplicity, and symmetry preferences were always opposite. Specifically, low 
(creative) ratings in regard to the question “How creative are you?” were 
associated with dislike of complexity (r = .34, p < .01), liking simplicity 
(r = .25, p < .05), and a nonsignificant r of .19 is the ... symmetry. In 
contrast, the number of self-reported instances of creative behaviors was 
associated with liking complexity (r = .16), ... -.13), and disliking symmetry 
(r = ... and simplicity correlates of number of ... are not statistically 
significant, ... level from the corresponding ... preference with self-ratings. 
Also, the ... symmetry preference is also significant ... of creativity and 
instances re... 
TABLE 1 
SIGNIFICANT CORRELATES OF PREFERENCES FOR COMPLEXITY, 
SIMPLICITY, AND SYMMETRY 

Relationship                                                  r      p 

A. Complexity (number of points on three most 
   preferred shapes) 
   Disliked simplicity                                       .77    .01 
   Disliked symmetry                                         .74    .01 
   Rated complexity strong                                   .25    .05 
   Rated complexity beautiful                                .65    .01 
   Rated moderate complexity beautiful                       .31    .05 
   Rated simplicity ugly                                     .33    .01 
   Rated symmetry weak                                       .25    .05 
   Rated symmetry ugly                                       .54    .01 
   Rated simple-symmetrical polygon weak                     .37    .01 
   Rated simple-symmetrical polygon ugly                     .46    .01 
   Rated simple-symmetrical polygon slow                     .24    .05 
   Rated moderately complex-symmetrical polygon ugly         .48    .01 
   Rated complex-symmetrical polygon ugly                    .31    .01 
B. Simplicity (number of points on three least 
   preferred shapes) 
   Disliked complexity                                       .77    .01 
   Liked symmetry                                            .56    .01 
   Rated complexity weak                                     .31    .01 
   Rated complexity ugly                                     .60    .01 
   Rated moderate complexity ugly                            .26    .05 
   Rated simplicity beautiful                                .35    .01 
   Rated symmetry beautiful                                  .31    .05 
   Rated simple-symmetrical polygon strong                   .31    .05 
   Rated simple-symmetrical polygon beautiful                .32    .01 
   Rated moderately complex-symmetrical polygon beautiful    .33    .01 
C. Symmetry (one point given for every symmetrical 
   shape among the three most preferred, and one point 
   subtracted for every symmetrical shape among the 
   three least preferred) 
   Disliked complexity                                       .74    .01 
   Liked simplicity                                          .56    .01 
   Rated complexity ugly                                     .59    ... 
   Rated moderate complexity ugly                            .25    ... 
   Rated symmetry strong                                     .34    ... 
   Rated symmetry beautiful                                  .68    .05 
   Rated simple-symmetrical polygon fast                     .24    ... 
   Rated simple-symmetrical polygon strong                   .40    ... 
   Rated simple-symmetrical polygon beautiful                .48    ... 
   Rated moderately complex-symmetrical polygon fast         .34    ... 
   Rated moderately complex-symmetrical polygon strong       .34    ... 
   Rated moderately complex-symmetrical polygon beautiful    .62    .01 
   Rated complex-symmetrical polygon beautiful               .38    ... 
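
The three preference scores defined in the Table 1 headings can be illustrated with a short sketch. It assumes each polygon is described only by its number of points and whether it is symmetrical; the `Polygon` class, the function name, and the example shapes are hypothetical and are not taken from the stimulus set used in the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Polygon:
    """Hypothetical description of one stimulus shape."""
    name: str
    points: int          # number of points on the shape
    symmetrical: bool


def preference_scores(most_preferred, least_preferred):
    """Score one subject's choices using the rules in the Table 1 headings."""
    # Complexity: number of points on the three most preferred shapes.
    complexity = sum(p.points for p in most_preferred)
    # Simplicity: number of points on the three least preferred shapes.
    simplicity = sum(p.points for p in least_preferred)
    # Symmetry: +1 per symmetrical shape among the most preferred,
    # -1 per symmetrical shape among the least preferred.
    symmetry = (sum(p.symmetrical for p in most_preferred)
                - sum(p.symmetrical for p in least_preferred))
    return {"complexity": complexity, "simplicity": simplicity, "symmetry": symmetry}


# Made-up choices for a single subject.
liked = [Polygon("a", 24, False), Polygon("b", 20, False), Polygon("c", 10, True)]
disliked = [Polygon("d", 4, True), Polygon("e", 6, False), Polygon("f", 8, True)]
scores = preference_scores(liked, disliked)
```

With these made-up choices the subject favors many-pointed shapes and rejects two symmetrical ones, so the symmetry score is negative.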
D. DISCUSSION 


The two self-report methods for the assessment of creativity are not the 
same. While the number of reported instances of creative behaviors seems to 
bear some relationship to creativity, the self-ratings are typically significant 
and in the opposite direction: low (creative) self-ratings are associated with 
the absence of creativity, as measured by polygon preference. Furthermore, 
the correlation between the two self-report measures is .25, indicating that Ss 
who rated themselves as creative tended to give significantly fewer instances of 
creative behaviors (p < .05). The enthusiasm for self-ratings would seem 
unwarranted as far as the present findings are concerned. Allport’s (1) belief 
that normal Ss can honestly tell you about themselves is not supported. It 
would seem that noncreative Ss are especially likely to rate themselves as 
creative. For example, a grandmother rated herself as creative, and when 
asked why said, “Because I work.” Noncreative Ss may have simplistic, wrong- 
headed notions of what creativity entails, or may be threatened by the research 
and react with defensively high ratings. On the other hand, the comments of 
Ss suggested that creative persons have as their reference group for self-rating 
other fairly creative persons. As a result, relatively creative Ss do not see 
themselves as notably creative. Several Ss with published poetry and other 
indicants of creative accomplishment rated themselves as “3,” which is on the 
creativity side of the rating scale, but near the midpoint. It is also possible that 
some Ss simply did not understand what “creativity” is, and therefore could 
not rate themselves intelligently on this dimension. 

More promising for the assessment of creativity was the finding that reported 
instances of creativity seemed to be associ... unusual uses test, behavior is 
involved ... to obtain a high score on many ... limited amount of 
sophistication. However, he cannot likely fake ... objects, since his score is 
based ... not on what he claims. The same point holds with re... unless the 
person lies or otherwise distorts, ...ing upon his actual behaviors. A 
noncrea... however he wishes, but is under greater constraint when asked to 
report on previous achievements. 
E. SUMMARY AND CONCLUSIONS 


This study focused on the validity of self-ratings of creativity, when 
“creativity” is not defined for the subject. Under these conditions, self-ratings 
of creativity do not appear to be useful, as indicated by correlations with 
criterion measures. The 68 Ss also filled out semantic differential scales on 
polygons, providing insights into how Ss “see” complexity, simplicity, and 
symmetry. Future studies of self-ratings of creativity might provide Ss with 
explicit definitions of what E means by “creativity.” Unless explicit definitions 
are given, Ss are likely to have self-ratings of creativity that do not correlate 
with other alleged measures of the creativity construct. 
AN OBJECTIVE STUDY OF THE EFFECTS OF 
EXPECTATION ON COMPETITIVE 
PERFORMANCE* 


Department of Psychology, Villanova University 


LELAND R. NELSON AND MERRICK L. FURST 


A. INTRODUCTION 


One of the most crucial aspects of human behavior pertains to its energization. 
Because human beings are alive and functioning, psychologists are com- 
pelled to deal with questions about (a) the assumed sources of the energy 
involved and (b) the direction in which this energy is channeled as it is 
expended. Some of the effects of motives and attitudes on behavior have been 
studied systematically by such outstanding investigators as Festinger (3), 
Hull (4), McClelland et al. (6), Miller and Dollard (7), Rotter (8), 
Skinner (10), and Spence (11). 

More recently, the effects of expectation on performance have been studied 
intensively. Ellis and Sermat (2) studied motivational determinants of choice 
behavior in two-person games. Diggory et al. (1) found substantial correla- 
tions between expectations of success and muscle-action potentials. Korman 
(5) studied the effects of subject expectation on performance of creative tasks, 
such as listing as many uses as possible for a brick. He concluded that “Social 
evaluations of one’s competence for a task . . . appear to become internalized 
by the individual in such a manner as to affect his performance for the task.” 

The purpose of this experiment was to study the effects of expectation and 
physical strength on outcomes in a competitive situation. This paper describes 
a direct method of measuring actual arm strength so that competitive situa- 
tions—particularly arm wrestling contests—can be studied objectively. 

There are several possible contests between Ss of different expectation and 
actual arm strengths. For example, it is possible to match a contestant who is 
actually stronger and expects to win with an opponent who is actually weaker 
and expects to lose. The obvious outcome, the stronger man winning and the 
weaker man losing, would not be surprising or informational. 

On the other hand, since expectation and actual strength predict opposite 
outcomes in this experiment, the effects of expectation on competitive 
performance can be studied meaningfully. It was predicted that where actual 
strength differences were reasonably small, the weaker man would win if both 
he and his opponent thought him the stronger of the two. 


B. METHOD 
1. Task and Apparatus 


The basic task of this experiment was to measure actual arm strength of Ss, 
record judged arm strengths, and then pit selected Ss in arm wrestling con- 
tests in which both contestants thought the weaker of the two to be the 
stronger of the two. 

The apparatus for measuring arm strength was designed specifically for 
this arm wrestling experiment. The apparatus consists of a hand grip and rope 
and pulley system which allows the rope to pass freely over the edge of a 
table. An open-sided elevator-type container was designed to hold 15 five- 
pound weights which could easily be inserted into the container while suspended 
from one end of the rope. 

In order to have arm strength measured, S would lean across the table, 
right arm in a simulated arm wrestling position, forearm vertical. The right 
hand then grasped the hand grip which was attached to the rope that held 
the container six inches above the floor. Each S was instructed to hold the 
container off the floor as long as possible while weights were added to the 
container. The number of weights in the container when it touched the floor 
was recorded as S’s strength factor. 


2. Subjects 


The Ss were 32 male undergraduate students enrolled in Social Psychology 
at Villanova University. All of the Ss were in the same section, and the 
experiment was undertaken near the end of the semester when ample 
opportunity to meet each other had been provided. 


3. Procedure 


As each S came to class on the day of the experiment, his strength factor 
was measured by means of the special apparatus. The Ss were not told their 
strength factors and were not allowed to watch each other being measured. 
Then each S was given a sheet of paper upon which he was instructed to list 
his peers according to whether he thought they did better or worse than 
himself on the strength measurement tests. 

After listings were completed, the Ss engaged in the usual class activity 
while the experimenter made pairings for arm wrestling bouts. The 
participants were not told that they were going to arm wrestle, and they were not 
given the opportunity to discuss the experiment while bouts were being 
arranged. 

Using the actual strength factors measured and the listings by the 32 
participants, the experimenter was able to arrange 12 pairings to suit the 
situation where both contestants thought the weaker of the two to be the 
stronger. 
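
The pairing rule described above can be sketched as follows. This is only an illustration under stated assumptions: each S's listing is represented as the set of peers he judged to have done better than himself, a peer absent from that set is taken to have been judged weaker, and the `max_gap` limit of one strength-factor unit is a hypothetical parameter standing in for "reasonably small" strength differences. The function and variable names are not from the source.

```python
def select_contests(strength, judged_stronger, max_gap=1):
    """Return (stronger, weaker) pairs in which both men expect the weaker to win.

    strength        -- dict: subject -> measured strength factor
    judged_stronger -- dict: subject -> set of peers he listed as stronger than himself
    max_gap         -- assumed upper limit on the measured strength difference
    """
    pairs, used = [], set()
    for a in sorted(strength):
        for b in sorted(strength):
            if a == b or a in used or b in used:
                continue
            gap = strength[a] - strength[b]            # positive when a is measured stronger
            a_expects_b = b in judged_stronger[a]       # a listed b as stronger
            b_expects_b = a not in judged_stronger[b]   # b did not list a as stronger
            if 0 < gap <= max_gap and a_expects_b and b_expects_b:
                pairs.append((a, b))
                used.update((a, b))                     # each S takes part in one bout only
    return pairs


# Made-up data for four subjects.
strength = {"S1": 7, "S2": 6, "S3": 9, "S4": 9}
judged_stronger = {"S1": {"S2"}, "S2": set(), "S3": {"S4"}, "S4": {"S3"}}
contests = select_contests(strength, judged_stronger)
```

In the made-up data only S1 vs. S2 qualifies: S1 is measured stronger, yet both men expect S2 to win. S3 and S4 are excluded because their measured strengths are equal.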

The arm wrestling contests that were conducted are presented in Table 1. 


C. RESULTS 


Following Siegel (9), a sign test was used to analyze the results. Table D 
(p. 250) shows that for N = 12, an x ≤ 2 has a one-tailed probability of 
occurrence under H0 of p = .019. 

Thus, in this experiment, statistically significant results were found (p < 
.05). Table 1 shows that 10 out of the 12 contests were won by the 
contestant with the lower strength factor. 
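
The tabled probability can be checked directly from the binomial distribution. The sketch below assumes the usual sign-test null hypothesis that each contest is equally likely to go either way; the function name is hypothetical.

```python
from math import comb


def sign_test_one_tailed(n, x):
    """P(X <= x) when X is Binomial(n, 1/2), i.e. the one-tailed sign-test probability."""
    return sum(comb(n, k) for k in range(x + 1)) / 2 ** n


# Twelve contests, two of which went against the prediction.
p_value = sign_test_one_tailed(12, 2)
print(round(p_value, 3))  # 0.019
```

The exact value is 79/4096, which rounds to the .019 cited from Siegel's table.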


D. Discussion 


The results of this experiment indicate that expectation of winning can 
have a greater influence than an actual moderate strength handicap. How- 
ever, each outcome was also conditional on the stronger opponent expecting 
to lose. 

In the two cases where the stronger opponent won, other factors (such as 
greater experience in hand wrestling) may have been operating. In general, 
however, it seems safe to conclude that in competitive situations, all other 
material things being equal as far as we know, a positive attitude towards 
winning seems to be a crucial factor. 


TABLE 1 
ARM WRESTLING CONTESTS WITH WINNERS SHOWN 

                  Strength            Winner 
Contest           factors        Expected   Actual 
S1 vs. S2         7 vs. 6        S2         S2 
S3 vs. S4         10 vs. 9       S4         S3* 
S5 vs. S6         9 vs. 8        S6         S6 
S7 vs. S8         8 vs. 7        S8         S8 
S9 vs. S10        7 vs. 6        S10        S10 
S11 vs. S12       7 vs. 6        S12        S11* 
S13 vs. S14       6 vs. 5        S14        S14 
S15 vs. S16       6 vs. 5        S16        S16 
S17 vs. S18       11 vs. 10      S18        S18 
S19 vs. S20       5 vs. 4        S20        S20 
S21 vs. S22       12 vs. 11      S22        S22 
S23 vs. S24       8 vs. 7        S24        S24 

* Contrary to expectation. 


E. SUMMARY 


Actual arm strength of male college students was measured by use of 
special apparatus designed to measure arm (wrestling) strength. Following 
the measurement, each S judged his peers as either stronger or weaker than 
himself according to how well he thought they did in the strength measure- 
ment tests. None of the Ss was informed of the actual strength measurements 
or told the real purpose of the experiment. Next, selected Ss engaged in arm 
wrestling contests. In support of the experimental hypothesis, contest results 
were more closely related to expected strength than to actual strength. In 
other words, in a contest between S1 and S2, where S1 was measured stronger 
than S2, but both contestants thought S2 was stronger, S2 had a significantly 
higher probability of winning. 
DETERMINING VOCATIONAL CHARACTERISTICS VS. 
PREDICTING MEMBERSHIP: A FOLLOW-UP 
OF MALE COLLEGE GRADUATES* 1 


Yale University 


PAUL S. BURNHAM AND BENJAMIN A. HEWITT 


A. INTRODUCTION 


That precollege and in-college data available on cumulative records might 
be used to predict careers of men 30 years after the graduation of their college 
class was implied by Burnham (1) in research on the 25 most promising and 
the 25 least promising undergraduate matriculants of the Yale Class of 193X. 
Marked dissimilarities were noted in many in-college respects, such as the 
conspicuously greater number of able than of less able matriculants who 
subsequently received financial aid, concentrated in the natural sciences and 
engineering, and earned bachelor’s degrees. Comparisons of their postcollege 
careers revealed that the high prediction group earned many more advanced 
degrees and published far more books and journal articles. In their vocations, 
both groups achieved a relatively high level of success, but the high group 
far outweighed the low in proportions entering the professions. On the other 
hand, 21 of the low group but only four of the high group became executives 
in business and industry. 

While a superficial interpretation of the last finding might lead one to 
conclude that anyone can succeed in business, we hypothesized that an inter- 
relation of in-college patterns of behavior and performance might characterize 
men in various career groups and thus “explain” their success. To this end, 
we aimed in the present study to analyze (a) to what extent the members of 
separate occupational groups had had similar characteristics during their 
college years and (b) whether these characteristics distinguished them from 
both their classmates as a whole and their counterparts in other occupations. 

As we designed the study we saw two important factors that could militate 
against finding significant results. First, because contingency factors could 


* Received in the Editorial Office on ..., and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 

1 The research reported herein was supported by the Cooperative Research Program 
of the Office of Education, U. S. Department of Health, Education and Welfare. For 
more complete detail than is reported here, the reader is referred to Burnham and 
Hewitt (2). 
have influenced men to change their vocational goals to a second, third, or 
even an “undesirable” choice, we might discover unique patterns characterizing 
those who were always engaged in their chosen field, but might also discover 
the same patterns for men in other occupations. Second, before World War 
II, the level of ability and the socioeconomic background of the Yale popula- 
tion were more homogeneous than broadly representative of college students 
throughout the country; hence, we realized that any positive findings might 
be only suggestive of results that researchers might obtain from studying more 
diversified groups. Our findings imply, however, that theoretically such 
patterns can be revealed and that research of this kind could have meaningful 
applications in vocational counseling. 


B. METHOD 


The completeness of both their undergraduate records and the subsequent 
occupational information about them led us to select three study populations 
comprising undergraduate degree recipients in the following graduating classes: 
(a) 1931 and 1932 combined, (b) 1935 and 1937 combined, and (c) the 
class of 1959. The latter, with occupational information reported by its 
members five years after graduation, seemed also worthy of study as repre- 
sentative of a more recent period. Degree holders in these three study popula- 
tions numbered 1393, 1285, and 899, respectively. 

Many items of antecedent information, such as geographical location of the 
family home, religion of parents, father’s occupation, student’s major in 
college, etc. could be coded but not quantified. Practical considerations 
caused us to restrict this study to an analysis of the 26 items of quantifiable 
information that were available: matriculation age, father’s educational 
level, father’s occupational level, mother’s educational level, number of 
siblings, ... secondary-school grade average, mean College Entrance 
Examination Board achievement scores, Scholastic Aptitude Test score,2 ... 
of freshman year academic achievement at Yale, ... average at Yale, senior 
year average at Yale, Yale scholastic honors, ... (maximum year), 
socioeconomic-educational ..., and ... items indicating the amount of 
participation in a variety of extracurricular pursuits involving athletics and a 
wide range of nonathletic student activities. 

2 First administered in 1926, th... 1928 forms of the College Entrance 
Examination Board’s SAT provided ... score. In 1927 the score was derived 
from seven Verbal and two Mathematical subtests. The 1931, 1933, and 1954 
versions ... score based on Verbal subtests, and also a separate ... present 
study were based entirely on ..., with the exception of the two-ninths 
Mathematical content of the 1927 test. 

Published sources were searched for career information; it was found 
available for 95 percent of the cases. (See Appendix for tabulation of recency 
of career information in approximate number of years since graduation for 
each of the three study population groups.) From this material we developed 
a three-digit coding scale that would accommodate the concentration of our 
sample in the higher-level business and professional categories. While we 
did not estimate their degree of occupational success, we assumed it on the 
whole to be generally high insofar as membership 30 years after college 
graduation is a valid criterion. 

The traditional approach to the analysis of the characteristics of occupa- 
tional groups has been through the mean test score profile method; but 
Fisher’s discriminant function (4) is a more sophisticated technique which 
permits one to determine whether clusters of variously arranged and dis- 
tributed characteristics of one sample are “significantly” different from those 
of the population from which the sample was drawn. We used this technique 
to determine whether graduates employed in the same occupation had had 
sufficiently similar characteristics in adolescence and early adulthood (a) to 
differentiate them from people in other occupations and (b) to permit the 
prediction of their subsequent vocation. 
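
To make the technique concrete, the following is a minimal two-variable sketch of a two-group discriminant function and one standard F test attached to it (the F form of Hotelling's T² for two groups). The source does not spell out its computational formulas, so this is an assumption about one conventional approach and not a reconstruction of the authors' procedure; the function names and the example data are hypothetical.

```python
def mean_vector(rows):
    """Column means of a list of (x1, x2) observations."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(2)]


def scatter(rows, m):
    """2 x 2 matrix of sums of squares and cross-products about the mean m."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for r in rows:
        d = [r[0] - m[0], r[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def two_group_discriminant(group, reference):
    """Return (discriminant weights, F) for two groups measured on two variables."""
    n1, n2, p = len(group), len(reference), 2
    m1, m2 = mean_vector(group), mean_vector(reference)
    s1, s2 = scatter(group, m1), scatter(reference, m2)
    # Pooled within-group covariance matrix.
    pooled = [[(s1[i][j] + s2[i][j]) / (n1 + n2 - 2) for j in range(2)] for i in range(2)]
    det = pooled[0][0] * pooled[1][1] - pooled[0][1] * pooled[1][0]
    inv = [[pooled[1][1] / det, -pooled[0][1] / det],
           [-pooled[1][0] / det, pooled[0][0] / det]]
    diff = [m1[0] - m2[0], m1[1] - m2[1]]
    # Discriminant weights: inverse pooled covariance times the mean difference.
    weights = [inv[i][0] * diff[0] + inv[i][1] * diff[1] for i in range(2)]
    d2 = diff[0] * weights[0] + diff[1] * weights[1]   # squared Mahalanobis distance
    t2 = n1 * n2 / (n1 + n2) * d2                      # Hotelling's T-squared
    f = (n1 + n2 - p - 1) / (p * (n1 + n2 - 2)) * t2   # F with p and n1 + n2 - p - 1 df
    return weights, f


# Made-up scores on two antecedent measures.
occupation = [(5.0, 1.0), (6.0, 2.0), (7.0, 1.5), (6.0, 3.0)]
reference = [(1.0, 1.0), (2.0, 2.5), (3.0, 1.5), (2.0, 3.0)]
weights, f_value = two_group_discriminant(occupation, reference)
```

A larger F indicates that the weighted combination of measures separates the occupational group from the reference sample more sharply; when the two groups are identical the statistic is zero.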

To facilitate the analyses, we drew systematically from each parent pop- 
ulation two 20 percent spaced samples hereinafter identified as Reference 
Samples #1 and #2. Each occupational group numbering at least 25 cases 
was then compared initially to each of the two reference samples. It should be 
noted that in analyzing each occupational group in relation to each of the two 
reference samples, we must expect to have a common membership of 
approximately 20 percent of the occupational group. The fact of this overlap, or 
commonality, implies that any F value found through calculation of the 
discriminant function is conservatively biased; without the overlapping 
membership, the value expected would be greater and consequently more significant 
than the obtained F. Table 1 lists the 16 occupational samples numbering at 
least 25 men in one of the three parent populations. 
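
The sampling scheme and the overlap it produces can be sketched as below. The sketch assumes that a "20 percent spaced sample" means every fifth case in the population list and that the two reference samples start at different offsets; both points are assumptions, and all names and data are hypothetical.

```python
def spaced_sample(population, start, step=5):
    """Systematic sample: every `step`-th case beginning at index `start`."""
    return population[start::step]


def overlap_fraction(group, sample):
    """Share of an occupational group that also appears in a reference sample."""
    members = set(sample)
    return sum(1 for case in group if case in members) / len(group)


# Made-up parent population of 100 case numbers.
population = list(range(100))
reference_1 = spaced_sample(population, start=0)
reference_2 = spaced_sample(population, start=2)

# Made-up occupational group of 25 cases scattered through the population.
occupational_group = list(range(0, 100, 4))
shared = overlap_fraction(occupational_group, reference_1)
```

In this made-up case exactly one fifth of the occupational group also falls in the first reference sample, which is the kind of common membership the text describes.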


C. RESULTS 
1. The 26 Variable Analyses 


Using all 26 antecedent measures we analyzed each of the 16 occupational 
groups in relation to the characteristics of the two reference samples. Since 
12 of the 16 occupational categories yielded one or more F values (Fisher's 
discriminant function) significant at the .05 level, we found herein initial 
support for our hypothesis that vocational groups can be identified at the 
time of their college graduation by unique patterns of antecedent data. In 
the four categories, college teachers, engineers, lawyers, and salesmen, 
statistically significant F values were found in either two or three parent 
populations. Data concerning these significant findings are shown in Table 2. 
Presented here also are data about a high occupational level or 
top-decision-making group, such as newspaper publishers, university presidents, 
and hospital superintendents. Since the significance levels for this high 
occupational group ...t; however, profiles are presented ... of the other 12 
occupational groups show close correspondence. While four ... yielded no 
significant results and thus seemed to have the characteristics of random 
samples, eight yielded significant F values in one parent population. 


TABLE 1 
SIXTEEN OCCUPATIONAL CATEGORIES, EACH HAVING AT LEAST 25 CASES IN 
ONE OR MORE PARENT POPULATIONS 

Parent populations 
1931, 1932 1935, 1937 1959 

Occupational categories n n n 

Brokers, officers, and professional staff 30 
Insurance company officers and professional staff 25 
Professional writer, editor, publisher 34 
Teacher, secondary school 25 
Administrators, medium and small size corporations 5+ 62 
Bank officers, lower level 29 38 
Chief executives, medium and small size corporations 42 64 
Retail store managers and owners 32 26 
Sales Vice Presidents and Sales Managers, 
large, medium, and small size corporations 5+ 31 
Wholesale business managers and owners ... 34 
Administrators, large corporations 75 102 27 
College teachers 62 49 25 
Engineers, professional 54 45 48 
Lawyers 162 130 160 
Physicians and surgeons 109 90 4 
Salesmen 46 35 41 


2. The Nine Most Discriminating Measures 


TABLE 2 
OCCUPATIONAL GROUPS IDENTIFIED IN A 26 VARIABLE DISCRIMINANT ANALYSIS 
AS YIELDING AT LEAST TWO SIGNIFICANT F VALUES 

                              Classes of          Classes of          Class of 
                              1931, 1932          1935, 1937          1959 
Occupational groups           n     F     p       n     F     p       n     F     p 

College teachers              62                  49                  25 
  vs. Reference Sample #1     278   3.49  .01     254   2.34  .01     179   1.45  ... 
  vs. Reference Sample #2     281   4.39  .001    261   2.29  .001    181   1.16  * 
Engineers (professional)      54                  45                  48 
  vs. Reference Sample #1     278   1.58  .05     254   1.63  .05     179   2.28  .01 
  vs. Reference Sample #2     281   1.18  *       261   1.40  *       181   2.75  .001 
Lawyers                       162                 130                 160 
  vs. Reference Sample #1     278   1.92  .01     254   1.97  .01     179   1.60  .05 
  vs. Reference Sample #2     281   2.25  .001    261   2.55  .001    181   1.42  * 
Salesmen                      46                  35                  41 
  vs. Reference Sample #1     278   1.09  *       254   2.19  .01     179   1.59  .05 
  vs. Reference Sample #2     281   1.53  .05     261   2.09  .01     181   1.13  * 
High occupational level       136                 141                 9 
  vs. Reference Sample #1     278   1.58  .05     254   1.51  .05 
  vs. Reference Sample #2     281   1.75  .05     261   1.41  * 

* Not significant; i.e., p > .05. 


By analyzing t scores, we determined the relative contribution of each of 
the 26 measures to differentiation between each career group and each of the 
two reference samples. Nine antecedent measures contributed most markedly 
and consistently to discrimination; these fall into three categories: 

a. Noncognitive matriculation data. (a) the student’s age, (b) his father’s 
occupational level, and (c) his mother’s educational level, all at the time of 
his matriculation. 

b. Academic achievement factors. (a) the student’s score on the SAT taken 
before matriculation, (b) his senior year average at Yale, (c) the scholastic 
honors he received during his college career. 

c. Nonathletic extracurricular activities. (a) membership in fraternities 
and societies, and involvement in (b) academic, sports, or social clubs, and in 
(c) dramatics, publications, musical organizations, etc. 

Except within the three academic measures, considerable independence 
characterized these nine variables. Table 3 details the correlation coefficients 
based on the aforementioned populations of 1393, 1285, and 899 cases, 
respectively. The large size of each population ensures a high degree of stability in 
the coefficients. Since each coefficient larger than .07 is “significant” at the 
.01 level, those of .07 and smaller suggest no better than a chance relationship 
between the variables. It is quite noticeable that only five of the 108 coefficients 
exceed .30 in size, thus reflecting from another point of view the relatively high 
degree of independence of each measure from each of the other variables. 
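
The count of 108 coefficients and the approximate size of a coefficient needed for significance at the .01 level can be checked with a short calculation. The sketch assumes a two-tailed test and uses a large-sample normal approximation in place of the exact t distribution; the source does not state how its .07 threshold was obtained, and the function name is hypothetical.

```python
from math import comb, sqrt
from statistics import NormalDist


def critical_r(n, alpha=0.01):
    """Approximate two-tailed critical value of Pearson r for a sample of size n.

    Uses the normal quantile in place of the t quantile, which is a reasonable
    approximation only for large n.
    """
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return z / sqrt(z * z + n - 2)


# Nine variables give 36 distinct pairs; three parent populations give 108 coefficients.
n_coefficients = comb(9, 2) * 3

thresholds = {n: round(critical_r(n), 2) for n in (1393, 1285, 899)}
```

Under this approximation the threshold rounds to .07 for the two larger populations and to .09 for the 899-case class of 1959.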
3. The Nine Variable Analyses 


By use of the nine key predictor variables, discriminant function analyses 
were calculated between each of the groups of college teachers, engineers, 
lawyers, and salesmen, and each of the two reference samples. Table 4 
presents F and p values resulting from the comparison of these four occupational 
samples with Reference Samples #1 and #2 and with a third reference group, 
the total population adjusted (TPA) by the removal of the particular occupa- 
tional sample under consideration. Examination of these data (Table 4) reveals 
the consistency of the probability values across the three parent populations, 
the number of instances of discrimination at the .001 level, and additionally 
the greater differentiation here than in the 26 variable analyses of Table 2. 


TABLE 4 
DISCRIMINANT FUNCTION VALUES FOR FOUR OCCUPATIONAL GROUPS IN THE NINE 
VARIABLE ANALYSES WITH BOTH REFERENCE SAMPLES AND ALSO WITH 
THE TOTAL POPULATION ADJUSTED (TPA) 

                              Classes of           Classes of           Class of 
                              1931, 1932           1935, 1937           1959 
Occupational groups           n      F      p      n      F      p      n     F     p 

College teachers              62                   49                   25 
  vs. Reference Sample #1     278    9.07   .001   254    3.89   .001   179   3.20  .001 
  vs. Reference Sample #2     281    10.58  .001   261    4.09   .001   181   2.50  .01 
  vs. TPA                     1331   12.61  .001   1236   5.20   .001   874   3.54  .001 
Engineers                     54                   45                   48 
  vs. Reference Sample #1     278    2.63   .01    254    2.31   .05    179   4.85  .001 
  vs. Reference Sample #2     281    1.16   *      261    2.08   .05    181   6.59  .001 
  vs. TPA                     1339   1.74   *      1240   2.31   .05    851   9.03  .001 
Lawyers                       162                  130                  160 
  vs. Reference Sample #1     278    2.86   .01    254    4.78   .001   179   3.25  ... 
  vs. Reference Sample #2     281    5.59   .001   261    4.98   .001   181   2.85  .01 
  vs. TPA                     1231   5.50   .001   1155   8.35   .001   739   5.22  .001 
Salesmen                      46                   35                   41 
  vs. Reference Sample #1     278    1.61   *      254    3.22   .001   179   2.26  .05 
  vs. Reference Sample #2     281    ...    ...    261    2.86   .01    181   ...   ... 
  vs. TPA                     1347   1.57   *      1250   3.19   .001   858   2.63  .01 

* Not significant; i.e., p > .05. 


Having identified four occupational groups with characteristics differentiating 
them significantly from their parent populations, we determined next the 
extent to which these groups differed from each other on the nine antecedent 
measures. While some evidence of independence was shown in the correlational 
analyses in Table 3, additional information is offered through the results of 
discriminant function analyses between all possible pairs of the four significant 
occupational groups. The resulting F values and their corresponding 
probabilities are summarized in Table 5. In 10 instances there was “significance” 
at the .001 level, in four at .01, and in two at .05. 


TABLE 5 
DISCRIMINANT FUNCTION VALUES FOUND BETWEEN PAIRS OF OCCUPATIONAL 
GROUPS IN THE NINE VARIABLE ANALYSES 

Occupational          Engineers        Lawyers          Salesmen 
groups                F       p        F       p        F       p 

College teachers 
  1931, 1932          5.19    .001     4.27    .001     9.17    .001 
  1935, 1937          2.93    .01      1.34    *        5.52    .001 
  1959                2.42    .05      2.87    .01      4.28    .001 
Engineers 
  1931, 1932                           2.22    .05      1.42    * 
  1935, 1937                           4.59    .001     3.26    .01 
  1959                                 12.10   .001     5.15    .001 
Lawyers 
  1931, 1932                                            3.      .001 
  1935, 1937                                            3.57    .001 
  1959                                                  3.10    .01 

* Not significant; i.e., p > .05. 


By indicating how each of the four occupational groups compared with 
the parent populations (a) in age, father’s occupational level, and mother’s 
educational level, all at matriculation; (b) in levels of academic ability and 
college achievement; and (c) in participation in extracurricular activities, we 
may illustrate the extent of differentiation between these vocational groups. 
In relation to their classmates, the engineers tended to be younger and to 
have fathers with lower occupational levels and mothers with less education; 
the salesmen were high in these respects, and the lawyers and college teachers 
average. In ability and college achievement, the lawyers and college teachers 
were high, the engineers average, and the salesmen low. In extracurricular 
participation, the salesmen were high, the engineers low, and the lawyers and 
college teachers average. 

The degree of differentiation is depicted graphically in Figure 1, which 
presents a profile of the characteristics of each of the four vocational groups 
in relation to each of the three parent populations, and of the highest 
occupational level group for the two earlier parent populations. Since only nine cases 
in the class of 1959 could be classified in the “highest occupational level” 
category, they are not shown in Figure 1. Typical patterns just described 
seem to emerge from the fairly consistent profiles of the lawyers, the college 
teachers, and the salesmen. The profile patterns of the lawyers and college 
teachers differ from those of the salesmen; and the engineers appear to be 
different from the other three. The two profiles for the high occupational 
group seem remarkably consistent; yet their marginal significance values (shown 
in Table 2) illustrate the weakness in relying on profile analysis in interpreting 
these data. 


D. The Prediction of Membership in an Occupational Group 


We must recognize, however, that while antecedent data may predict with 
a useful degree of probability those people who will later be members of certain 
vocational groups, these same data may identify at a much higher probability 
level those who have the qualifications requisite for such membership. Many 
who have the characteristics of an occupational group may not enter that 
vocation for a variety of personal reasons (e.g., financial resources, family 
responsibilities, etc.) unrelated to their aspirations and qualifications. To this 
issue of actual membership versus qualifications for membership, Thorndike 
(5) spoke eloquently as he reviewed his 1959 study Ten Thousand Careers 
(6): “Long range success depends so heavily upon contingency factors that 
one can never hope to predict it from what can be known about the individual 
in advance” (p. 187).

The discriminant function provides a method for combining the original 
predictor measures in a manner to estimate best the degree of correspondence 
of each subject’s pattern of characteristics with those of the career group. A 
discussion of this procedure is presented by Cooley and Lohnes (3) who, in
their research, attempted to estimate each prospective graduate student's later
selection of history or physics as an area of concentration (pp. 136-137).

We hoped to determine through this technique the probability that each 
student might have the characteristics common to a college teacher, an engineer, 
a lawyer, or a salesman. Hence, we calculated four predictive indices for each 
case in each of our three parent populations. Extensive overlapping in the 
distributions of these resulting predictive indices indicated that even though 
we might estimate whether a given individual possessed the qualifications for
membership in an occupation, we could not predict assuredly that he would
become a member of that group. Despite the impressively high discriminant
function values that we had obtained, which demonstrated that there were
recognizable differences in the antecedent characteristics of the four samples,
even the .001 probability level permits a sizeable degree of commonality in
the characteristics of two highly discriminant clusters.
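
The way a discriminant function combines predictors into a single index can be illustrated with a small sketch. It is not the multiple-group procedure used in the study: it is a simplified two-group linear discriminant with two predictors, and the group labels, variable names, and all scores are hypothetical values chosen only for illustration.

```python
# Two-group linear discriminant sketch. All data below are hypothetical and
# the two-predictor, two-group setup is a simplification for illustration.

def mean_vector(rows):
    """Column means of a list of equal-length numeric rows."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]

def score(w, x):
    """Discriminant index: weighted sum of the two predictors."""
    return w[0] * x[0] + w[1] * x[1]

def pooled_covariance(group_a, group_b):
    """Pooled within-group 2 x 2 covariance matrix for two predictors."""
    total = [[0.0, 0.0], [0.0, 0.0]]
    for rows in (group_a, group_b):
        m = mean_vector(rows)
        for r in rows:
            d = [r[0] - m[0], r[1] - m[1]]
            for i in range(2):
                for j in range(2):
                    total[i][j] += d[i] * d[j]
    dof = len(group_a) + len(group_b) - 2
    return [[total[i][j] / dof for j in range(2)] for i in range(2)]

def discriminant_weights(group_a, group_b):
    """Weights w = S^-1 (mean_a - mean_b) and the midpoint cutoff score."""
    s = pooled_covariance(group_a, group_b)
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    ma, mb = mean_vector(group_a), mean_vector(group_b)
    diff = [ma[0] - mb[0], ma[1] - mb[1]]
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    cutoff = 0.5 * (score(w, ma) + score(w, mb))
    return w, cutoff

# Hypothetical predictors: (ability index, extracurricular index).
lawyers = [(62, 48), (66, 52), (70, 50), (64, 55), (68, 45)]
salesmen = [(50, 60), (54, 66), (48, 62), (56, 58), (52, 64)]

w, cutoff = discriminant_weights(lawyers, salesmen)
lawyer_scores = [score(w, x) for x in lawyers]
salesman_scores = [score(w, x) for x in salesmen]
```

In this toy example the two sets of indices fall on opposite sides of the cutoff. With real antecedent data the distributions of such indices can overlap extensively, which is the situation described above.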


E. Concluding Statement


Researchers using in-college data such as ours but having larger, more
heterogeneous study populations than the Yale classes of the 30’s, may be able
to duplicate our findings on similar occupational groups and also to identify
students with qualifications for membership in a variety of occupations. If given
stable college and occupational populations to investigate and if given the
identification of additional personal characteristics having vocational
significance, they may be able to predict actual membership in some occupational
groups. To achieve this goal, other factors, including the effects of
contingencies, must be identified and related to career development, so that they
make a positive contribution to prediction.

On the other hand, future researchers could experience greater difficulty 
than we. The instability in the composition of today’s college and occupa- 
tional populations could complicate the problem of locating clusters of anteced- 
ent characteristics delineating vocational groups. Career counselors could face 
similar problems in matching the characteristics of counselees with those of 
people in specific occupations; they will probably have to be painstaking in 
discovering and then cautious in using antecedent data such as we depicted in 
our nine-variable profiles of the characteristics of college teachers, engineers, 
lawyers, and salesmen. 


F. Summary


Matriculation and in-college data about two groups of men who were
graduated from Yale in the 1930's were related to their most recent occupation
in order to determine whether such occupational groups might have been
identified earlier by unique patterns of antecedent characteristics of an academic
and nonacademic nature. Twenty-six antecedent measures included quantifiable
data about academic ability, achievement, extracurricular participation,
and family background. Occupational information was secured from published
sources. Four of the 16 occupational categories analyzed were found to
possess statistically significant patterns of antecedent characteristics. While
later membership in each of these four occupational groups could not be
predicted with a high degree of assurance, nevertheless the patterns of
qualifications for group membership were determined with a high degree of
statistical significance.

APPENDIX


RECENCY OF CAREER INFORMATION IN APPROXIMATE NUMBER OF YEARS SINCE
GRADUATION BY PERCENT OF SELECTED YALE GRADUATING CLASSES

                                        Class groups (percentages)
Source                              1931, 1932   1935, 1937     1959

Class histories
  10 years (five years for 1959)         2.1          2.1       87.8
  20 years                              12.0          5.4
  30 years                              83.1         84.0
1962 Alumni Directory                    2.8          8.2        8.1
No information                            .0           .3        4.1
                                       100.0        100.0      100.0

REFERENCES

1. Burnham, P. S. Academic success and career choice. Coll. Board Rev., 1961, 45,

2. Burnham, P. S., & Hewitt, B. A. Thirty year follow-up of male college students.
   Final report, project no. 5-0816, contract no. OE-3-10-014, U. S. Department of
   HEW (Office of Education, Bureau of Research), Yale University, Office of
   Educational Research, New Haven, Connecticut, 1967.

3. Cooley, W. W., & Lohnes, P. R. Multivariate Procedures for the Behavioral
   Sciences. New York: Wiley, 1962.

4. Fisher, R. A. Statistical Methods for Research Workers (7th ed.). Edinburgh:
   Oliver & Boyd, 1938.

5. Thorndike, R. L. The prediction of vocational success. Voc. Guid. Quart., 1962,

6. Thorndike, R. L., & Hagen, E. Ten Thousand Careers. New York: Wiley, 1959.
490 Edgewood Avenue 


New Haven, Connecticut 06511 



Published as a separate and in The Journal of Psychology, 1972, 81, 85-88. 


PERCEIVED MATERNAL CONTROL, ANXIETY, AND 
OPPOSITE SEX ROLE PREFERENCE AMONG 
ELEMENTARY SCHOOL GIRLS* 


University of Rhode Island and University of Massachusetts 


Henry B. Biller and Burton Zung


A. INTRODUCTION 


Much attention has been given to examining the consequences of excessive 
maternal control on the individual's psychological functioning. For instance, 
Heilbrun and his colleagues have found relationships between control-centered 
maternal child rearing practices and emotional well-being, and cognitive func- 
tioning (e.g., 3). Some of the consequences of maternal control, pertaining to 
cognitive response to social censure and familial psychopathology, seem related 
to sex role development. The origins of psychopathology are often rooted in 
parentally influenced distortions of the sex role development process (1).
Preadolescent children with anxious, impulsive, and neurotic personalities seem
to prefer more recreational activities characteristic of the opposite sex (5). The
present study was designed to explore the interrelationships among degree of
perceived maternal control, anxiety, and opposite sex role preference in
elementary school girls.


B. METHOD
1. Subjects 


The Ss were 42 white females ranging in age from 9 years to 12 years, 4
months (mean, 10 years, 8 months), and in IQ (Otis Beta Test) from 90 to
126 (mean, 102). They attended elementary school (fourth, fifth, and sixth
grades), lived in rural and semirural towns in western Massachusetts, and
were from lower-middle and middle-class backgrounds.


2. Procedure and Measures 


Elementary school teachers were given questionnaire booklets for group
administration to their classes. The three sections of the main booklet
represented instruments designed to tap perceived maternal control, anxiety,
and degree of opposite sex role preference. The teachers were given
instructions intended as guidelines for presenting the task in a clear,
nonthreatening way. The Ss had an opportunity to ask for clarification of
difficult or ambiguous words in the test items; however, the teachers were told
not to interpret the meaning of entire items. Responses to all items were made
in the questionnaire booklets.

a. Parent perception. Scales from the Schaefer Inventory of Parental Be- 
havior (7) which were heavily loaded on the autonomy-control factor were 
used to measure perceived maternal control (e.g., Possessiveness, Intrusiveness, 
Hostile Control). The autonomy-control factor has been found to describe 
a basic dimension of parent-child interaction. In the current study item
wording was simplified, and positively toned items from the acceptance scale of the
inventory were interspersed among control-related items to facilitate adminis- 
tration to children. The resulting instrument consisted of 71 items, 55 of which 
measured the child’s perceptions of her mother’s controlling attitudes and 
behaviors. Response alternatives were limited to LIKE and NOT LIKE for each 
item, with affirmative responses to items pertaining to control scored at one 
point each. 

b. Anxiety. Portions of the Children’s Manifest Anxiety Scale (2) and the
General Anxiety Scale for Children (6) were used to assess anxiety level.
Redundant item content was eliminated, and the items were clarified and
simplified. The scale used in the present study contained 53 items which fell into
four general content areas: a 12-item asocial anxiety subscale (e.g., fear of
the dark and of certain animals); a nine-item physical-emotional anxiety
subscale (fear of physical injury, of expressing strong emotion, and of losing
self-control); an 11-item social anxiety subscale (fear of social disapproval and
[illegible in source] in social situations); and a 21-item symptom anxiety
subscale (e.g., blushing, insomnia, excessive worry, hypersensitivity,
[remainder illegible in source]).


c. Sex role preference. This variable was measured by means of a 69-item
modification of the Rosenberg and Sutton-Smith play and game list (4). Items
were included that clearly differentiated (p < .01) between the sexes. Only
affirmative responses to discriminating items contributed to the sex role scores.
The final list included 22 masculine items. YES and NO alternatives
were provided for each item.


C. RESULTS AND DISCUSSION

Maternal control was positively associated with overall anxiety level (r =
.39, df = 40, p < .01). The anxiety subscales labeled physical-emotional
anxiety (r = .43, df = 40, p < .01), social anxiety (r = .40, df = 40, p <
.01), and symptom anxiety (r = .44, df = 40, p < .01) were also related
to perceived Hostile Control. Thus, anxiety appeared a notable concomitant
of girls’ perceptions of a generally controlling mother. Girls who viewed their
mothers’ child rearing methods as characterized by a hostile, dictatorial, and
overbearing manner of disciplinary control reported anxious concern with
several aspects of personal functioning.
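
For readers who wish to relate a reported coefficient to a significance table, the usual conversion from a product-moment r with df = N - 2 to a t statistic is sketched below. The function names are illustrative and are not taken from the study; the resulting t would still have to be compared with tabled critical values for 40 degrees of freedom.

```python
import math

def t_from_r(r, df):
    """t statistic for testing a Pearson r against zero, with df = N - 2."""
    return r * math.sqrt(df / (1.0 - r * r))

def r_from_t(t, df):
    """Inverse conversion: the r that corresponds to a given t at df."""
    return t / math.sqrt(t * t + df)

# Reported value: maternal control vs. overall anxiety, r = .39, df = 40.
t_control_anxiety = t_from_r(0.39, 40)
```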

There was a relationship between total anxiety and masculine preference
(r = .40, df = 40, p < .01), suggesting that variations in anxiety could
account to some extent for differences in masculine sex role preference among
girls. Highly anxious girls generally had relatively masculine sex role
preferences. The physical-emotional anxiety (r = .47, df = 40, p < .01) and
symptom anxiety (r = .49, df = 40, p < .001) subscales were also correlated
with degree of masculine preference. Girls’ perceptions of maternal
intrusiveness were related to masculinity (r = .41, df = 40, p < .01) of sex role
preference, but other areas of perceived maternal control were not significantly
related to sex role preference.

According to the current data, self-reported anxiety is associated with
perceived maternal control for girls. Girls’ perceptions of hostile maternal control
were accompanied by reported tensions concerning impulse expression, personal
worth, and diverse cognitive-attentional-autonomic complaints (e.g.,
distractability, emotional lability, indecisiveness, hypersensitivity, excessive
worry, nausea, insomnia). The girls appeared to be particularly responsive to
perceptions of hostile control, and they seemed to react to such perceptions
with emotional tension and concern in several areas of subjective experience.
The girls who appeared most responsive emotionally to perceptions of maternal
control seemed atypical in their sex role preferences. Masculine girls appeared
most vulnerable to anxiety elevation (and perhaps maladjustment in the
general sense) as a concomitant of excessive maternal control.

The present evidence is correlational, and it is important to investigate more
directly the developmental processes involved in the seeming interrelationships
found in this investigation.


D. SUMMARY


The interrelationship among degree of perceived maternal control, anxiety,
and opposite sex role preference of elementary school girls was investigated by
means of the Schaefer Inventory of Parental Behavior, portions of the
Children’s Manifest Anxiety Scale and the General Anxiety Scale for Children,
and masculinity items from Rosenberg and Sutton-Smith’s play and game list.
Degree of maternal control was positively related to anxiety level, and anxiety
level was positively related to degree of masculine sex role preference.
Perceived maternal intrusiveness was also positively related to masculinity of sex
role preference.


THE PREDICTION OF 16 PF TEST SCORES BY
EDUCATIONAL LEVEL AND
DRINKING EXPERIENCE*


Veterans Administration Hospital, Lexington, Kentucky; and 
University of Kentucky 


William F. Gross and Charles F. Elton


A. INTRODUCTION


Personality differences between alcoholics and nonalcoholics have been 
reported on the Sixteen Personality Factor Questionnaire (16 PF). The find- 
ings of these studies (2, 3, 4) and similar ones are only modestly relevant to 
the current stress in epidemiology upon studying the ways in which psychological 
and sociological factors operate to influence drinking behavior. Investigators 
have not attempted to relate personality test scores to biographical data im- 
portant to understanding drinking behavior, such as level of education and 
drinking behavior experience, with use of multivariate techniques of analysis. 

The definition of problem drinking reported by Plaut (7) illustrates the 
difficulty some investigators have had in interpreting the results of studies. 
Plaut’s report of the findings of the Cooperative Commission on the Study of 
Alcoholism defines problem drinking as “. . . a repetitive use of beverage alco- 
hol causing physical, psychological or social harm to the drinker or to others.” 
The difficulty, of course, is the word “causing.” In studies showing a
relationship between personality structure and problem drinking, shall the results be
interpreted as indicating that alcoholic addiction stems from basic patterns of
personality, or shall they be interpreted as indicating that personality test
scores have been influenced by prolonged use of alcohol (9)? Longitudinal
studies of alcoholics have not been designed to aid in our understanding of 
these relationships. 

It may be feasible to examine this problem indirectly, however: that is, by 
using a multivariate analysis of variance design in which present age of S minus
age he began drinking (experience with drinking) is one of the predictor
variables. Should this simple effect be significant, more assurance could be given to
the argument that personality test scores are related in a linear fashion to past
experience with alcohol. Should, on the other hand, the interaction effect be
significant, the argument fails.

The present study, then, investigates the relationship between experience 
with drinking, number of years of education, and scores on the 16 PF test, 
with use of a 3 X 3 multivariate analysis of variance design. In this design, 
three levels of experience with drinking and three levels of education con- 
stitute the independent variables, while scores on the 16 PF are the dependent 
variables. Specifically, it is hypothesized that there is no significant interaction 
effect between levels of experience with drinking and levels of education. 


B. METHOD


Following is a brief description of the Alcoholic Treatment Program at the 
Lexington, Kentucky, Veterans Administration Hospital. A patient who 
enters the hospital as a result of drinking is first detoxified and then given 
information about the Alcoholic Treatment Program (ATP). If the patient 
expresses an interest in participating in the program, he is referred to the 
alcoholic treatment ward where he is interviewed by members of the ATP
staff to determine whether he meets the criteria for admission to the program.
Although these criteria are general and flexible, the patient should 1) be in
reasonably good physical health; 2) express a sincere desire for treatment; 3)
possess normal intelligence; 4) have no severe psychopathology; and 5) sign
a statement agreeing to remain voluntarily in the program to its conclusion
(60 days) and to abide by ATP rules and regulations.

At the time of admission, the patient completes a number of paper and pencil
tests, including the 16 PF and a biographical inventory. A sample of 104
patients was divided into three levels of education and three levels of drinking
experience. The levels of education were 8th grade or less (elementary school);
9th to 12th grade (high school); 13th grade or more (some college). Drinking
experience consisted of the following categories: 19 years or less; 20 to
27 years; 28 years or more, since these categories of experience
constitute approximately the bottom, middle, and top thirds of this distribution.
The mean age of all patients was 43 with an SD of 8; the mean grade completed
in school was 11 with an SD of 2. The alpha level considered for
significance was .05.
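
The assignment of a patient to one cell of the 3 X 3 design can be sketched as follows. The cut points are those given above and the labels I, II, and III follow the figure legends; the function names and the five patient records are hypothetical and serve only to show the bookkeeping, with whole years assumed throughout.

```python
from collections import Counter

def drinking_experience(present_age, age_began_drinking):
    """Years of drinking experience: present age minus age S began drinking."""
    return present_age - age_began_drinking

def experience_level(years):
    """I = 19 years or less, II = 20 to 27 years, III = 28 years or more."""
    if years <= 19:
        return "I"
    if years <= 27:
        return "II"
    return "III"

def education_level(grade_completed):
    """Three education levels defined by highest grade completed."""
    if grade_completed <= 8:
        return "elementary school"
    if grade_completed <= 12:
        return "high school"
    return "some college"

# Hypothetical patients: (present age, age began drinking, grade completed).
patients = [(43, 20, 11), (52, 18, 8), (35, 19, 12), (47, 17, 14), (39, 22, 7)]

# Count how many patients fall in each education-by-experience cell.
cells = Counter(
    (education_level(grade), experience_level(drinking_experience(age, began)))
    for age, began, grade in patients
)
```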


C. RESULTS

TABLE 2
MULTIVARIATE ANALYSIS OF VARIANCE

Source of dispersion                  df    Likelihood ratio test
Education                              2    χ² = 41.38;
Drinking Experience                    2    χ² = 45.
Education × Drinking Experience        4    χ² = 84.31; df


The results of the multivariate analysis of variance are reported in Table 2,
which indicates that the hypothesis of no significant interaction between 16
PF test scores and the independent variables was rejected (p < .05).

The interaction effect of the one significant 16 PF score, Q4, is plotted in
Figure 1 (p < .01). The relationship between degree of intellectual functioning
on the 16 PF, Factor B, and the independent variables is shown in Figure 2.

FIGURE 1
FACTOR Q4 SCORES BY YEARS OF DRINKING AND LEVELS OF EDUCATION
Low score = relaxed, tranquil; high score = tense, fretful. I = 19 years of drinking
or less; II = 20-27 years of drinking; III = 28 years or more of drinking.

FIGURE 2
FACTOR B SCORES BY YEARS OF DRINKING AND LEVELS OF EDUCATION
Low score = low intelligence; high score = high intelligence. I = 19 years or less
of drinking; II = 20-27 years of drinking; III = 28 years or more of drinking.


Table 3 presents the Ss’ mean age by levels of drinking experience and levels
of education.


TABLE 3
MEAN AGE OF Ss BY DRINKING EXPERIENCE
AND LEVEL OF EDUCATION

                        Drinking experience in years
Education (grade)       ≤19      20-27      ≥28

≤8                       39        43        52
9-12                     35        42        51
≥13                      38   [illegible]    47


D. DISCUSSION


The findings in this study do not support the assumption of a simple linear
relationship between scores on the 16 PF and either drinking experience or
level of education. Univariate studies of the relationship between personality
test scores and drinking behavior have floundered on the question: Which
came first, personality deficit or drinking behavior? Figure 1 shows that the
relationships among the independent variables are more complex than they
are simple. Whether a problem drinker is tense and fretful or not is a function
of years of schooling and number of years experience in drinking. Rather than
defining problem drinking as the “repetitive use of alcohol causing
psychological harm to the drinker,” the findings appear to show that repetitive use of
alcohol may be related to psychological harm (inferred from test score)
depending upon years of drinking experience and level of education.

Of course, the present emphasis upon psychological data does not preclude 
the possibility of physical or social harm accruing to the drinker. In fact, it is 
worth noting in Figure 2 that intellectual functioning, as measured by Factor 
B on the 16 PF, decreases as drinking experience increases. Alcoholics at all 
levels of education who have been drinking 19 years or less score higher on 
Factor B than those who have been drinking longer periods of time (p < .08). 
This supports the possibility of intellectual deterioration as a consequence of 
chronic ingestion of alcohol.

In this study, amount of Ss’ experience with drinking appears to be related
to their mean age as indicated in Table 3. Those who have been drinking the
longest period of time are older than those who have less experience in
drinking. Is it the age variable or the drinking experience variable that is interacting
[illegible in source] personality test score? The data analyzed
[illegible in source] our design did not include age level [illegible in source].
The correlation between age and drinking experience for the Ss is .77. This
correlation suggests [illegible in source] use either age or drinking experience
as an independent variable depends upon the choice of drinking experience
[illegible in source] related to the question: Are personality test
[text missing in source]

[text missing in source] validity of self-report. These studies have been
adequately reviewed elsewhere (5, pp. 106-110). Unfortunately, these studies
have not dealt with samples of problem drinkers. Robins (8) indicated that
self-report on the issue of “trouble with the law” associated with drinking was
accurately reported when the criterion was independent checks against police
records.

Doubt as to the validity of self-report among this sample of Ss derives, of 
course, from the possible deterioration effects of intellectual functioning as 
shown in Figure 2, as well as the general tendency to forget or distort earlier 
events with increasing age. Only the longitudinal study is free from the poten- 
tial error of bias in retrospective data. 

The sample of Ss in this study were voluntarily self-admitted to the V.A. 
Hospital with a tentative admission diagnosis of “alcoholism.” The intensity 
of the drinking problem would appear established by the act of seeking hospital 
help. A related issue concerns that period of time in his life when the S rec- 
ognizes that drinking constitutes a serious problem. The establishment of 
these data on the basis of self-report is fraught with all of the hazards
mentioned earlier. In addition, there may be a tendency for the problem drinker
to be unable to recognize when he has passed from the domain of social drinker 
to problem drinker. Although the validity of self-report among problem 
drinkers is an unresolved issue at present, it would appear to be a reasonable 
hypothesis that the age an S began drinking can be recalled with more validity 
than the age he became a problem drinker. 

A recent study (6) succinctly presented the case for multivariate research 
designs in studies on alcoholics. Its conclusion regarding the potential promise 
of multivariate techniques receives additional support in this study. Too fre- 
quently studies (1) have used a host of psychological or sociological variables 
to predict problem drinking, and although each variable may be found to 
account for a limited amount of the variance, their cumulative effect contributes 
to redundancy rather than clarity. There is a pressing need for research that 
combines psychological and sociological variables and that takes into account 
the possible interaction among the independent variables. 


E. SUMMARY


In summary, this study investigated the relationship between experience 
with drinking, number of years of formal education, and scores on the 16 PF, 
with use of a 3 X 3 multivariate analysis of variance design. The Ss consisted 
of 104 patients in the Alcoholic Treatment Program at the Lexington, Kentucky,
Veterans Administration Hospital. Because of a significant interaction
effect, the findings did not support the assumption of a simple linear
relationship between scores on the 16 PF and either drinking experience or
level of education.


PERCEPTIONS OF SELF AND GROUP ROLES AS A
FUNCTION OF REPRESSION-SENSITIZATION
AND TOLERANCE-INTOLERANCE
OF AMBIGUITY*


University of Idaho 


William H. Bergquist and James E. Crandall


A. INTRODUCTION 


Both the repression-sensitization (7, 8) and tolerance-intolerance of am- 
biguity (6) constructs are products of the “new look” perceptual-cognitive 
research of the late 1940’s and early 1950's (e.g., 5, 13). In more recent years, 
however, increased attention has been given to the interpersonal dimension of 
these constructs. In surveying the interpersonal research on the construct of 
repression-sensitization (R-S), Bergquist (4) concluded that repressors, in 
contrast with sensitizers, are more easily influenced by other people, are possibly 
more sensitive to implicit interpersonal cues, and are generally more skillful 
in performing tasks in small-group settings. Individuals who are intolerant of 
ambiguity, as compared to those who are tolerant, describe themselves as more 
docile, less competitive, and less aggressive on Leary’s (15) Interpersonal 
Adjective Checklist, and are apparently more intolerant of interpersonal dis- 
agreement (10). Highly intolerant Ss are also less well liked by their peers, 
though the self-descriptions of these Ss correspond more closely to well liked 
persons than do the self-descriptions of tolerant Ss (11). 

The main purpose of the present research was to investigate whether differ- 
ences in R-S and intolerance of ambiguity influence Ss’ perceptions of self and 
several group-role targets. Given the above findings, it is hypothesized that 
(a) the correspondence between the perceived attributes of self and of a group 
“dissenter” will be greater for sensitizers than for repressors, and for tolerant 
Ss than for intolerant Ss; (b) the correspondence between the perceived 
attributes of self and of a group “social specialist” will be greater for repressors 
than for sensitizers, and for intolerant than for tolerant Ss; and (c) the 
correspondence between the perceived attributes of self and of a group “task 
specialist” will be greater for intolerant than for tolerant Ss.

It is further hypothesized that sensitizers will be more likely to be tolerant
of ambiguity than will repressors. This hypothesis is based on the finding that 
repressors are more likely than sensitizers to make premature decisions on an 
ambiguous cue-search task (18). The results from two other recent studies are 
not consistent with this finding. Dublin (12) found no significant differences 
between repressors and sensitizers in their ratings of threatening and non- 
threatening stimuli for ambiguity, and Machotka (16) found no significant 
differences between repressors and sensitizers in their esthetic preference for 
ambiguous stimuli of a sexual nature. However, since threat was coupled with 
ambiguity in both studies, the results may be somewhat confounded. 


B. METHOD
1. Subjects 


Seventy males and 56 females from an introductory psychology course par- 
ticipated (total N = 126). 


2. Procedure 


Repression-sensitization was measured by the revised R-S scale (9). Low 
scores indicate repression, and high scores sensitization. Tolerance-intolerance 
of ambiguity was measured by Budner’s (6) scale. High scores indicate
intolerance. Completion of these questionnaires was optional, though extra
class points were given for their completion.

Descriptions of self and others were obtained by use of Leary’s (15)
128-item Interpersonal Adjective Checklist. Leary’s scale can be scored according
to eight categories of interpersonal behavior, conceived as forming octants on
a circle. Underlying dimensions for the circle have been labelled strong-weak
and hostile-friendly. The quadrants, with their corresponding octant categories,
are as follows: hostile-strong (competitive and aggressive); hostile-weak
(rebellious and self-effacing); friendly-weak (docile and cooperative); and
friendly-strong (responsible and managerial).

Each student in the class was assigned to an eight to 10 member study
group ([illegible in source] classroom design). The exercises
were integrated with textbook assignments. One such exercise required S to
use the Leary scale to describe himself and the group’s “task specialist,” “social
specialist,” and “dissenter.” The “task” and “social” specialists were to be
identified by S on the basis of brief descriptions [illegible in source] Kagan
and Havemann (14, p. 610):

One type of leader is extremely well liked but ranks lower in activity and
ability; this type of person has been called a “social specialist.” A more 
frequent type is high in activity and ability but less well liked; this kind 
of person has been called a “task specialist.” 
The “dissenter” role was assumed to be commonly known to all of the Ss 
and was not given any additional description or explanation. The Ss were 
informed that the data were being collected for a research project. 


C. RESULTS AND DISCUSSION


1. Repression-Sensitization and Intolerance of Ambiguity 


A product-moment correlation of .09 between R-S and intolerance scores 
did not approach significance. Contrary to the hypothesis, the R-S and in- 
tolerance scales appear to be measuring largely independent aspects of personal- 
ity. 


2. Correlations Between Descriptions of Self and Others on Leary’s Scale 

To analyze relationships between self-descriptions and descriptions of other 
targets, each S’s four Leary scales were scored according to the eight categories
described above. Three correlations were then calculated for each S: viz.,
between the eight pairs of scores for self and social specialist, self and task
specialist, and self and dissenter. The mean correlations were .31, .19, and
-.08, respectively, for the three different pairings.
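
The per-subject index described above, a correlation computed over the eight pairs of octant scores, can be sketched as follows. The octant scores are hypothetical and the function name is illustrative; only the octant labels come from the description of the Leary scale.

```python
import math

OCTANTS = ("competitive", "aggressive", "rebellious", "self-effacing",
           "docile", "cooperative", "responsible", "managerial")

def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical octant scores for one S, listed in the order of OCTANTS.
self_scores = [6, 4, 3, 5, 7, 9, 8, 6]
targets = {
    "social specialist": [5, 3, 2, 5, 8, 10, 9, 6],
    "task specialist": [9, 7, 4, 2, 3, 5, 8, 10],
    "dissenter": [8, 10, 11, 4, 2, 1, 3, 5],
}

# One correlation per target, each based on eight pairs of scores.
profile_r = {name: pearson_r(self_scores, scores)
             for name, scores in targets.items()}
```

Averaging such coefficients over all Ss would give mean correlations of the kind reported for the three pairings.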

In order to determine whether or not either R-S or intolerance of ambiguity
influenced the size of the correlations, Ss were classified according to high,
medium, and low groups on the R-S and the intolerance scales. To analyze
the effects of R-S on the correlations between self-descriptions and descriptions 
of others, each of the above three sets of correlations was analyzed separately 
by the Kruskal-Wallis one-way analysis of variance by ranks (17, p. 434). The 
three analyses were then repeated for the intolerance subgroups. None of the 
six analyses was significant (p > .05). Contrary to the hypothesis, there is no 
evidence that identification, or perceived similarity of self and the three targets 
used here, is influenced by either R-S or intolerance of ambiguity. 
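
The rank-based test named above can be sketched in a few lines. This is a generic illustration of the Kruskal-Wallis H statistic, computed without a correction for ties, and is not a reproduction of the analyses in this study; the function names and the demonstration values are hypothetical.

```python
def midranks(values):
    """Rank values from 1 to N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def kruskal_wallis_h(groups):
    """H = 12 / (N (N + 1)) * sum(R_j ** 2 / n_j) - 3 (N + 1), no tie correction."""
    pooled = [v for g in groups for v in g]
    ranks = midranks(pooled)
    n_total = len(pooled)
    total = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    return 12.0 / (n_total * (n_total + 1)) * total - 3.0 * (n_total + 1)

# Hypothetical self-dissenter correlations for low, medium, and high R-S groups.
low_rs = [-0.20, 0.05, -0.31, 0.12]
medium_rs = [-0.10, 0.02, -0.25, 0.18]
high_rs = [0.01, -0.15, 0.22, -0.08]
h_value = kruskal_wallis_h([low_rs, medium_rs, high_rs])
```

With three groups, H is referred to the chi-square distribution with 2 degrees of freedom when group sizes are not very small.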


3. Differences in Leary Scores for R-S and Intolerance Groups

Analyses were made of differences in Leary octant scores as a function of
R-S, intolerance, and sex. Four-way analyses of variance with repeated measures
and unequal cell size (21, pp. 374-378) were conducted on the data
from each of the four Leary scales. R-S, intolerance, and sex served as
between-subject sources of variance, with Leary octants serving as the within-subject
source of variance.

a. Self-descriptions. The analysis of self-description scores indicated that
while none of the main between-subject effects was significant, the main 
within-subject effect (Leary scale) was significant (F = 14.90, df = 7/756, p 
< .001), as were the interactions between R-S and Leary octant (F = 4.41, 
df = 14/756, p < .001), and sex and Leary octant (F = 5.96, df = 14/756, 
p < .001). The interaction between intolerance and Leary octant was also 
marginally significant, (F = 1.91, df = 14/756, p < .05). Subsequent one- 
way analyses of variance were conducted at each octant level where significant 
interactions were obtained (21, pp. 310-312). Each significant effect obtained 
in these second level analyses was then submitted to post hoc analyses by the 
Newman-Keuls procedure (21, pp. 309-310). Table 1 summarizes the signifi- 
cant differences shown by these analyses. 


TABLE 1
DIFFERENCES BETWEEN LEARY SELF-DESCRIPTIONS ACCORDING TO
REPRESSION-SENSITIZATION, INTOLERANCE OF AMBIGUITY, AND SEX

Leary octant      Differing groups              p

A. Repression-sensitization
Aggressive        sensitizers > repressors     .01
                  medium > repressors          .01
Rebellious        sensitizers > repressors     .01
                  medium > repressors          .01
Self-effacing     sensitizers > repressors     .05
                  sensitizers > medium         .01
Managerial        medium > repressors          .05

B. Intolerance of ambiguity
Aggressive        low > medium                 .05
Rebellious        low > medium                 .05

C. Sex
Competitive       males > females              [illegible]
Docile            females > males              [illegible]


Repression-sensitization seems to be more closely associated with differences
in Leary self-descriptions than is either intolerance or sex. Since the aggressive
and rebellious categories reflect the greatest hostility of any of the octants,
four of the differences involving R-S can be summed up by stating that
repressors attribute less hostility to themselves than do sensitizers or
medium-scoring Ss. Sensitizers are most distinctive for their apparent self-effacement.
The results are consistent with those obtained by Altrocchi (2, 3).

Rosenzweig (19) has hypothesized that sensitizers are “intropunitive”
(directing hostility inward), whereas repressors are “impunitive” (nonexpressive
of hostility). The significant tendency of sensitizers to describe themselves as 
being more self-effacing than repressors, and of repressors to describe them- 
selves as being less hostile than sensitizers, tends to substantiate this hypothesis. 

The intolerance of ambiguity results are similar to those found previously 
by Crandall (10), though not as extensive. As before, the highest scoring Ss 
in the hostile-strong quadrant were the tolerant Ss, whereas the highest 
scorers in the opposite, friendly-weak, sector were the intolerant Ss. However, 
unlike the earlier research, the friendly-weak quadrant was the strongest one 
for all three groups. 

b. Task specialists. Results from the four-way analysis of variance of 
task specialist scores indicate no significant between-subject main effects, but 
a marginally significant interaction between two of the between-subject main 
effects—intolerance X sex (F = 4.46, df = 2/108, p < .05). Post hoc 
analyses were not conducted on this interaction, for a higher order interaction 
between intolerance, sex, and Leary scores was also significant (F = 2.03, 
df = 14/756, p < .025), as was an interaction among R-S, intolerance, and
Leary scores (F = 1.62, df = 28/756, p < .025). The main within-subject
effect (Leary scale) was also significant (F = 75.38, df = 7/756, p < .001).

Two-way analyses of variance at each Leary octant level were subsequently 
conducted on the significant interaction effects. Post hoc Newman-Keuls anal- 
yses were conducted on the significant intraoctant effects that were obtained. 
Most of the significant effects were attributable to differences between medium- 
intolerance males and the other groups. These males attributed less rebellion 
to task specialists than did males of high or low intolerance (p < .01) or
females of medium intolerance (p < .01). Medium-scoring males also attributed
fewer managerial traits to task specialists than did low-scoring males
(p < .05) or medium-scoring females (p < .01). Medium-scoring Ss, regardless
of sex, described task specialists as being more cooperative than did either
high- or low-scoring Ss (p < .05).

Although a previous study (10) based on the Leary scale indicated that 
intolerance had no influence on descriptions of friends, there is evidence here 
that intolerance is relevant to descriptions of task specialists. However, the 
results are somewhat difficult to interpret, given the fact that most of the 
significant findings involved one particular group (medium-scoring males) 
in contrast to other groups. Replication might clarify this issue. 

c. Social specialist and dissenter. Similar analyses for the other two targets 
resulted in only four marginally significant (p < .05) differences on the
Leary octants. One involved intolerance, and three concerned sex differences.
In view of the large number of possible comparisons, these results are difficult
to interpret. 


d. Comparisons between group role responses. Both the tolerance-intoler- 
ance and repression-sensitization dimensions of personality appear to differ- 
entiate between Ss in their descriptions of self on the Leary Adjective Check- 
list. Only the tolerance-intolerance dimension, however, seems to differentiate 
Ss on any of the three group role descriptions. These results point to an 
internal source of R-S differences on the interpersonal dimension. Repressors 
and sensitizers seem to differ in their perceptions of themselves, but not in 
their perceptions of other people (at least as these people are described by the 
three roles that were defined in the present study). These findings correspond 
with those obtained by Altrocchi (1) who found that repressors and sensitizers 
tended to agree in their descriptions of others, but differed in the degree to 
which similar descriptions were ascribed to themselves. 

The capacity of the tolerance-intolerance dimension to differentiate among
group role perceptions points to new avenues of investigation. Such
an expansion of the study of tolerance-intolerance, however, seems to be contingent
upon the clarification of the results obtained in the present study.


D. Summary 


One hundred and twenty-six Ss completed scales for repression-sensitization
(R-S), tolerance-intolerance of ambiguity, and Leary’s Interpersonal Adjective
Checklist. The latter was used for descriptions of self and three group
role targets. While R-S and intolerance of ambiguity were unrelated, both
were associated with differences in Leary self-descriptions. Descriptions of a
“task specialist” were associated with level of intolerance of ambiguity of the
judges, but descriptions of a “social specialist” and “group dissenter” were
largely independent of the judges’ own scores on either R-S or intolerance.
ATTITUDES AND ADJUSTMENT OF ADOLESCENTS
AND THEIR MIDDLESCENT MOTHERS


School of Home Economics, The Ohio State University 


Barbara N. Armstrong and Claribel M. Taylor


A. Introduction


Is the mother a significant influence upon the American adolescent in an 
era in which the generation gap seems to have widened between parent and 
teenager? Are American youth rejecting the work-success-achievement orienta- 
tion of their parents in favor of peer-related values of feeling, sensing, being, 
and loving? Is the mother still successful in passing to the adolescent some- 
thing of the parental attitudes and values? Does the child in his teens reflect 
the adjustive acuity of his parents? The answers to these questions will deter- 
mine, in part, the extent to which the American family will succeed in its 
primary developmental task, the socialization of its members.


1. The Functionality of the American Family 


Empirical data compiled by Ogburn (15) set the stage for an era of specula- 
tion about the future of the American family which was to bridge nearly 50 
years. Major decreases in the traditional functions once assumed by the family
were depicted in his well-documented theory of social change. Sorokin (19) 
and Zimmerman (22) interpreted the decline in the assumption of traditional 
functions as a sign of the deterioration of the family as the primary institution 
in this society. 

The benchmark studies of Burgess and Cottrell (3) and Terman (20) 
brought renewed optimism relative to the functionality of the American 
family. Burgess and Locke (4) described the family as changing from an 
institutional to companionship model and suggested that the changing func- 
tions of the family were necessary for its implementation. Miller and Swan- 
son (14) coined the term “colleague” family to describe the type of familial 
unit important in an urban-industrial society. 
Few writers have been more influential than Parsons (18) in stimulating
optimism about the survival of the family system in America. Parsons portrayed
the family as “differentiated” and specialized in its functions. He
predicted that the family would become increasingly important in the per- 
formance of its remaining functions of which high priority would be given to 
the socialization of its members. 

The ability of the family to assimilate social change was delineated by 
Goode (9) as a key factor in the maintenance of positive mental health among 
the masses in a highly technological, urbanized, and industrialized society. 
How well an individual will adjust to the changing demands of society may 
depend upon the functional acuity of the family in helping each of its mem- 
bers to equilibrate physically and emotionally on a consistent and regular 
basis (21). 

Familial effectiveness in the socialization of its members can be inferred in 
part by the extent to which children take on the attitudes and, subsequently, 
the values, of their parents. Horrocks (10) has cited parental attitudes as 
the single most important factor involved in the character of most homes. 
Thus, investigation of attitude and adjustment similarities between mothers 
and adolescents has been deemed an appropriate measure of familial effective- 
ness in the acculturation of its members. 


2. Adolescence and Middlescence: Stage Similarities


Interesting parallels seem to exist between the adolescent and middlescent
stages of life in the United States. A number of writers have likened middle
age to a second adolescence (2, 5, 11, 17). Since both the middlescent and
adolescent stages in the life cycle are hypothesized to have high culture-related
stress potential, the adjustment requirements for both parents and adolescents
[...] in relation to each other and to the larger social order.


Thus, the adolescent and the middlescent mother are affected by the changing
cultural conditions which yield inadequately defined and confusing role
requirements. There exists the possibility, therefore, that adjustment requirements
for each one are extensive, yielding the probability of stress-producing
outcomes and circular reactions between the two with each other and with
their total environment. However, since the adolescent self-image remains in 
a state of flux, there is reason to believe that the major beneficiary of positive
parent-adolescent relationships would be the adolescent. As the adolescent
searches for self-identity, he sets into motion the mechanism for attitude and
personality formation, dynamic processes which he will use throughout his
life.

Attitudes and adjustment of individuals in these two diverse and yet similar 
groups would seem to reflect some interesting similarities. Correlations be- 
tween attitudinal levels and adjustment potential of the parent and adolescent 
during potentially stressful stages in the individuals’ lives would seem to
suggest that, in fact, the family was to some degree successful in the physical, 
psychological, social, and emotional equilibration of its members. 


3. Statement of the Problem 


Are the attitudes and adjustment patterns of the adolescents in this culture
similar to those of their mothers? This question is one of primary importance
both theoretically and practically. Data pertaining to the effect of the parent
upon the child and possible circular reactions between the two must precede
an assessment of familial effectiveness in the socialization of its members.

Since attitudes act in the service of motives, attitude measurement was con- 
sidered useful in revealing the intensity of an individual’s attitudes, as well as 
the nature of a person’s adjustment to his environment. Important to the 
design of the study was the development and refinement of the Adolescent- 
Middlescent Attitude Scale (AMAS), which was based on the semantic
differential prototype (16).


B. Method


1. Adolescent-Middlescent Attitude Scale (AMAS)


The AMAS, a 36-item scale, utilized 12 major concepts basic to the rubric 
of life in general for middlescents and adolescents. Concepts included were 
Children, Health, Marriage, Leisure, Future, Daily Tasks, Self, Parents, 
School, Community, Friends, and Church. The concepts and bipolar terminology
utilized in the AMAS were designed to elicit a minimal amount of
emotional response on the part of the participant. They were primarily
noncontroversial in nature and deemed useful in measuring basic attitudes that are
reflective of an individual’s long-range attitudinal orientation.

The test-retest reliability coefficient for the AMAS, with a one-week interval
between administrations to 71 ninth grade adolescents, was .87. Validation
was established by comparing respondents’ actual AMAS scores with the mean
attitude scores derived by two independent raters from tape recordings of the
participants’ responses to open-ended questions pertaining to the concepts
included on the AMAS. The Spearman technique of rank correlation was applied
to a comparison of actual and rater attitude scores, yielding a coefficient
of .91 (p < .01).
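
The rank-correlation check used for validation can be illustrated with a short sketch. The scores below are hypothetical and serve only to show the computation; the function computes Spearman's coefficient as the product-moment correlation of average ranks, which agrees with the familiar difference formula when there are no ties.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation between two equal-length score lists."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sxx = sum((a - mean_x) ** 2 for a in rx)
    syy = sum((b - mean_y) ** 2 for b in ry)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: scale scores and mean rater scores for five respondents.
scale_scores = [102, 118, 95, 130, 110]
rater_scores = [3.1, 4.0, 2.8, 4.6, 3.9]
print(round(spearman_rho(scale_scores, rater_scores), 2))
```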


2. The Sample 


Subjects selected for the study were ninth (N = 85) and tenth (N = 74) 
grade students who were members of intact families unbroken by separation, 
divorce, desertion, or death of a parent, and their mothers (N = 159). 
Maternal age parameters were 35-55. Participants were from a central Ohio 
city (population approximately 50,000) and were primarily of the middle
socioeconomic stratum. Approximately two-thirds of the participating mothers
had completed the twelfth grade. (See Table 1 for additional description of
the sample.)


TABLE 1
DESCRIPTION OF RESPONDENTS ON THE ADOLESCENT-MIDDLESCENT ATTITUDE SCALE

                                  Ninth grade sample                  Tenth grade sample
                              Students         Mothers            Students         Mothers
Variable                    Mean     SD      Mean     SD        Mean     SD      Mean     SD

Age of participant      [illegible]   .72    42.29    5.29      15.53    1.34    42.72    5.35
Residency in sample area   12.22    4.42     27.48   15.61      14.00    8.42    23.58   14.29
Size of family              3.62    1.57      3.62    1.57       3.62    1.76     3.62    1.76

The AMAS was administered to the adolescents in their own school settings.
Instruments designed to be anonymous were mailed to mothers of the
participating adolescents. Eighty-two percent of the mothers completed
and returned the instrument.

ninth grade students and their mothers (N = 85 pairs) and for the total
sample of adolescent daughters and their mothers (N = 107 pairs). A
comparison of total attitude scores for the tenth grade students and their
mothers (N = 74 pairs) and the total sample of adolescent boys and their
mothers (N = 52 pairs) yielded correlation coefficients significant at the .05
level of confidence.


TABLE 2
CORRELATION COEFFICIENTS AND t TEST RESULTS DEPICTING DIFFERENCES BETWEEN
MEAN ATTITUDE SCORES OF MOTHERS AND THEIR ADOLESCENT SONS AND
DAUGHTERS AS MEASURED BY THE ADOLESCENT-MIDDLESCENT
ATTITUDE SCALE

               Ninth grade     Tenth grade     Mothers         Mothers
               students        students        and             and
               and mothers     and mothers     sons            daughters
Concept        (85 pairs)      (74 pairs)      (52 pairs)      (107 pairs)

Correlation coefficients
Children        .06             .30*            .30*            .18
Health          .12             [illegible]     .09             [illegible]
Marriage        .32*            .17             .20             .28*
Leisure         .16             [illegible]     .04             .22*
Future          .14             .21             .07             .23*
Community       .23*            .14             .19             [illegible]*
Friends         .21*           -.13             [illegible]     .02
School          .14             .29*            .16             .25*
Parents         .17             .22*            [illegible]     .28*
Church          .37*            [illegible]     [illegible]     [illegible]*
Daily Tasks     .04             [illegible]     [illegible]     [illegible]
Self            .20             .16             .13             .23*
Total score     .30*            .24*            .27*            .31*

Results from t tests
Children        .91            3.56*           3.10*           1.6
Health         2.96*            .68             .00            1.58
Marriage        .04             .65            1.68             .54
Leisure        1.42            1.18             .94            1.54
Future         2.59*            .80             .98            2.12*
Community      1.63            2.39*           2.70*           1.62
Friends        1.79            2.45*           3.06*           1.66
School         1.44            3.79*           [illegible]*    [illegible]*
Parents         .20            2.06*           [illegible]     [illegible]
Church         2.87*           4.99*           3.42*           4.29*
Daily Tasks    4.99*           3.56*           3.04*           5.18*
Self            .83            [illegible]     1.05            1.19


* Significant at the .05 level.


The attitudes of ninth grade students and their mothers correlated significantly
(p < .05) on the concepts Marriage, Community, Friends, Church,
and on total attitude score. Attitudes of the tenth grade adolescent-middlescent
samples correlated significantly on the concepts Children, School, Parents,
and total score.

The mother-son group was smaller (N = 52) than the other samples for
which data were analyzed. This may account in part for the fact that only
two correlation coefficients of significant magnitude (p < .05) were
found. These were in relation to the concept Children and the total attitude
score.

The attitude scores of the mothers and daughters correlated significantly 
(p < .05) on the concepts Marriage, Leisure, Future, Community, School,
Parents, Church, Self, and the total attitude scores. 

The t test was applied to the mean scores of the parents and students to
determine whether or not significant differences existed between the adolescent
and middlescent groups (Table 2). No significant differences existed between
the total attitude scores for the ninth grade students and their mothers (N =
85 pairs) or for the daughters and their mothers (N = 107 pairs). Significant
differences did occur for the tenth grade students and their mothers (N = 74
pairs) and for the sons and their mothers (N = 52 pairs).

Significant differences occurred between ninth grade students and their
mothers on the concepts Health, Future, Church, and Daily Tasks. In the
tenth grade sample, adolescents differed significantly from their mothers in
their attitudes toward Children, Community, Friends, School, Parents,
Church, Daily Tasks, and total attitude scores.

Sons differed attitudinally (p < .05) from their mothers on the concepts
Children, Community, Friends, School, Church, Daily Tasks, and the total
attitude score. Attitudinal differences between mothers and daughters occurred
in relation to the concepts Future, School, Church, and Daily Tasks.


[...] between adolescents and their mothers was suggestive of their adjustment
congruence.


Mean attitude scores suggested that the ninth grade students and their
mothers showed adjustment similarity (Table 3). Significant differences were
found between the attitude intensity of the tenth grade students and their
mothers, and the total sample of adolescent boys and their mothers. These
findings suggested that tenth grade students and the adolescent boys found
adjustment a more traumatic experience than did their ninth grade or female
counterparts.

Adjustment similarity between adolescents and their mothers was inferred
from the clustering of their attitude scores on the concepts Marriage, Future,
Parents, and Self. In relation to the concepts Church, Daily Tasks, Children,
Community, Friends, and School, the mothers showed better adjustment than
did their adolescent sons and daughters (Figure 1).

Attitude scores on the various concepts indicated that the ninth grade
students consistently registered higher levels of adjustment than did the tenth
grade students. It was within the conceptual realm labeled Children, Leisure,
Community, Church, and Daily Tasks that respondents tended to register
marginal adjustment. However, marginal adjustment was indicated on the
AMAS for only a small portion of the participants.

No correlations of a significant magnitude (p < .05) were found between
the total attitude scores for the adolescents and their mothers and such
variables as the age of the mother, the size of the family, or the length of the
family residency in the sample area.

Data relative to the mothers of the ninth grade sample revealed little if 
any effect of the employment status of the mother upon her total attitude 
score (Table 4). However, in the tenth grade sample, mothers who were 
employed part-time out of the home showed the best adjustment. Full-time
homemakers in this sample group showed the poorest adjustment potential. 


E. Summary


The maternal role remains an important one for maturing adolescents in
this culture. Significant correlations were found between the attitude and
adjustment scores for the mothers and their adolescent sons and daughters
who responded to the Adolescent-Middlescent Attitude Scale.

Data from the t tests revealed that significant differences existed between
the mothers and adolescents in two of the four groups for which data were
analyzed. However, since attitudes tend to increase in intensity with age,
adults were expected to have more intense attitudes than their teenage
children. Thus, the correlation coefficients that revealed the tendency of the
attitudes of the mother and child to vary in the same direction were
considered more informative to the attitude theorist than were the scores from the
t test.


FIGURE 1
A COMPARISON OF ATTITUDE SCORES OF NINTH AND TENTH GRADE STUDENTS AND
THEIR MOTHERS AS MEASURED BY THE ADOLESCENT-MIDDLESCENT
ATTITUDE SCALE

An analysis of the findings suggested that the mothers who participated
in the study tended to be most effective and believable models for their
maturing adolescents. Particularly impressive were the attitudinal similarities
between the maternal and adolescent respondents to the AMAS on the concepts
Marriage, Future, and Self (Figure 1). The attitudes of the adolescents
in these important realms seemed to reflect the socialization practices and
prevailing attitudes within the milieu of their families.


TABLE 4
EFFECT OF THE EMPLOYMENT STATUS OF THE MOTHER UPON HER ATTITUDE INTENSITY
AS MEASURED BY THE ADOLESCENT-MIDDLESCENT ATTITUDE SCALE

                         Mothers of ninth            Mothers of tenth
                         grade students              grade students
Employment                             Mean                        Mean
status                   N      %      score         N      %      score
Unemployed               58    64.4    115           40    54.1    106
Employed part-time       14    15.6    112           11    14.9    120
Employed full-time       18    20.0    112           23    31.0    113
Total                    90   100.0    114           74   100.0    111

Even though the attitudes of the fathers were not measured, it is suspected
that the attitudes of the mothers tended to serve as a barometric indicator of
the effectiveness of the family in the socialization of its members.

It might be argued from the findings of the study that the maternal par- 
ticipants had, to a considerable degree, infringed upon the individuality of the 
adolescents by imposing their attitudes upon the children, producing robot- 
like replicas of themselves. However, the reader is cautioned to remember that 
the concepts utilized in the Adolescent-Middlescent Attitude Scale were non- 
controversial in nature and designed to elicit basic attitudes that stem from 
value orientations. Mothers and adolescents might have responded quite dif- 
ferently to an opinionnaire concerning dress code, length of hair, or personal 
behavioral parameters, all of which tend to reveal transient rather than basic 
and stable attitudes.

The similarities that existed between the maternal and adolescent partici- 
pants seemed to form an attitudinal base or threshold from which the emerging 
adult would tend to develop his own premises, formulate his own decisions, 
and endeavor to establish behavior appropriate to his own generation era. 
Thus, existing similarities were interpreted to represent a secure base for the 
necessary exploration from the potentially comfortable realm of childhood into 
the complicated milieu of the adult. 

The dissimilarities found between tenth grade students and their mothers, 
DETERMINANTS OF THE ECOLOGY OF THE DYAD:
THE EFFECTS OF AGE AND SEX* 


Department of Psychology, Texas Tech University 


Ann I. Tyler, Wayne L. Waag, and Clay E. George


A. INTRODUCTION 


The importance of physical space and distance has long been recognized by 
the ethologists in the explanation of certain types of animal behavior. Only 
recently has there developed in the area of social psychology a trend toward 
increased emphasis upon some of the spatial aspects of group behavior. One
context in which research has been done concerns the specification of factors 
affecting the observed spatial characteristics of seating arrangement. As reviews 
by Sommer (4, 5) indicate, the existing information with regard to this 
problem is sparse. Sommer (3) reports that the nature of the social interaction 
is one important determinant of seating arrangement. Using college students, 
he found that for rectangular tables, dyads engaged in conversation preferred 
corner-to-corner arrangements, whereas dyads engaged in a cooperative effort 
preferred a side-by-side arrangement. Competing dyads sat across from one 
another, while coacting dyads sat in distant arrangements. 

Sex composition of the dyad has also been reported to be a determinant of 
seating preferences among individuals. Sommer (2) instructed employees at 
a Canadian mental hospital to be seated at a rectangular table and to dis- 
cuss some topic with a “decoy” who was already seated. Subjects and decoys 
of both sexes were used so that all combinations could be tested. It was ob- 
served that females preferred a side-by-side arrangement more often than 
males. When the dyad was composed of members of the opposite sex, subjects
most often sat across from the decoy.

A characteristic of the cited research is that all observations were obtained 
under controlled laboratory conditions. In each case, subjects were instructed 
to enter the room, be seated, and then perform some task. It seems likely that
the experimental situation per se may be one factor that has an effect upon 
seating preferences. The purpose of the present investigation was to determine 
the effects of the sex composition of the dyad and the age of its members upon 
seating preferences in a naturalistic setting. 

* Received in the Editorial Office on February 11, 1972, and published immediately
at Provincetown, Massachusetts. Copyright by The Journal Press. 
B. Method


Since observations were to be made under natural conditions, the cafeteria 
in the Student Union Building on the campus of Texas Tech University was 
chosen as a site where data collection could be carried out inconspicuously. The 
enrollment at the university approached 20,000, thus insuring an adequate 
turnover of people within the cafeteria. Certain limitations were imposed 
upon the data collected. First, only dyads were observed, since the arrange- 
ments were not as clear for groups comprised of a larger number. Second, 
observations were recorded only for square tables, since the arrangement was 
often difficult to ascertain for dyads seated at round tables. A third limitation 
was that only those groups were included that were formed after the experimenter
had arrived. This eliminated the possibility of the spatial arrangement
being determined by a person or persons leaving the table. 

The dependent measure, seating arrangement, was scored as either (a) two
people sitting across from each other, or (b) two people sitting diagonal to
each other. The sex composition of the dyad was recorded as two males, two
females, or a male and a female. The ages of the members were categorized
as (a) those clearly over 30, and (b) those clearly under 30. The ages of
the dyad members were estimated by the observer. Cases in which the observer
was not sure of the correct category were eliminated. Observations were
recorded from approximately 9 a.m. to 3 p.m. for a period of three months.
A total of 1073 dyads were observed.


C. Results


The observed frequencies for the 1073 records are presented in Table 1.
In order to determine the effects of age and sex composition, a number of χ²s
were computed. Significant overall effects were obtained for both sex composition,
χ² = 89.73 (p < .001), and age, χ² = 40.13 (p < .001). Male-male
and male-female dyads preferred a diagonal seating arrangement, whereas
female-female dyads preferred a face-to-face arrangement. With regard to
age, dyads comprised of individuals over 30 preferred a face-to-face seating
arrangement.


TABLE 1
SEATING PREFERENCES AS A JOINT FUNCTION OF THE
AGE AND SEX COMPOSITION OF THE DYAD

                               Number of dyads sitting
Age         Sex composition    Across        Diagonal
Under 30    Male-male          [illegible]   [illegible]
            Male-female        [illegible]   [illegible]
            Female-female      198           92
Over 30     Male-male          35            16
            Male-female        42            13
            Female-female      32            16


The effect of sex composition as moderated by age was then determined.
Significant effects were obtained for dyads under 30, χ² = 114.61, 2 df (p <
.001), but not for dyads over 30, χ² = 1.34, 2 df (N.S.). For the under 30
category, male-male and male-female dyads preferred a diagonal arrangement,
whereas female-female dyads preferred a face-to-face arrangement. For the
over 30 category, all dyads preferred a face-to-face arrangement.
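
As a check on the over-30 analysis, the sketch below computes an ordinary Pearson chi-square for the 3 x 2 table of over-30 frequencies given in Table 1 (across and diagonal counts for each sex composition). The function name is illustrative, and the use of an uncorrected chi-square of independence is an assumption; under it, the computation returns the reported value of 1.34 with 2 df.

```python
def contingency_chi_square(table):
    """Pearson chi-square test of independence for an r x c frequency table.

    Returns the statistic and its degrees of freedom.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return chi2, df


# Over-30 rows of Table 1: [across, diagonal] for
# male-male, male-female, and female-female dyads.
over_30 = [[35, 16], [42, 13], [32, 16]]
chi2, df = contingency_chi_square(over_30)
print(round(chi2, 2), df)  # 1.34 2
```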

If all possible seating arrangements are considered, by chance alone one
would expect a diagonal preference two-thirds of the time and a face-to-face
arrangement preference one-third of the time. These expected values were then
compared with the observed values for each possible combination of sex composition
and age. For the under 30 category, χ²s of .11, 4.21 (p < .05), and
248.35 (p < .001) were obtained for male-male, male-female, and female-female
dyads, respectively. These results suggest that male-female dyads sat
diagonally more often than expected by chance. Likewise, female-female dyads
preferred a face-to-face arrangement, whereas male-male dyads showed no
preference different from that expected by chance. For dyads over 30, χ²s of
27.02 (p < .001), 43.91 (p < .001), and 22.52 (p < .001) were obtained
for male-male, male-female, and female-female groups, respectively. In all
instances, a distinct preference for a face-to-face arrangement was exhibited.
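
The chance comparison can be sketched as a goodness-of-fit test. The two-thirds expectation follows if each square table seats one person per side (an assumption about the tables): once the first person is seated, two of the three remaining seats are diagonal and one is across. The continuity correction of 0.5 is also an assumption, since the text does not name it, but with it the over-30 frequencies in Table 1 yield the three reported values.

```python
def chance_chi_square(across, diagonal, p_across=1 / 3, correction=0.5):
    """Chi-square goodness of fit of observed seating counts against the
    chance expectation (one-third across, two-thirds diagonal).

    `correction` is a continuity correction subtracted from each absolute
    deviation; 0.5 is assumed here because it matches the reported values.
    """
    n = across + diagonal
    observed = (across, diagonal)
    expected = (n * p_across, n * (1 - p_across))
    return sum(
        max(abs(o - e) - correction, 0.0) ** 2 / e
        for o, e in zip(observed, expected)
    )


# Over-30 frequencies from Table 1: (across, diagonal).
over_30 = {
    "male-male": (35, 16),
    "male-female": (42, 13),
    "female-female": (32, 16),
}
for label, (across, diagonal) in over_30.items():
    print(label, round(chance_chi_square(across, diagonal), 2))
# male-male 27.02
# male-female 43.91
# female-female 22.52
```

Each statistic has 1 df, so every over-30 value far exceeds the .001 critical value of 10.83.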




D. Discussion 


The results suggest seating preferences to have been a joint function of 
both the sex composition of the dyad and the age of the individual members. 
For dyads whose members were under 30, male-male and male-female groups
preferred a diagonal arrangement, while female-female groups preferred a
face-to-face arrangement. For dyads whose members were over 30, no differ- 
ences in seating preferences were observed. All groups preferred a face-to-face 
arrangement. When compared with expected frequencies resulting from chance 
alone, the face-to-face arrangement was preferred with two exceptions. Male- 
female dyads under 30 exhibited a significant diagonal preference, while male- 
male dyads produced no significant deviations from the expected frequencies. 
The results of this investigation are difficult to compare with those of 
Sommer (2) who found male-female dyads to prefer a face-to-face arrange- 
ment. As indicated, the conditions under which the observations were collected 
are quite different. It is likely that such differences are responsible for the 
disparity of the results of these two studies concerning the effects of sex
composition.

If actual distance is used as a criterion for “closeness,” it would appear 
that female-female dyads preferred the more “distant” of the two possible 
arrangements. However, it seems possible that a “psychological closeness” 
resulting from eye contact may have been of more importance than physical 
distance. Argyle and Dean (1) have supported such an expressive eye contact 
hypothesis with the finding that mixed-sex dyads tended to minimize eye con- 
tact. Applied to the present context, such an hypothesis would suggest that 
the face-to-face arrangement represented the more intimate of the two. The
diagonal arrangement, by minimizing eye contact, consequently created the
most distance. This is given some measure of support by the finding that
male-female dyads preferred this type of arrangement. In any case, the
expressive eye contact hypothesis seems to hold some promise as a useful
construct in the explanation of seating preferences.


E. Summary 


The concern of the present investigation was to determine the effects of
sex composition of the dyad and the age of its members upon seating
preferences in a naturalistic setting. Records of 1073 dyads were obtained under
natural conditions in a university cafeteria. The results indicated that both
sex composition and age were significantly related to the observed preferences.
For the under 30 category, male-male and male-female dyads preferred a
diagonal arrangement, while female-female dyads preferred a face-to-face
arrangement. For the over 30 category, all dyads preferred a face-to-face
arrangement.

CONCEPTUAL TEMPO AND THE ORIENTAL AMERICAN*


Department of Educational Psychology, University of Hawaii 


HAROLD I. AYABE AND SUSAN SANTO


A. INTRODUCTION 


Chinese and Japanese Americans have to some extent achieved success in the 
American culture. The two values that the Japanese traditionally hold, “enryo” 
meaning reserved or restrained and “gaman” meaning patience or perseverance, 
and the Chinese traditional values of “keqi” (patience) and “rennai” (per- 
severance) may have contributed to the development of successful citizens. 

Conceptual tempo, a construct isolated by Kagan (2) which focuses on 
differential decision times in problem solving situations of high response un- 
certainty, is related to behaviors of reserve and perseverance. The instruments 
used by Kagan to measure the construct, such as the Matching Familiar 
Figures Test (MFF), categorize the subjects into two groups: the reflective 
(few errors, slow) and the impulsive (many errors, fast). Kagan, concerning 
the reflective child and perseverance and reserve, wrote, “First, the reflective 
child demonstrates higher standards for mastery of intellectual tasks and 
greater persistence with such tasks. . . [and] second . . . the reflective child 
typically stands on the sidelines the first few days . . . and watches the group 
vigilantly before joining them” (1, p. 156). 


B. METHOD


Thirty second graders (15 male and 15 female) of Japanese or Chinese 
extraction and 30 (15 male and 15 female) of other racial (Filipino, Hawai- 
ian, Samoan, or Portuguese) extractions attending Lanakila (public elemen- 
tary) School in Honolulu, Hawaii in the spring semester 1971, were randomly 
assigned to one of three conditions—fast, slow, and control. Ss in each group 
were tested individually with use of the children’s version of the MFF (1) 
and given the following instructions: 

Fast group MFF instructions. The experimenter said, “Go as fast as you 
can and point to the picture on this bottom page that is just like the one on 
this top page. . . . Remember to go as fast as you can. . . . Now let’s see how
fast you can point to the correct picture.” 


* Received in the Editorial Office on February 22, 1972, and published immediately
Slow group MFF instructions. E gave the same instructions as to the fast
group, but with reminders changed to “go slowly and think about your
answer. . . . Remember to take your time and think about your answer.”

Control group MFF instructions. E administered the MFF in the regular
manner prescribed by Kagan. 


C. RESULTS


The Japanese and Chinese American groups did not differ reliably from the
other racial extraction groups on the latency variables under any of the three
treatment conditions. However, the former produced significantly fewer errors
than the latter groups when instructed to “go as fast as you can” (F = 8.159,
df = 1/18, p < .01). There were no reliable differences between the two groups
when tested in the normal or slow conditions.

The finding of no differences between the groups except that the “other”
racial extraction children produced a decrement in performance under the
“fast as you can” instructions may be crucial to explaining the apparently
better performance by the Japanese and Chinese American group in standardized
exams. The California Achievement Tests Complete Battery (3), for example,
instructs the examinees to “work as fast as you can. . . .” Perhaps the greatest
bias of standardized tests against the “other” racial groups is the amount of
(or lack of) time given to complete exams.


D. Summary 


Thirty second graders (15 male and 15 female) of Japanese or Chinese ex- 
traction and 30 (15 male and 15 female) of other racial (Filipino, Hawaiian, 
Samoan, or Portuguese) extractions were randomly assigned to one of three 
Matching Familiar Figures (Kagan’s instrument for reflection-impulsivity) 
testing conditions—fast, slow, and control. It was hypothesized that the 
Japanese and Chinese traditional values of perseverance and reservedness 
would manifest themselves in the production of few errors and extended 
latencies. When the subjects were asked to “go as fast as you can,” Japanese
and Chinese American children produced significantly fewer errors than the
other racial group. No other significant differences were found.

DIFFERENTIAL EDUCATIONAL ASPIRATION LEVELS OF
MEN AND WOMEN UNDERGRADUATE STUDENTS*


Learning and Behavior Laboratory, Santa Clara, California; 
and San Jose State College 


THOMAS J. COATES AND MARA L. SOUTHERN


A. INTRODUCTION


Recently legislation has been pending before Congress to provide equal 
employment opportunities for women. In light of the fact that the percentage 
of women in the labor force has increased 244% from 1940 to 1964 (11), 
this legislation may not seem to be necessary. But the percentage of women 
in all professional and technical occupations decreased from 40% in 1950 to 
37% in 1968 (10). For the same period, the percentage of doctorate degrees 
awarded to women remained at 12% (8). Moreover, at the present time, 
women account for only 18% of the faculties in institutions of higher education
(4) and only 24% of the members of the American Psychological Association 
(3); in 1968 their earnings, on the average, were less than those of the male 
members (2). This phenomenon is even more striking within such psycho- 
logical subspecialties as industrial psychology (9). Thus, it appears that the 
call for change is justified, and an examination of the reasons for the lack of 
women in the professions, particularly in the field of psychology, is warranted.

One factor contributing to the dearth of women in academia is bias against
professional women. Kaley (6) reported that both outside and inside the
professional community definite negative attitudes exist toward the married
professional woman in her dual role. These may exist because the successful
professional woman, at least in psychology, deviates from traditional feminine
expectations both in personality and behavior (1). These biases seem to be
translated into hiring practices. Lewin and Duchan (7) found that when
equally qualified applicants of both sexes were considered for an academic
position, a man would be chosen over a woman. It is not inconceivable that
this might generalize to graduate student selection practices.

Another discriminating factor may be the aspiration levels of women them- 
selves. If the charge that women are socialized to achieve lower levels carries 
any validity (4), then indeed one would expect that women would have lower 
aspiration levels than men. This present study is addressed to an investigation 
of this phenomenon as a contributing factor to the fact that women are
underrepresented in the professional academic community. In addition, three
measures were used to ascertain if ability differences between men and women
might not also account for either differential aspiration levels or the
systematic exclusion of women from academia.


B. METHOD
1. Subjects


All students enrolled in an upper division course required for the B. A. 
psychology major at a large state college from Spring of 1969 through Spring 
of 1971 participated as Ss. The sample included one group from a six-week 
summer session, the other students having taken the course during the regular 
15-week semester. A total of 364 undergraduates (198 males and 166 females) 
completed the course and were included in the sample. Sixty-six graduate 
students were also enrolled in the course, but were not included in the sample. 


2. Instruments 

A paper-and-pencil questionnaire was employed to determine each S’s highest 
degree aspiration and other demographic data. The ability measures included 
two common tests of verbal facility, the Concept Mastery Test (CMT) and 
the Wechsler Adult Intelligence Scale (WAIS), and performance on course 
criterion measures (three midterm examinations and one final examination), 
as this has been found to be a valid predictor of graduate school performance 
(3). Each midterm consisted of two equally weighted parts: a take-home essay 
section and an in-class objective section. The final examination was a
comprehensive 90-item in-class multiple choice examination. The raw scores on all
examinations were converted to a normalized standard score with a mean of 
100 and a standard deviation of 20. A summation of each S’s standard scores 
on the four examinations determined his final grade. This same summation 
score was used for comparison purposes in this study. 
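
The scoring scheme just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the raw scores are hypothetical, the function name `to_standard_scores` is invented for the example, and a simple linear rescaling is used, whereas the "normalized" scores mentioned above may have involved an additional normalizing transformation that the report does not describe.

```python
from statistics import mean, pstdev


def to_standard_scores(raw_scores, target_mean=100.0, target_sd=20.0):
    """Linearly rescale raw scores to the given mean and standard deviation."""
    m = mean(raw_scores)
    sd = pstdev(raw_scores)  # population SD of this examination's raw scores
    if sd == 0:
        raise ValueError("all raw scores are identical; cannot standardize")
    return [target_mean + target_sd * (x - m) / sd for x in raw_scores]


# Hypothetical raw scores for five students on the four examinations.
exams = [
    [52, 61, 70, 74, 83],  # midterm 1
    [40, 55, 58, 62, 75],  # midterm 2
    [66, 68, 71, 80, 90],  # midterm 3
    [45, 60, 63, 72, 85],  # final examination
]
standardized = [to_standard_scores(exam) for exam in exams]

# Composite score per student: the sum of that student's four standard scores.
composites = [sum(scores) for scores in zip(*standardized)]
print([round(c, 1) for c in composites])
```

Because each examination is rescaled to a mean of 100, the composite scores of a whole class average 400 under this scheme.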


3. Procedure 

The CMT and the demographic questionnaire, both group paper-and-pencil 
measures, were administered during regular class sessions. The examinations 
were administered during regular class sessions covering specified assigned 
material. In each section offering of the course, the sequence of testing was the 
same. To prevent biasing of results, the examinations were scored by the 
instructors without identifying individual student papers. The WAIS was
administered outside of class time by one of several skilled student examiners.


Because of administrative difficulties, some Ss did not take the WAIS.
C. RESULTS AND DISCUSSION


The reported educational aspirational levels of both male and female
undergraduates are presented in Table 1 in both frequencies and proportions.
The significant χ² suggests that some degree of association exists between sex
and aspiration. A closer examination of the proportions reveals that a larger
proportion of females planned to terminate their education with the attainment
of a baccalaureate or masters degree, while a larger proportion of males aspired
to the doctorate level. An examination of the discrepancies between expected
and observed frequencies in the contingency table reveals that the observed
frequencies for the men at the B. A. and the M. A./M. S. levels are below
the expected frequencies, while the reverse is true for women. Moreover, at
the doctorate level the expected frequency for women is well above the
observed frequency, while the opposite holds for men.


TABLE 1
DIFFERENTIAL EDUCATIONAL ASPIRATION LEVELS OF MEN AND WOMEN
UNDERGRADUATE STUDENTS

                          Educational aspiration level
Sex                       B.A.     M.A./M.S.     Ph.D.

Males
  Observed frequency      68       84            46
  Proportion              .34      .42           .23
  Expected frequency      75.6     89.7          32.6
Females
  Observed frequency      71       81            14
  Proportion              .43      .49           .08
  Expected frequency      63.4     75.2          27.4

Note: χ² = 14.456, p < .001.
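
The test of association behind Table 1 can be retraced with a short Python sketch. The observed counts below are those implied by the group sizes (198 males, 166 females) and the expected frequencies in Table 1; treating them as the exact printed counts is an assumption, and the function name `chi_square_independence` is illustrative only.

```python
def chi_square_independence(rows):
    """Pearson chi-square statistic for a table given as a list of rows."""
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    n = sum(row_totals)
    stat = 0.0
    for row, row_total in zip(rows, row_totals):
        for observed, col_total in zip(row, col_totals):
            # Expected count under independence of sex and aspiration level.
            expected = row_total * col_total / n
            stat += (observed - expected) ** 2 / expected
    return stat


# Columns: B.A., M.A./M.S., Ph.D. (counts assumed as described above).
males = [68, 84, 46]
females = [71, 81, 14]

chi2 = chi_square_independence([males, females])
df = (2 - 1) * (3 - 1)  # (rows - 1) * (columns - 1)

# With 2 df, a statistic above 13.82 is significant at p < .001.
print(round(chi2, 2), df, chi2 > 13.82)
```

Under these assumptions the statistic comes out at roughly 14.5, in line with the reported value of 14.456; the small gap presumably reflects rounding in the original computation.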

Differences in verbal facility between the sexes on both the CMT (males: 
M = 64.94; females: M = 64.45; t = .147) and the WAIS Full Scale IQ 
(males: M = 117.72; females: M = 116.97; t = .801) were not significant. 
With use of the composite score which determined each S’s final grade as a
criterion, females (M = 402.84) were found to outperform males slightly
(M = 397.94) on the course criterion measures, but this difference was not
significant (t = .737).

Taken together, these data provide empirical verification for Graham’s (4)
contention that the underrepresentation of women in academia is to be
accounted for not only by discrimination, but also by the fact that women set
lower goals for themselves. This they do in spite of the fact that they appear
as equally intellectually capable as men of pursuing professional education.
These data may also suggest that it is not sufficient to legislate against
discrimination or change the attitudes of those in decision-making positions,
although both of these are important, but it is also necessary to change the 
attitudes of women themselves. Of course, a change in the latter condition 
may be partially contingent on a change in the former two conditions. 


D. SUMMARY


In an investigation of the factors accounting for the underrepresentation 
of women in the academic professions—in this case, psychology—198 male and 
166 female undergraduates were compared on three predictors of educational
success (the Concept Mastery Test, the Wechsler Adult Intelligence Scale, 
performance on course criterion measures) and on educational aspirations. 
The women were found to have lower educational aspirations than the men, 
although they appeared equally intellectually capable as the men of pursuing 
graduate education. Thus it appears that not only discrimination, but also 
the aspiration level of women themselves accounts for the marked lack of 
women in the academic professions.

BEYOND THE GROUP’S MYTHOLOGY*


Department of Educational Psychology, The University of Texas at Austin 


EARL A. KOILE AND JUNE GALLESSICH


A. INTRODUCTION 


In psychotherapy, whether group or individual, an essential process is the 
reality testing of accumulated beliefs and expectations. Ezriel (3), for
example, regards the testing of unconsciously determined fears to be the essence
of therapy. Whitaker and Lieberman (7, p. 262) emphasize that the “. . .
crucial therapeutic experience for the individual involves the discovery that
feared consequences do not occur, . . . that the impulses which he has
disguised, suppressed, or ignored can be recognized and sometimes expressed
without exposing himself to damaging consequences.” Gibb and Gibb (5, p. 
45) suggest that for a person to change he “. . . must have deep experiences 
that disconfirm his fears and distrusts, and he must rebuild his life theory in 
a way that is more congruent with his trust experiences and trust assump- 
tions.” 

Therapy groups, encounter groups, and training (T) groups, like indi- 
viduals, operate on assumptions, fantasies, and expectations about the group, 
the individual members, the leader or therapist, and the experiences that un- 
fold within the group. Many of these assumptions and expectations are 
untenable, do not square with the realities of the group experience. In order 
to understand the dynamics of the group process and of individual members, 
group therapists and encounter group leaders strive to make explicit the 
assumptions of the group and to examine and conceptualize group behavior 
in a variety of ways. Bion (1) hypothesizes that the group assumes that (a)
there is dependency upon the leader, (b) members must fight or flee, and (c)
the group has met for sexualized pairing. Whitman and Stock (8) applied
focal conflict theory (4) to a homeostatic theory of group dynamics,
attributing group behavior to two dynamic and conflicting forces: the disturbing
motive and the reactive motive. The group, in struggling with various focal 
conflicts, is engaged in a continuous process of achieving, losing, and struggling 
to regain its equilibrium. The “nuclear focal conflict” is trust versus mistrust. 

Here we want to explore group behavior in terms of the latent belief
systems of its members. We call the belief systems of a group and of
individual members myths. The notion of the myth is important because it
emphasizes the fallibility of belief systems and the need to re-examine them
continuously in the light of new experiences. The usefulness of each myth
stems from the challenge it presents for a reality-testing process, not from
verifying its initial validity.


* Received in the Editorial Office on March 2, 1972, and published immediately at
Provincetown, Massachusetts. Copyright by The Journal Press.

Looking at a group in terms of its mythology is also helpful in identifying 
and examining major themes. Often what the group members are saying and 
doing adds up to a central theme which may be characterized by a word, 
phrase, or sentence. Words and actions, for example, may be manifestations 
of such beliefs as: “Isn’t it good to be here with so many interesting nice 
people?” Then later: “We really must be careful not to embarrass ourselves 
and others by revealing too much.” Still later: “This is a dangerous place to
be; people could get hurt here.” And finally: “We can be extremely different, 
expose ourselves and one another, and still be cared for here.” The statements 
represent, in turn, “whistling in the dark,” cautiousness, fear and distrust, 
and risk taking and trust. The deepest underlying theme is a concern for 
psychological safety. Thus we can think of manifest behavior in the group 
as representing themes. Group myths and belief systems, however, represent 
latent attitudes that constitute the basis for the prevailing and recurring 
themes. The characteristics and uses of a group’s mythology may be more
readily understood if some common and recurring therapy and encounter
group myths are examined.


B. COMMON GROUP MYTHS
1. Authority Myths 


Early and pervasive group myths are related to authority issues. The most 
casual observer can see the power invested in the leader by the beginning group. 
Increasing silence, expectant or reproachful glances toward the leader, and 
other signals for him to get things started suggest dependency. The belief 
system or myth is: The leader is omnipotent; only he knows what to do. The 
corollary to the belief that the leader is extremely powerful is the belief that 
the group members are relatively helpless. 

Members behave differently in response to the myth of leader omnipotence.
Some members deny or minimize their strengths and find comfort in attributing
to the leader compensatory heroic power and in regarding him as the sole source
of reward and punishment. When a climate of helplessness, dependence, and
deference is pervasive, the latent group statement is: “He is king and superior.”
While this legend pervades the group, some members experience a healthy
need to seek more independence, to test the limits of the leader’s control, and
to discover a measure of freedom for themselves and others. Still other 
members may secretly suspect and hope that the leader is incompetent for the 
task, lacking the strength and knowledge to lead. Thus the myth of leader 
omnipotence may represent only one of two poles. The other pole is: The 
leader is impotent; we do not need him. Actually, we would be better off 
without him. The corollary of “the impotent-leader” myth, then, is the belief 
that members of the group are autonomous, independent, or could be if he 
were not present. Assertive behavior directed at the group leader may reflect 
a healthy or reality-testing approach or a maladaptive coping style. 

Leader behavior may confirm the group’s mythology. A leader may, for 
example, need to cultivate the mythology of his own omnipotence, and con- 
trol the group through paternalism, excessive ground rules, or directed 
activities. Members then may react with exaggerated dependency or counter- 
dependent, openly hostile behavior. Some leaders are able to use the reactive 
behavior to help free members from authority hangups. Leaders with relatively 
low power needs, either because of a crisis or group pressures, may feel com- 
pelled to take a controlling stance. The risk of leader take-over, whatever the 
cause, is that once the leader has forcefully used power, members tend to 
abandon responsibility for their own growth and for contributing to the 
growth of others. 

Just as leader behavior may confirm the myth of his omnipotence, his
behavior may also confirm the members’ myth of his impotence. He may, for
example, become so ensnared in group tensions that he becomes immobilized;
personal needs may lead him to take permissive or passive roles which result in
members’ escaping productive interaction.

Group preoccupation with authority and dependency themes produces
valuable therapeutic material. Our assumption is that through the process of
eliciting, explicating, and reality-testing the authority and dependency myths
of the group, members will emerge with greater freedom and responsibility.
A freeing experience for the group is the discovery that the leader is another
human being, not king. Whitaker and Warkentin (6, p. 244) describe the
“. . . ‘death of the prototype’ as the means by which the patient becomes able
to inherit his own adequacy.”

Authority problems with the leader are resolved in the process of recognizing
him as neither omnipotent nor impotent. If the leader is dealt with
successfully as an authority person within the group, he is no longer regarded as
the sole source or perhaps even the most important source of power, whether the
power is to reward, punish, or nourish member growth. The process through
which the myth of his power is understood and broken can hopefully be
generalized and applied in the resolution of authority problems that members
may have outside the group.


2. Intragroup Relationship Myths 


Conflicted emotions are evident as group members struggle to resolve the 
complex and ever-changing issues of distance and intimacy in relationships 
with one another. Recurring questions are: “How do I want to be known and 
how do I want to know others? How much involvement with different in- 
dividuals and with the group dare I risk?” These are the questions faced by 
most people as they weigh the possibilities of safety and alienation associated 
with distance against the promise of joy and threat of pain in interpersonal 
intimacy. 

After a group moves beyond superficial niceties and members begin
encountering one another at deepening levels, a common myth born out of fear
and fostered as a bulwark against involvement is: We are so different, have so
little in common, it is impossible to form close deep relationships with one
another. This myth of uniqueness reflects common anxieties that are usually
kept private early in the group. The myth wanes as members begin to
experience enough trust and openness to discover their common anxieties and fears.

As members begin to form relationships, the polarity of the uniqueness 
myth may become dominant: There are no important differences in this group; 
we feel the same things; we like one another. Boris (2, p. 96) describes this 
group fantasy: “. . . each member dreams that the other will join him in 
unanimity of viewpoint and a community of intention. This will, in turn, 
give him several fine experiences. It will prevent any apprehensiveness lest 
the other members, individually or collectively, gang up against him or in 
rivalry with him.” The myth of uniqueness is a form of resistance used to 
avoid further risk taking and may manifest itself in either distancing behavior 
or simulated intimacy. 

The leader’s task is to help the group get beyond both of these defensive 
myths and to build an atmosphere in which both similarities and differences 
are examined, accepted, and sometimes nourished. But such a climate is difficult
to achieve and maintain, for the work of the group inevitably involves tension 
and conflict. Hopefully the disruptions lead to deeper levels of integration, but 
complete and final integration is never achieved. 

Group cleavages appear and reappear; common bases for them are differences
in race, religion, sex, age, or career field. Fears based on differences become 
exaggerated. For example, in a college faculty encounter group a cleavage 
developed between teachers of liberal arts and teachers of technical disciplines 
when, in fact, differences that did exist were more related to individual
personalities. The myth that was dispelled was: We cannot relate to “them”
because we and our disciplines are fundamentally different. 

The myriad of differences that becomes manifest in the group can be used 
to seek deeper member understanding of one another rather than remain 
divisive barriers. Political differences—radicals versus conservatives—may be 
seen more sharply, understood, and accepted as legitimate or discovered to be 
masks and relatively unimportant in the learning tasks of the group. Stereo- 
types that talkative and quiet members impose on one another may become 
the basis for exploring and discovering the nature of giving and taking and 
for understanding the uses of both words and silences for either involvement 
or withdrawal. Members of both sexes may discover that a man can show 
tenderness and tears without loss of masculinity and that a woman can show 
strength and tenacity without loss of femininity. 

While groups work to get beyond cleavages, members may experience 
moments of loneliness and despair. Fears of contact with deep emotions lead to 
myths that shield members from one another. Janet, who had remained on the 
perimeter of an intense group, was confronted by Don, bothered by her
narcissism and noninvolvement. A turbulent confrontation ensued. She lashed out
verbally; her defenses crumbled, and she spilled out tormented dreams and 
childhood degradations. The group, visibly shaken by her psychological naked- 
ness and violent emotions, tried to move away from deep involvement by denial 
of her suffering and hatred. The myth they tried to invoke: We must protect 
against this deep exposure and pain which might destroy her—and us. The 
reality for which the therapist worked in exploding the myth: Experiencing 
the depth of pain and anguish and ultimately owning and sharing repugnant 
aspects of self can be therapeutic. Through such shared experiences members 
discover their resilience, strength, and tolerance for both the dark and bright 
sides of themselves and one another. 

The therapist is a significant model in enabling the group to discover and 
use belief systems about distance and intimacy in interpersonal relationships. 
Rather than modeling right or wrong, however, he models his own beliefs and 
behavior in the way he relates to members, whether distantly, intimately, or 
both. Moreover, he models the processes by which he and others can validate 
effective relationships, try new kinds of relationships, and change relationships. 


3. Communication Myths 


Closely associated with relationship mythology are communication myths 
which limit the range and mode of group expression. A common group assump- 
tion is that it is not good or helpful to tell people when one is angry. Voicing 
angry feelings may violate a mythical norm which, in essence, is: To express
anger is bad, is to be out of control, and therefore, shows weakness. Anger
may be harmful and may lead to feelings of shame and guilt. Once the anger 
of some members has been expressed without apparent damage, the polarity of 
the “anger is bad” myth may prevail: To show anger is good, is human, and 
provides a healthy release. Anyone who does not show anger is covering up his 
feelings, is not genuine. The goal of the therapist is to help members to own 
their anger and express it in ways that yield a constructive interaction within 
the group, in contrast with indiscriminate suppression or expression of it.

Closely related to stigmatizing or overly prizing the expression of anger is 
the tendency of groups to invoke myths to govern the conditions under which 
members share impressions and give feedback to each other. The particular 
myths that control the giving and receiving of impressions and feedback vary 
from member to member. A common one is: Other members will not like me 
if I let them know my negative feelings about them; consequently, I can win 
approval and preserve our relationship by expressing positive and ignoring 
negative reactions. This belief, however, runs head on into the realization 
that: I cannot know how others truly see me and feel about me unless they 
are able to share both positive and negative reactions; yet I am not giving 
others the benefit of both. Work of the group includes assisting each member 
in checking out the kind of feedback and reactions he is able to share with 
other members, and to square this with the kind he wants for himself. 

Self-disclosure—how much, what kind, when, for what purpose—is a central 
group concern that also involves intimacy-distance and risk-trust issues. One
self-disclosure myth prevalent at one time or another is: No one should share
private secrets and thereby risk being exposed, misunderstood, hurt, or
burdening others with his problems. The myth that represents a polarity of
extreme guardedness is: Everyone must share secrets or at least some weaknesses.
Confession and soul-baring are the price of admission to this group. 

The issues in self-disclosure are manifold and complex. How open need a 
member be in order to get reactions from others, gain self understanding, and 
form close relationships? One goal of the therapist is to assist individual mem- 
bers to discover their own guidelines for what they choose to reveal about 
themselves, and to use whatever is revealed to understand one another as well 
as the significance of the group’s self-revelation norms and myths. A goal for
both individual members and the group is to move toward greater authenticity;
consequently, genuineness in what members reveal is more important than
attempts to reveal everything.
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4. Viability Myths 

An important concern during the course of the group experience is group 
viability. Questions that may become dominant early, later as the group 
develops, and when termination time approaches are: Will we make it as a 
group? Will we be a good group? How can we make it last? 

A myth that becomes apparent as the group haltingly and painfully begins 
is: We can never make it as a group. It may not be worth trying. Some mem- 
bers bring specific problems for which they seek solutions; others feel intense 
loneliness and hunger for close personal relationships, human connectedness. 
Fulfillment of their needs may seem utterly impossible as the group stumbles 
in apparent disarray, finding no well-marked path. The leader knows that the 
frustration, fumbling, risk taking, and attendant fears and frustrations are 
essential to making it as a group. His sharing this knowledge is less likely to 
help than his patience and competence in assisting members to share the 
feelings they are experiencing in these early moments. In the sharing, mem- 
bers begin to form the relationships necessary for movement. 

After the early sessions members begin to get a feel for their own behavior 
and for the group as fluid but functioning, and to see growth potential. In this 
second phase they may become less concerned with group viability than with 
using the group. Setbacks or thrusts forward, however, may bring out varied 
feelings, such as: “We are not as good as we thought.” Or, “We are quite 
good, better than we thought.” If the group gets caught up with either under- 
valuing or overvaluing itself, its underlying myths might be variations of 
these statements: We are not making it. We were succeeding, but now we are 
failing. We are succeeding beyond expectations, but it cannot be trusted. The 
main dynamic underlying preoccupation with these group evaluations is avoid- 
ance and resistance to further, deeper involvement through risk-taking be- 
havior. A task for the therapist is to help members to discover, experience, 
and own their underlying feelings, whether insecurity, fear, or whatever. The 
process of discovering and owning their feelings breaks the risk-trust logjam 
with which the group may be stuck. If the therapist is frustrated with the 
group’s lack of productivity, he may feel that his only recourse is to clobber 
members for their failure. This may help the therapist, and it may also help the 
group if the therapist will work, perhaps with member assistance, to experience 
and own the feelings underlying his punitive actions. In such an instance, 
the therapist’s risk-taking behavior, not the group’s, may break the risk-trust 
impasse and start the group back to work. 

In contrast with open groups, in closed groups with a predetermined time 
for termination many members face difficulties accepting the loss of the 
experience of intimacy, with its joy and pain, as the group approaches its end. 
Anxiety over the impending separation often leads to fantasies in which the 
group continues in a happy cohesive state and to ingenious schemes for con- 
tinuing or reconvening the group on special occasions. The central dynamic 
of these myths is denial. In groups where members have experienced deep 
involvement and intensive encounters, the feeling at the end may be: This is 
the most significant experience of my life. Born out of the fear of loss and the 
need to make the experience last may be this myth: We have shared and been 
through so much together, we must and will always keep in touch with one 
another; we will meet again from time to time and recapture and enjoy what 
we have worked so hard to achieve here. 

Underneath a dominant myth denying termination of the group may be 
such individual beliefs as: “I must try to hold onto this experience, savor it, 
not let the feelings of closeness to myself and to others get away. If I try to 
share it outside the group, others may devalue it. No one outside the group 
could possibly understand it.” Beliefs that the experience is exclusive, can in 
no way be shared, or may be devalued by outsiders are in part true. They 
also betray fear that the relationships and mountain-top feelings will not last. 

While the members may focus on the viability of the group, on keeping it 
intact, the leader strives for the viability of the experience. He works for the 
generalization of the group processes outside and beyond the life of the group, 
to help members discover that they can find similar relationships and experi- 
ences in their ongoing lives. Just as outside experiences are brought into the 
group and made relevant for members to learn about themselves and one 
another, the processes and learning experiences within the group may be taken 
outside and applied in other settings with other people. Recreating authentic 
relationships is possible if members are willing to experience anew the painful 
as well as the pleasurable processes by which the group achieved its goals. The 
therapist can help members understand that significant outcomes are achievable 
through their own work, not through shamanism, as they struggle to test out 
belief systems along the way. 


C. MYTH-THEME VARIATIONS 


Experienced group therapists and encounter group leaders discover the com- 
monalities in group experiences and can predict the dominant themes and 
underlying myths within their groups. Idiosyncratic conditions peculiar to each 
group, however, may yield diverse and unique myths and determine how and 
when the more common ones appear. For example, group membership is sometimes 
homogeneous in ways that determine theme choice. Themes of dependency 
and hostility were covertly expressed in a group composed largely of 
unmarried, psychosexually immature men. The most pervasive theme in a group 
of first year seminarians was guilt. The cycle became one of confession, punish- 
ment, and forgiveness. 

Urgent concerns of some members may establish themes early that might 
otherwise never appear. The punitive mother was the motif of a group led 
by a female therapist. Two dominant and articulate female members with 
unresolved conflicts in maternal relationships became preoccupied with this 
theme, which tapped common myths of authority and punishment and, there- 
fore, captured the energies of the entire group. The myth, We must be very 
protective, pervaded another group and was traceable to early disclosures of 
serious personal problems by several members. Early exposure within the group 
seemed to foster the myth: We are fragile and must be treated gently. The 
overly protective group response created an atmosphere that prohibited the 
honesty and directness needed for work. A confrontation that showed the 
group that it also had strength and resiliency dissolved the myth of fragility. 


D. SUMMARY 


We have suggested that people can learn to function better if they experi- 
ence a process of testing and experiencing the consequences of their assumptions 
and beliefs. We have called member and group belief systems myths, whether 
they turn out to be valid or distorted at any given moment in the group. Dis- 
covering and examining the group’s mythology are means by which leaders 
and members can understand both individual and group behavior. 

We examined characteristics and uses of myths with respect to authority 
issues, intragroup relationships, communication among members, and group 
viability. Most dominant myths appear to be bipolar—e.g., we are completely 
unique, we are all alike; or the leader is omnipotent, he is impotent. The 
reality-testing process, therefore, involves working between the polarities. 
Groups and leaders who get hung up on either extreme become myth-bound 
and may flounder on irrelevant dynamics and topics. Still, intensive focus on 
myths that members and leaders hold onto tenaciously may be the most growth-
producing process if a breakthrough can be achieved. The mythology of a group 
is dynamic, not static. What is mythical at one moment may be reality the 
next in the sense that the leader and member perceptions are more in harmony 
with their experiences. Members and leaders learn to check their beliefs and 
expectations continuously, for vital learning is in the moment-to-moment pro-
cess rather than a once-and-for-all discovery. A never-ending task for the 
leader is to assist members to learn the processes by which they test out myths 
and to apply what they learn to life beyond the group. 


INTERPERSONAL ATTRACTION, POPULARITY, SIMILARITY 
OF PERSONAL NEEDS, AND PSYCHOLOGICAL AWARENESS* 


Iowa State University 


CHARLES A. POE AND DAVID H. MILLS¹ 


A. INTRODUCTION 


The study of one person’s attraction for another has been a “late bloomer” 
in the psychological literature. As late as 1956 Newcomb stated that 


. . . there exists no very adequate theory of interpersonal attraction. . . . 
It is almost as if we, like our lay contemporaries, assumed that in this 
special area the psychological wind bloweth where it listeth and that the 
matter is altogether too ineffable, and almost passeth even psychological 
understanding (11, p. 575). 

Since Newcomb’s admonition, much research has been reported in the gen- 
eral area of interpersonal attraction. In a review of the literature between 1950 
and 1962, Lott and Lott (9) cited nearly 300 articles; Sherman (14) has 
cited 150 additional articles subsequent to 1962. It has been well documented 
that the attraction of two persons increases as the similarity of their attitudes 
increases (e.g., 1, 2). Less evidence has been accumulated supporting the 
relationship of interpersonal attraction and similarity of personality, partic-
ularly when the latter had been measured by a personality instrument (e.g., 
6, 7, 8, 13). One objective of the present study was to assess the relationship 
of interpersonal attraction and similarity of personality. 

Another objective of the present study was to assess the relationship between 
interpersonal attraction and awareness of others. It appears almost a foregone 
conclusion that to be attracted to another person the perceiver must know the 
other person to some extent. Maucorps’ (10) findings that attractive persons 
are more accurate in perceiving others would suggest a positive relationship 
between interpersonal attraction and awareness of others. However, little 
research is available bearing directly on the relationship of interpersonal 
attraction and awareness of others. Therefore, this relationship was assessed. 

It was expected that certain “person-oriented” needs which were measured 
by the Edwards Personal Preference Schedule (EPPS) (5), namely affiliation, 
nurturance, and succorance, would facilitate the perception of interpersonal 
attraction. Since interpersonal attraction could not be directly related to 
specific needs, it was necessary to assess the distribution of scores that resulted 
from transforming the interpersonal attraction choices from sociometric data. 
From the association of needs and popularity, the relationship of needs to inter- 
personal attraction was to be inferred. 

Another relationship to be assessed was that between popularity and self-
awareness. Self-awareness, presumably, could facilitate one’s effective inter-
action with others and, thereby, influence the individual’s perceived popularity. 
The findings of Taft (16) that good judges of self have greater social orienta-
tion provide some support for this hypothesized relationship between popular-
ity and self-awareness. 

The primary objective of the present study was to compare the relative 
association of interpersonal attraction to personal needs versus the interaction 
of personal needs. More specifically, the goal was to determine whether a 
subject’s interpersonal attraction choices were more related to the personal 
needs of a selected peer, or to the similarity of needs between the subject and 
the peer. This comparison of the relative association of interpersonal attraction 
to personal needs versus the similarity of personal needs has not been ex- 
plicitly made in previous research (e.g., 1), but was of primary interest in the 
present study. 


B. METHOD 
1. Subjects 


The subjects were 67 undergraduate women at Iowa State University 
affiliated with a campus sorority. The group included 50 “active” members 
who lived in the sorority house and who were reasonably well-acquainted with 
each other, and 17 members of the “pledge class,” well-acquainted with each 
other but not well-acquainted with the “active” members. Because of these 
differences in acquaintance, the data from the two groups were collected and 
analyzed separately; however, after ascertaining the similarity of the two 
groups and standardizing the relevant data, the data were combined. 


2. Procedure 


At a first testing session, subjects selected two peers in the sorority whom 
they perceived at each of three levels of interpersonal attraction (“close,” 
“distant,” or “in-between”); subsequently, the EPPS, an inventory of 15 
personal needs, was administered. At a second session the subjects estimated 
the rank order of their own needs; their estimates were based on the 15 needs 
as they are defined in the EPPS manual. Also, the subjects estimated the needs 
of two of their peers whom they had selected at each of the three levels of 
interpersonal attraction. 


3. Measures 


In order to test the hypotheses of the present study the following variables 
were measured: interpersonal attraction, popularity, similarity of personal 
needs, awareness of others, self-awareness, and personal needs. As noted above, 
personal needs were assessed by the EPPS. The instrumentation of the present 
study required two measures of attraction to others. To differentiate these 
two measures the term interpersonal attraction was used to refer to “liking for 
others”—i.e., feeling close to another person—and the term popularity was 
used to refer to “liking by others,” or perceived closeness of a person in her 
peer group. As such, interpersonal attraction referred to individual choices, a 
subject’s choices of her peers according to her felt closeness to them. Popular-
ity, on the other hand, reflected group choices, a subject’s perceived closeness 
felt by the entire group.² 

The measure of interpersonal attraction was assessed by requiring each sub-
ject to select peers whom she judged at varying degrees of “closeness” to her-
self. It was assumed that each subject’s selection of peers, on the basis of her 
felt closeness to them, willingness to confide in them, and desire to share inti-
mate feelings with them, was an operationalization of interpersonal attraction. 
Presumably, those variables associated with the subject’s behavioral choices 
(i.e., peers judged as close, distant, or in-between) would reflect upon and 
clarify the nature of interpersonal attraction. 

Popularity was assessed by a second transformation of the original judg- 
ments of interpersonal attraction: close, distant, and in-between. The results 
were a standardized popularity score for each subject, indicating each sub- 
ject’s perceived closeness by the entire group of peers. The popularity score 
was obtained by summing the frequency of each subject’s selection into one of 
the attraction categories by her peers and assigning numerical weights to each 
category (1 for “close,” -1 for “distant,” and zero for “in-between”). 
These weights were summed and normalized (z scores). 
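
The scoring just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the nomination data, the subject labels, and the helper name `popularity_z_scores` are hypothetical, and the use of the population standard deviation is assumed, since the text says only that the weighted sums were normalized to z scores.

```python
from statistics import mean, pstdev

# Weights stated in the text: +1 for "close", -1 for "distant", 0 for "in-between".
WEIGHTS = {"close": 1, "distant": -1, "in-between": 0}


def popularity_z_scores(nominations, subjects):
    """Sum the weighted nominations each subject receives, then standardize.

    nominations: iterable of (chooser, chosen, category) tuples (hypothetical layout).
    subjects: all subject labels, so that a subject nobody chose still scores 0.
    """
    raw = {s: 0 for s in subjects}
    for _chooser, chosen, category in nominations:
        raw[chosen] += WEIGHTS[category]
    values = list(raw.values())
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population SD; the paper does not say
    if sigma == 0:
        return {s: 0.0 for s in subjects}
    return {s: (raw[s] - mu) / sigma for s in subjects}


# Invented example data, for illustration only.
subjects = ["A", "B", "C", "D"]
nominations = [
    ("A", "B", "close"), ("A", "C", "distant"), ("A", "D", "in-between"),
    ("B", "A", "close"), ("B", "C", "distant"),
    ("C", "B", "close"), ("D", "B", "close"), ("D", "C", "in-between"),
]
scores = popularity_z_scores(nominations, subjects)
```

In this invented example, subject B is chosen as “close” three times and so receives the highest standardized score, while subject C, chosen twice as “distant,” receives the lowest.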

Similarity of personal needs was assessed by correlating a subject's raw 
scores for the 15 EPPS needs with the same EPPS raw scores for a peer whom 


2 Since popularity scores were based on the same data as interpersonal attraction, 
the two variables were highly related. When an individual’s judgments of her peers 
in categories of interpersonal attraction were compared to the group’s judgment of 
popularity, 82 percent of the comparisons showed complete agreement. 
she had selected as close, distant, or in-between. The resulting correlation coef-
ficients estimated the similarity of needs of a subject and her peer. For each 
subject there resulted six coefficients indicating her similarity of needs to the 
two peers she had selected as close, the two selected as distant, and the two 
selected as in-between. These coefficients were rank-ordered in terms of their 
magnitude, and the ranks were compared across the interpersonal attraction 
categories (close, in-between, and distant) by means of a Friedman two-way 
analysis of variance; this comparison assessed the relationship of interpersonal 
attraction and similarity of personal needs. 
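
A minimal Python sketch of this analysis follows. Several details are assumptions rather than the authors' documented procedure: the product-moment correlation is assumed for profile similarity, the two coefficients within each attraction category are assumed to have been averaged before ranking, no tie correction is applied to the Friedman statistic, and all function names and data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def rank_descending(values):
    """Rank values so the largest gets rank 1; tied values share the mean rank."""
    order = sorted(range(len(values)), key=lambda i: -values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def friedman_chi_square(rows):
    """Friedman statistic for n subjects (rows) by k conditions (columns).

    Each row is ranked within itself; no correction for ties is applied.
    """
    n, k = len(rows), len(rows[0])
    rank_sums = [0.0] * k
    for row in rows:
        for col, r in enumerate(rank_descending(row)):
            rank_sums[col] += r
    return 12.0 / (n * k * (k + 1)) * sum(s * s for s in rank_sums) - 3.0 * n * (k + 1)


# Invented similarity coefficients per subject: [close, in-between, distant].
rows = [
    [0.62, 0.31, 0.05],
    [0.48, 0.22, -0.10],
    [0.55, 0.40, 0.12],
    [0.30, 0.18, 0.02],
]
chi_square = friedman_chi_square(rows)  # referred to chi-square with k - 1 df
```

Ranking the largest coefficient as 1 follows the note to Figure 1, where a lower numerical rank means greater similarity. In practice each entry of `rows` would come from `pearson_r` applied to two 15-need EPPS profiles.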

Awareness of others was assessed by correlating a subject’s estimate of the 
rank order of a peer’s 15 EPPS needs and the rank order of that peer’s EPPS 
needs that had been determined from the peer’s EPPS raw scores. For 
each subject there resulted six coefficients indicating her awareness of others; 
i.e., her accuracy in estimating their needs. To assess the relationship of 
interpersonal attraction and awareness of others, the six coefficients that 
resulted for each subject were ranked in terms of magnitude, and the ranks 
were compared across the three levels of interpersonal attraction by means of 
a Friedman two-way analysis of variance. 
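
The awareness measure can be illustrated in the same hedged way. The text does not name the coefficient used; the sketch below assumes a rank-order (Spearman-type) correlation, computed as a product-moment correlation between the estimated ranks and the ranks derived from the peer's raw scores. The function names and the five-need example are hypothetical (the EPPS itself has 15 needs).

```python
from math import sqrt


def mean_ranks(scores):
    """Rank scores so the highest gets rank 1; tied scores share the mean rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def awareness_of_other(estimated_ranks, peer_raw_scores):
    """Correlate a rater's estimated rank order with the peer's score-derived ranks."""
    return pearson_r(estimated_ranks, mean_ranks(peer_raw_scores))


# Invented raw scores for five needs, for illustration only.
peer_raw = [20, 15, 12, 9, 5]
accurate = awareness_of_other([1, 2, 3, 4, 5], peer_raw)
inverted = awareness_of_other([5, 4, 3, 2, 1], peer_raw)
```

A rater who orders the peer's needs exactly as the raw scores do obtains a coefficient of 1, and a rater who inverts the order obtains -1. The same function would serve for the self-awareness measure, applied to a subject's estimates of her own needs.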

To assess the relationship of popularity and specific personal needs, the 
popularity scores were correlated with the 15 EPPS needs. To assess the 
relationship of popularity and self-awareness, these two variables were cor- 
related. 

To achieve the primary objective of the present study, the method was 
largely inferential. The objective was to assess the relative association of inter-
personal attraction to personal needs versus the interaction of personal needs. 
In other words, it was of interest to determine whether a subject’s inter-
personal attraction choice of a peer was related more to the similarity of the 
subject’s and the peer’s personal needs or to the pattern of needs manifested by 
the peer. Since personal needs were assessed psychometrically as traits, and 
interpersonal attraction was assessed sociometrically as individual choice be-
havior, it was not possible to compare the continuously distributed needs 
directly to the trichotomous interpersonal attraction choices. Therefore, it 
was necessary to transform the interpersonal attraction choices to continuously 
distributed data; there resulted a measure of the group’s preference for each 
subject, a psychometrically measured popularity score for each subject that 
could be correlated with personal needs. Since popularity scores were a trans-
formation of interpersonal attraction choices, popularity and interpersonal 
attraction showed high correspondence.³ It was reasonable to infer, therefore, 


3 See footnote 2. 
that the relationship of interpersonal attraction to psychometrically measured 
needs could be little different from the relationship of popularity and needs. 
Then, after the preceding inference was made, the relationship of inter-
personal attraction to personal needs could be logically compared to the 
relationship of interpersonal attraction and the similarity of personal needs. 


C. RESULTS 
1. Interpersonal Attraction and Similarity of Needs 


Interpersonal attraction, the degree of perceived closeness of a peer by a 
subject, was related to the similarity of the subject’s need pattern to the need 
patterns of her selected peers in the three attraction categories, as may be seen 
in Figure 1. The degree of similarity of needs differed significantly across the 
three attraction categories [.05 level, Friedman two-way analysis of variance 
(15)]. Subjects’ need patterns correlated more highly with the need patterns 
of peers whom they selected as “close” than they correlated with the need 
patterns of peers whom they selected as “distant.” Thus, as hypothesized, 
there was a significant positive relationship between interpersonal attraction 
and similarity of personal needs; i.e., the more attraction was reported, the 
more similar subjects’ actual need patterns were. 

FIGURE 1 
INTERPERSONAL ATTRACTION AND SIMILARITY OF NEEDS 
Note: The lower the numerical rank, the greater the similarity of needs. 


2. Interpersonal Attraction and Awareness of Others 


Interpersonal attraction was also related to awareness of one’s peers; i.e., 
the accuracy in estimating their need patterns, as may be noted in Figure 2. 
The degree of awareness of others differed significantly across the attraction 
categories [.001 level, Friedman two-way analysis of variance (15)]. Sub-
jects estimated the need patterns of peers whom they selected as “close” more 
accurately than they estimated the need patterns of peers whom they selected 
as “distant.” Therefore, the data supported the hypothesized positive relation-
ship between interpersonal attraction and awareness of others. 

FIGURE 2 
INTERPERSONAL ATTRACTION AND AWARENESS OF OTHERS 
Note: The lower the numerical rank, the greater the awareness of others. 


3. Popularity and Personal Needs 


Popularity, reflecting a subject’s perceived closeness by the entire peer 
group, generally showed low correlations with the 15 EPPS needs, though two 
significant relationships were observed: with exhibition (r = -.20; p < .05) 
and autonomy (r = .22; p < .05). These correlations would account for 
minimal variance, and therefore, it was concluded that popularity was un-
related to personal needs as measured by the EPPS. 
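
The remark about minimal variance can be checked by squaring the two reported coefficients, since the squared correlation gives the proportion of variance two measures share. The short Python sketch below does only that arithmetic; the dictionary layout and variable names are illustrative.

```python
# Correlations of popularity with two EPPS needs, as reported in the text.
correlations = {"exhibition": -0.20, "autonomy": 0.22}

# Squared correlation = proportion of variance shared with popularity.
variance_explained = {need: r ** 2 for need, r in correlations.items()}

for need, share in variance_explained.items():
    print(f"{need}: {share:.1%} of variance")
```

Each need thus shares roughly 4 to 5 percent of its variance with popularity, which is the sense in which the relationships are treated as negligible.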


4. Popularity and Self-Awareness 


The hypothesized positive relationship between popularity and self-aware-
ness was not supported. The correlation of popularity and self-awareness 
(i.e., the accuracy of estimating one’s own needs) was not significant (r = 
.01). Though self-awareness was not related to popularity, it is of interest 
to note, as may be seen in Figure 2, that self-awareness did appear to be more 
accurate than awareness of others. 

As noted in section 1 above, interpersonal attraction was significantly 
related to similarity of personal needs. Also, popularity was only minimally 
related to personal needs. Since interpersonal attraction and popularity were 
highly correspondent, it was concluded that interpersonal attraction could be 
no more than limitedly related to personal needs. Therefore, since inter- 
personal attraction was found to be significantly related to similarity of needs, 
but was inferred to be only limitedly related to the magnitude of specific 
needs, it was concluded that interpersonal attraction was more related to an 
interaction effect, the similarity of needs. 


D. DISCUSSION 
1. Interpersonal Attraction and Similarity of Needs 


As noted previously, the relation of interpersonal attraction and the sim- 
ilarity of individual characteristics has been documented rather well. Pri- 
marily, the evidence bears upon the relationship of similarity of attitudes and 
interpersonal attraction, and only secondarily upon the relationship of per- 
sonality traits and interpersonal attraction (e.g., 1). The findings of the 
present study are significant in that they document the relatively few studies 
(e.g., 6, 13) that have shown that similarity of personality traits, as measured 
by personality instruments, is related to interpersonal attraction. Izard (7) 
also showed that such personality similarity was an antecedent to inter- 
personal attraction choices, suggesting that personality similarity was related 
not only to the status of interpersonal attraction at a given point in time, 
but also that personality similarity may have been a significant influence in 
the developmental process of interpersonal attraction. Byrne (2, 3, 4) in 
similar investigations has found interpersonal attraction related to similarity 
of attitudes. Both Izard and Byrne discussed their findings with Newcomb’s 
(11, 12) rationale that similarity of characteristics affords greater interper-
sonal attraction by increasing the likelihood of reciprocation in mutually sat-
isfying rewards. Byrne (2) further specified that similarity enhances inter-
personal attraction because of a learned cultural drive to be logical, to make 
correct interpretations of the environment. Agreement from others, which 
similarity facilitates, is perceived as rewarding, and the person affording this 
reward is perceived positively. Izard (6, 7) extended Newcomb’s rationale by 
suggesting that similarity increases the probability of mutually satisfying ex- 
periences and, further, enhances perception and communication with the net 
effect being a positive affect, which in turn, contributes to the cohesiveness of 
the interacting pair. The results of the present study support the relationship 
of similarity and attraction, as suggested by Newcomb, but neither support 
nor disconfirm the extended rationales suggested by Byrne and Izard. 


2. Interpersonal Attraction and Awareness of Others 


Interpersonal attraction was significantly related to awareness of others’ 
needs. As noted previously, this relationship has received little investigation. 
Subjects more accurately predicted the need patterns of peers who were 
selected as “close” than the need patterns of peers selected as “distant.” A 
straightforward rationale for the relationship of interpersonal attraction and 
awareness of others is that as one is more attracted to certain peers, he will 
tend to interact more with them and thereby know them better than in- 
dividuals he is less involved with. However, it has been noted (12) that the 
relationship of interpersonal attraction and awareness of others might be 
confounded, since some evidence indicates that interpersonal attraction may 
influence perception so that a better-liked person is perceived as more similar 
to the subject than he, in fact, is. In another view in this vein, Maucorps 
(10) observed that attractive persons were more accurate in perceiving others 
than unattractive persons. Either of these lines of research would indicate a 
complex relationship between interpersonal attraction and awareness of others. 
In order to assess appropriately the interrelationships of interpersonal attrac-
tion, awareness, and similarity of personality, it is obvious that adequate ex-
perimental controls must be utilized in future research. 


3. Popularity and Personal Needs 


It was concluded that popularity was unrelated to the EPPS needs. As 
noted above, this finding was of particular interest in conjunction with the 
observed relationship of interpersonal attraction and similarity of needs. In 
sum, it was inferred from the data that the perceived attractiveness of a 
ratee was more related to the similarity of the two persons, rater and ratee, 
than to the specific needs of the ratee. 


4. Popularity and Self-Awareness 


Popularity was not related to self-awareness. A subject’s accuracy in esti- 
mating her own needs was not related to her popularity as perceived by her 
peer group. Though subjects generally predicted themselves more accurately 
than they predicted their peers, their ability to predict themselves accurately 
seemed to be unrelated to how popular they were perceived to be by their 
peers. As a result of the correspondence of interpersonal attraction and popu- 
larity, it was inferred that self-awareness could be no more than minimally 
related to interpersonal attraction. 


5. Interpersonal Attraction and Needs Versus Similarity of Needs 

The primary objective of the present study was to compare the relative 
association of interpersonal attraction and personal needs to the relationship 
of interpersonal attraction and similarity of personal needs. As previously 
noted, interpersonal attraction was found to be significantly related to simi- 
larity of personal needs. Also, it was concluded that popularity was unrelated 
to personal needs. Since interpersonal attraction and popularity were highly 
correspondent, it was inferred that interpersonal attraction could be no more 
than minimally related to personal needs. Therefore, it appeared that a sub- 
ject’s interpersonal attraction choice of a peer was more related to the sim- 
ilarity of needs of the subject and the peer than to the personal needs mani- 
fested by the peer. This inference would suggest that interpersonal attraction 
is primarily associated with an interactive social process and could be no more 
than secondarily associated with the effects of individual characteristics per se. 
Thus, the data of the present study would suggest that “the interpersonally 
attractive person” as a general condition specified by a unique pattern of per- 
sonal needs may not exist, since the general condition that was found to be 
associated with a subject’s interpersonal attraction choices was the similarity 
of needs of that subject and a peer. This greater relative importance of the 
interaction of personality characteristics, rather than the direct effects of per- 
sonality characteristics per se, has not been explicitly clear in previous research 
but is clearly suggested in the present study. 


E. SUMMARY 


Sixty-seven undergraduate college females associated with a campus sorority 
participated in the present study. Measures were obtained for the following 
variables: interpersonal attraction, popularity, personal needs, similarity of 
personal needs, awareness of others, and self-awareness. Selected relationships 
between these measures were assessed. Interpersonal attraction was signifi-
cantly related to similarity of personal needs (p < .05); a subject’s percep-
tion of attractiveness of a peer was significantly related to the similarity of 
the personal needs between the subject and that peer. Interpersonal attraction 
was significantly related to awareness of others (p < .05); a subject’s per-
ception of attractiveness of a peer was significantly related to the accuracy 
with which the subject estimated that peer’s needs. The popularity of a sub- 
ject was not related to her personal needs or to her self-awareness. With 
respect to the primary objective of the present study, it was concluded that 
interpersonal attraction was more related to similarity of needs between a 
subject and a peer than to the personal needs of the perceived peer. There- 
fore, the interpersonally attractive person, as specified by a unique pattern of 
personal needs, may not exist, since it appears that interpersonal attraction 
is more related to an interactive social process, the similarity of needs. 


KATHLEEN DEWING AND NOREEN KENNEALY 


A. INTRODUCTION 


When subjects are presented with paired stimuli, they experience more 
than their mere co-occurrence. If A and B are visual stimuli, they are per- 
ceived in some relation to each other; A may be underneath, next to, darker, 
or longer than B, and subsequent presentation of the stimulus A may redin- 
tegrate the idea of the whole stimulus compound R(A, B) rather than B 
alone. 

The effect of relations between component parts of the stimulus was ne- 
glected in traditional models of learning, but in recent years attempts to 
explore the problem have been initiated by Asch (1), by Horowitz and 
associates (4, 5, 6), and by Rozeboom (10, 11). 

Rozeboom (11) reported a study that was designed “to explore the prob- 
lem of redintegration through study of a related question . . . what happens 
when S’s cue for recalling part of an experience is not just another detached
component thereof, but that component embedded in the relational framework
which previously united those elements?” (p. 623). Ss were trained
on a set of stimulus displays consisting of rectangular frames with male and 
female names in each corner, and tested for recall with a test stimulus con- 
sisting of the test frame, one or two of the four components of the training 
display, and a dotted blank in one of the remaining corners. Patterns of 
errors were analyzed, and it was evident that Ss recalled stimulus components 
in the correct position but from incorrect display cards more frequently than 
could be expected by chance. Rozeboom argued that errors of this kind
demonstrate that information about the position of the stimulus was learned 
in the training trials and utilized in the recall task. A similar position effect 
has been reported by Asch, Hay, and Diamond (2). 

As the particular relational framework in the Rozeboom experiment was
a rectangle and the components its four vertices, it seems that the memory
task depended to some extent upon the structuring of perceptual space with 
use of horizontal and vertical coordinates. Piaget (7) has shown that this 
structuring increases with age. He says, “The perceptual space of the small 
child is less structured along horizontal and vertical coordinates than that of 
the adult . . . With increasing age, there is constantly a broader and more 
far reaching frame of reference as a function of the establishment of per- 
ceptual relations. This leads to an ever increasing qualitative contrast between 
horizontals and verticals” (pp. 140-141). His recent memory studies (8) have
shown that what Ss remember is influenced by their current perceptual and 
cognitive schemas. 

The progressive structuring of space as concepts of verticality and hori- 
zontality develop could influence the occurrence of the position errors in recall 
found by Rozeboom in young adults. The present study was planned to 
examine age-related differences in the pattern of these errors. Four age groups 
were compared on a task similar to that used by Rozeboom, with outline 
drawings of familiar objects substituted for the proper names used in his 
stimulus displays. 


B. METHOD
1. Subjects 


Four groups, each of 40 Ss, were used, as follows:

Group 1: Kindergarten children, 15 males and 25 females; mean age 4 
years 7 months; range 3 years 10 months to 5 years 2 months. 

Group 2: Lower primary school children, 16 males and 24 females; mean 
age 7 years 7 months; range 6 years 6 months to 8 years 1 month. 

Group 3: Upper primary school children, 26 males and 14 females; mean 
age 12 years 10 months; range 11 years 8 months to 13 years 3 months. 

Group 4: First year university students, 19 males and 21 females; mean 
age 18 years 10 months; range 16 years 6 months to 38 years. 


2. Apparatus 


a. Stimulus materials. The stimulus materials for the training trials consisted
of four white 6” X 6” sheets of paper, on each of which were drawn
four objects in thick black ink. The stimulus objects, one in each corner
of each page, are shown in Figure 1, as follows: stimulus display 1 (clock,
cat, tree, chair); stimulus display 2 (flat, yacht, key, hand); stimulus display 3
(duck, gun, spade, cup); stimulus display 4 (star, tap, brush, fish). The sheets
for Groups 1 and 2, which were tested individually, were mounted on 6” X 6”
white cardboard. For group testing (Groups 3 and 4) they were assembled
into booklets.


FIGURE 1
THE FOUR STIMULUS DISPLAYS


b. Test materials. The test materials consisted of four numbered 6” X 6” 
white pages, like the training booklets. The first page of each test booklet 
contained a portion of the stimulus display on page 1 of the training booklet
together with a dotted blank in place of one of the missing parts (see 
Figure 2). The portion of the stimulus reproduced on the test page was 
always that from the upper left hand corner of the corresponding stimulus 
display (the fixed cue). 


3. Procedure 
a. Individual testing (Groups 1 and 2). There were six steps in the 
training test sequence.

FIGURE 2
TEST PAGES FROM TRAINING BOOKLETS SHOWING A FIXED CUE AND
EACH OF THE THREE POSITIONS TESTED


1. Ss were shown each of the four stimulus display cards for eight 
seconds and told to learn and remember the names of the pictures on each
card. To control for stimulus-specific or order effects, the 24 possible orders
were counterbalanced across Ss.

2. Each S was presented with one four-page test booklet and required 
to name the picture that had been in the corner indicated by the dotted line. 
The four pages were exposed in turn until S had responded or until 12
seconds had elapsed, when an omission would be recorded. Each test booklet
tested one position (top right, bottom left, or bottom right). The six possible 
orders of the three test positions occurred with equal frequency across all Ss. 

3. The four stimulus display cards were presented again as for Step 1, 
in the same order. 

4. Each S was presented with the second test booklet.

5. The display cards were presented again as for Steps 1 and 3.

6. Each S was presented with the third test booklet. 
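
The counterbalancing described in Steps 1 and 2 can be illustrated with a short sketch. It enumerates the 24 possible orders of the four display cards and the six possible orders of the three test positions. The card labels, the position names, and the cycling assignment rule are assumptions made for illustration; the report does not say how orders were actually assigned to individual Ss.

```python
from itertools import permutations

# Hypothetical labels for the four display cards and the three tested corners.
CARDS = (1, 2, 3, 4)
POSITIONS = ("top right", "bottom left", "bottom right")

def counterbalanced_orders(items, n_subjects):
    """Give each subject one ordering by cycling through all permutations.

    This is an assumed assignment rule, not the authors' documented scheme.
    """
    orders = list(permutations(items))
    return [orders[i % len(orders)] for i in range(n_subjects)]

card_orders = list(permutations(CARDS))          # all orders of the four cards
position_orders = list(permutations(POSITIONS))  # all orders of the three positions

# One card order and one position order per S in a group of 40.
assigned_cards = counterbalanced_orders(CARDS, 40)
assigned_positions = counterbalanced_orders(POSITIONS, 40)

print(len(card_orders), len(position_orders))
```

With a cycling rule of this kind, each order is used as evenly as the group size allows.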

Instructions for individual testing were as follows: 


Now we are going to play a kind of game. I have here four cards
(cards displayed facing away from S). On each of these cards there are
four pictures. I shall show you the cards one at a time, and what I want
you to do is to see if you can learn and remember the names of the
pictures on each card. When you have looked at each of the cards, I
shall then show you one of these booklets. In the booklet there will be a
picture which was on one of the cards. As well as the picture there will
be a dotted line in one of the other three corners. I want you to tell me
what the picture was in the corner where the dotted line is. There will
be four pages like that. We shall look at the four cards and then fill in
the booklet three times. You may guess if you don’t think you can remember.
Do you want to ask me any questions?


b. Group testing (Groups 3 and 4). The instructions for the group testing 
were as follows: 


On your desks you will find four booklets. Do not open them until I
tell you. One of these is marked “Learning” and the others are marked
“Test.” In the booklet marked “Learning” there are four white pages
numbered 1 to 4. On each of these pages there are four pictures. I want
you to learn and remember the names of the pictures. You will be given
eight seconds (Group 3) or six seconds (Group 4) to look at each page.
When I say “Start,” you are to turn to page 1. When I say “Turn over”
you are to turn to page 2, and so on until you have looked at and tried
to learn each of the four pages. I will then ask you to put Test Booklet
No. 1 in front of you, but do not open it. In this booklet you will also
find four white numbered pages, but on each of these pages there will
only be one picture. It will always be in the upper left hand corner and
will be a picture which appeared in the upper left hand corner on one
of the pages of your “Learning” booklet. On each page you will also find
a dotted line. I want you to write on the dotted line the name of the
picture which appeared in that corner with that picture in your “Learning”
booklet. You will be given 48 seconds to fill in the four pages, and
I shall not say “Turn over” for each page as I did with the “Learning”
booklet. Please fill in the pages one after another and guess if you cannot
remember. We shall do this, that is, look at and learn the “Learning”
booklet, and then fill in one test booklet, three times. Are there any
questions?

Ss were then instructed to open their “Learning” booklets and to turn 
over the pages at eight-second intervals (Group 3) or six-second intervals 
(Group 4). They were then told to close their “Learning” booklets and place 
Test Booklets No. 1 in front of them, but not to open them. They were 
reminded that they would be given 48 seconds to fill in the four pages, with- 
out being told to turn over the pages, and were asked to fill in the pages one 
after another, to work as quickly as possible, and to guess if they could not 
remember. This procedure was repeated three times. 


C. RESULTS


Eight different types of responses could be made. The number of different
items that qualified as an instance of each error type is shown in parentheses
following each response category:

A: correct recall (one instance possible).
B: item from same display, other than fixed cue (two instances).
C: item from same position in other display (three instances).
D: item of different position, other than fixed cue, from other display (six instances).
E: fixed cue from other display (three instances).
F: repetition of fixed cue from same display (one instance).
G: outside intrusion (i.e., item not on any display).
H: omission.

The total number of responses in each category for the different age groups
is shown in Table 1.


TABLE 1
TOTAL NUMBER OF RESPONSES IN EACH RESPONSE CATEGORY FOR EACH AGE GROUP

                              Error type
Age groups         A     B     C     D     E     F     G     H

Kindergarten      44    36    58    73    67    15    25   162
Grade 2           70    59    66    59    26     -    14   186
Grade 7          270    28    58    28    13    15    15    53
Adult            250    34    46    35    17     -     9    89
Total            634   157   228   195   123    30    63   490


Although an attempt was made to equate task difficulty by varying the 
learning trial presentation times, it is clear that the task was much less diffi- 
cult for the older groups. On many trials the younger children did not 
respond at all and were therefore scored as making Type H errors (omissions).
However, there is still a considerable difference between age groups
even when correct responses are considered as a proportion of the total number
of actual responses made, excluding omissions. There are, of course,
interpretation problems when number of correct responses is used as a performance
measure in studies using very young children. In this study, poor
performance may be due to other factors besides memory ability per se because 
of lack of experience in formal testing situations—such factors as poor motiva- 
tion, distractibility, difficulties in following instructions, shyness, etc. 

Because the task was not of comparable difficulty for the different age 
groups, in the analysis of results errors classified as omissions, intrusions from 
outside the stimulus displays, and simple repetition of the fixed cue were not 
included, and only errors representing the retrieval of an item from the set 
of the 16 stimulus objects shown on the display cards were considered. The 
pattern of errors made by the Ss on the occasions when they recalled (in- 
correctly) one member of this set was examined in the total group and in the 
different age groups. 


1. Types of Error 


The number of possible responses that qualified as an instance of each 
error type was not constant. Comparisons, therefore, were made between 
raw error frequencies divided by the number of different response items that 
counted for that entry, yielding the corresponding “per unit” error frequency 
utilized by Rozeboom (11). 
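
As a worked illustration of the “per unit” correction, the sketch below divides the raw error totals from the Total row of Table 1 by the number of different items that count as an instance of each error type. It uses group totals only, whereas the significance tests reported here were presumably run on per-subject values. The pooling of Types D and E (summed errors over summed instances) is an assumption.

```python
# Raw totals (Total row of Table 1) for the error types that retrieve
# one of the 16 stimulus objects.
raw_totals = {"B": 157, "C": 228, "D": 195, "E": 123}

# Number of different items that qualify as an instance of each error type.
instances = {"B": 2, "C": 3, "D": 6, "E": 3}

def per_unit(raw, n_items):
    """Raw error frequency divided by the number of qualifying items."""
    return raw / n_items

per_unit_freq = {k: per_unit(raw_totals[k], instances[k]) for k in raw_totals}

# Assumed pooling rule for Types D and E: summed errors over summed instances.
pooled_de = (raw_totals["D"] + raw_totals["E"]) / (instances["D"] + instances["E"])

for error_type, value in per_unit_freq.items():
    print(f"Type {error_type}: {value:.1f} errors per qualifying item")
print(f"Types D and E pooled: {pooled_de:.1f} errors per qualifying item")
```

On these totals the per-unit frequencies for Types B and C are roughly double those for Types D and E, which is the direction of the comparisons reported in this section.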

As in the Rozeboom study, Ss were much more likely to make correct
position errors (i.e., intrusions from the same position on another display) than
intrusions from a different position.

The “per unit” error frequency was significantly greater for Type C errors 
than for Types D and E combined (t = 5.99, p < .001).

Ss were also likely to make more intrusion errors from the wrong position 
on the same display card than from the wrong position on other display cards. 
The “per unit” error frequency was significantly greater for Type B errors 
than for Types D and E combined (t = 5.01, p < .001).


2. Age Differences in Position Errors 


The number of Ss who made more errors involving items from the position 
marked by the test blank in other displays than items from a different position 
was calculated for each age group. These frequencies together with the num- 
ber of Ss making fewer correct position than other position errors are shown 
in Table 2.

TABLE 2
ERROR TYPES AND AGE GROUPS

                  Number of Ss making more       Number of Ss making
                  Type C errors than, or as      fewer C errors than
Age groups        many C errors as, B, D,        B, D, and E errors
                  and E errors

Kindergarten                 9                           31
Grade 2                     15                           25
Grade 7                     25                           15
Adult                       21                           19


The tendency to make position errors was not significantly greater for the
younger primary children than for the kindergarten group (χ² = 1.488), nor
was there a significant difference between the older primary and young adult
groups (χ² = .818). However, there was a highly significant increase in the
proportion of position errors made by the two older groups combined over
that made by the two younger groups combined (χ² = 10.18, p < .001).
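
The comparisons between adjacent age groups are 2 X 2 chi-square tests on the frequencies in Table 2. The following is a minimal sketch of that computation, with an optional Yates continuity correction. The function name and the `yates` flag are illustrative, and the text does not state whether a correction was applied, so both variants are shown rather than one being presented as the authors' procedure.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Pearson chi-square for the 2 x 2 table [[a, b], [c, d]] (df = 1).

    With yates=True, half the total N is subtracted from |ad - bc|
    before squaring (Yates continuity correction).
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0.0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denominator

# Rows are age groups; columns are the two Table 2 categories.
kindergarten_vs_grade2 = chi_square_2x2(9, 31, 15, 25, yates=True)
grade7_vs_adult = chi_square_2x2(25, 15, 21, 19, yates=False)

print(round(kindergarten_vs_grade2, 3), round(grade7_vs_adult, 3))
```

The statistic is sensitive to the correction when cell counts are this small, so the choice of variant should be stated whenever such values are reported.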


3. Age Differences in Same Card Intrusion Errors 


Comparisons were also made between the four age groups on the number 
of same card intrusion errors compared with other errors. Although there was
a tendency for the frequency of this type of error also to increase with age, 
none of the differences between groups was statistically significant. 


D. DISCUSSION


Rozeboom (11) demonstrated that information about the compositional 
structure of the stimulus is utilized in recall. This finding has been confirmed
in the present study. In a task similar to the one described by Rozeboom, the 
most frequent error was an item from the correct position but from the wrong 
stimulus display. The results point clearly to a relative increase of this type 
of error with age. There is a very sharp increase between the ages of seven 
and 12. The fact that same card intrusion errors did not show a similar in- 
crease with age indicates that the increase in spatial intrusion errors is not a 
characteristic of all error types. 

It is of interest to note that Elkind (3), in a study of children’s conceptions
of right and left, found that it was not until the age of 10 or 11 that
abstract relational conceptions were established. Right and left for this age
group were abstract in the sense that they were dissociated from any particular
content; for younger children right and left were not relative positions
but absolute properties of things.

The results of the present study are consistent with Piaget’s general theoretical
position that recall is a function of the S’s operational level. Piagetian
research has demonstrated interaction between performance on perceptual,
memory, and problem solving tasks and the cognitive level of the individual,
with continuing reorganization of existing cognitive structures to accommodate
new experience. In particular, recent studies of memory (8) illustrate the
effect on recall of the acquisition of the principles of conservation and seriation.
It is suggested that the patterns of spatial errors revealed by the different
age groups in the present study may reflect the development of a
Euclidean type of reference system for structuring space, as the child masters
spatial concepts like right-left and vertical-horizontal. Although the age when
this type of reference system is established is not clear-cut (9), there does
appear to be some congruence in age level between the mastery of abstract
spatial concepts and the sharp increase in spatial errors found in the present
study.


E. SUMMARY


Four groups of 40 Ss, aged 4-5 years, 6-7 years, 12-13 years, and young 
adults, were trained on structured displays and tested on fragments of these 
structures. Patterns of errors were analyzed, and a significant tendency was 
observed to recall stimulus components in the correct position but from the 
wrong stimulus display, as reported by Rozeboom (11). The relative fre- 
quency of spatial intrusion errors of this type increased with age. The results 
were interpreted as consistent with Piaget’s theoretical position that recall 
is influenced by the cognitive level of the subject. It was suggested that progressive
structuring of space is reflected in the differential spatial errors of the
four age groups.

REPRESSION-SENSITIZATION AND DISCREPANCY BETWEEN
SELF-REPORT AND OFFICIAL REPORT OF ILLNESS** 


Department of Psychology, Saginaw Valley College 


ELIZABETH A. WILLETT AND MARYBETH HEILBRONN


A. INTRODUCTION 


The present study attempted to assess the relationship between self and 
official reports of illness and the characteristic way an individual reacts to 
stress. The Repression-Sensitization (R-S) scale (3, 4, 5) has been used to 
differentiate between two major defensive styles. Studies relating the R-S 
scale to measures of adjustment suggest that repressors are better adjusted than 
sensitizers (3, 5, 6, 11). Difficulties inherent in the design of such studies, 
however, raise at least two important issues. First, as pointed out by Byrne 
et al. (6), item overlap between the R-S scale and measures of adjustment 
used in many of the investigations could account for the findings. Another
serious limitation of these R-S adjustment studies is that repressors, disposed 
as they are to the use of avoidance, may deny they have problems and thus 
appear well-adjusted on a self-report measure. As Eriksen and Pierce (8) 
suggested: “What is needed is evidence to show that the repressors’ self- 
descriptions are at variance with the actual state of affairs” (p. 1035). Such 
evidence has recently been provided by Parsons, Fulgenzi, and Edelberg (14) 
and by Weinstein, Averill, Opton, and Lazarus (19). 

Parsons et al. (14) using a group discussion approach found that “sensi- 
tizers are less aggressive and not as affectively aroused at least as measured by 
skin conductance response. On the other hand, they rate themselves higher 
than repressors on aggressiveness and hostility but are not so rated by others” 
(p. 242). Evidence that both repressors’ and sensitizers’ self-reports are at 
variance with physiological indices of stress is provided by Weinstein and his 
associates (19). Their reanalyses of six experiments indicated that “repressors
showed relatively greater autonomic than self-report reactions to
stress, while sensitizers tended to show the opposite pattern of response” 
(p. 406). 

Considering illness as a reaction to stress (15, 20), the present study
attempted to ascertain if a somewhat similar interactional pattern exists between
R-S and self-official reports of illness. Do repressors report fewer incidents
of illness than is actually the case? Repressors generally deny their
anxiety or hostility, but reveal it autonomically, while sensitizers aggrandize 
their anxiety or hostility, but show less autonomic reactivity (14, 19). In 
view of the psychophysiological principle (11) and the role of prolonged 
autonomic reactivity in the production of bodily stress which can subsequently 
produce illness (15), the present study predicted greater incidence of illness 
as reflected in medical referrals for the repressors than for the sensitizers. 


Also, in consideration of the possible cruciality of discrepancy between self-reports
and objectively determined reports of repressors’ and sensitizers’ reactions
to stress, the following specific predictions were made.

1. Official records will show a significant number of referrals for repressors,
but not a significant number for sensitizers.

2. Fewer repressors will report medical visits than will be revealed in 
official records. 

Because the sensitizer has been described as being alerted to potential threat 
and conflict, responding readily with intellectual and obsessive defenses (14), 
additional predictions were made. 

3. There will be little or no difference between the number of sensitizers 
reporting medical visits and that revealed by official reports. 

4. More sensitizers than repressors will report medical visits.


B. METHOD

The revised R-S scale (5) was group administered to 80 first-year female 
students in a hospital school of nursing. Instructions were read by the senior
experimenter. Subjects whose scores fell in the top quartile were designated as
sensitizers; those in the lowest quartile, repressors. The mean R-S scores for
the sensitizers and repressors were 115.10 and 71.50, respectively. A self-report
personal information form was used to determine for a 16-week period
(1/1/71 to 4/23/71) the number of times medical attention was sought outside
the school and the nature of each illness, i.e., the reason for each visit.
From the school’s health services was obtained (for the same 16-week period)
information pertaining to the number of times each subject was referred for
medical treatment. The nature of each illness was also recorded from official
sources.


C. RESULTS AND DISCUSSION

As predicted, official records revealed that more repressors, 15, than sensitizers,
10, were referred for medical treatment. When these data were subjected
to binomial tests, it was found that the number of sensitizers referred
for medical treatment could have occurred by chance alone (X = 10; N = 20;
p = 1.00), whereas the number of repressors referred was significant (X = 5;
N = 20; p < .05), thereby supporting Prediction 1. The fact that official
records revealed a significant number of referrals for the repressors may be
indicative of a substantial relationship between repression and the seeking of
medical services. Such a finding is consistent with the traditional psychosomatic
position on the role of repression in psychophysiologic disorders. The number
and type of referrals made for repressors and sensitizers are presented in Table 1.

TABLE 1
NUMBER AND TYPE OF MEDICAL REFERRALS FOR
REPRESSORS AND SENSITIZERS

Referral to           Repressors    Sensitizers

School physician        8 [6]          8 [5]
Emergency               1 [1]          3 [3]
Dermatologist           4 [3]          0
Allergist               4 [?]          0
Podiatrist              3 [1]          0
Gynaecologist           1 [1]          0
ENT specialist          1 [1]          0
Immunologist            1 [1]          0
TB tests                0              2 [2]

Total                  23 [15]        13 [10]

Note: The number of subjects contributing to the total number of specific referrals
appears in brackets beside each type of referral.

Inspection of the table indicates that eight visits were made by subjects
from each group to the school physician. With the exception of three sensitizers
who went to emergency and two for tuberculin tests, all other referrals were
accounted for by repressors. Excluding the two repressors who were referred
to emergency and immunology, all other referrals were to specialists.
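
The binomial tests reported above for Prediction 1 can be sketched as follows. The sketch assumes a null proportion of .5 and a two-sided test obtained by doubling the smaller tail; neither detail is stated in the text, and the function name is illustrative. Under those assumptions, X = 10 of N = 20 gives p = 1.00 and X = 5 of N = 20 gives p < .05, in line with the reported values.

```python
from math import comb

def binomial_two_sided(x, n, p=0.5):
    """Two-sided binomial p-value: double the smaller tail, capped at 1.

    The null proportion p = 0.5 and the doubling rule are assumptions.
    """
    pmf = [comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]
    lower_tail = sum(pmf[: x + 1])   # P(X <= x)
    upper_tail = sum(pmf[x:])        # P(X >= x)
    return min(1.0, 2 * min(lower_tail, upper_tail))

# Sensitizers: 10 of 20 referred. Repressors: 15 of 20 referred,
# so 5 of 20 were not (the X = 5 used in the text).
p_sensitizers = binomial_two_sided(10, 20)
p_repressors = binomial_two_sided(5, 20)

print(f"sensitizers p = {p_sensitizers:.2f}, repressors p = {p_repressors:.3f}")
```

Because the null distribution is symmetric at p = .5, testing the 15 referred repressors or the 5 not referred gives the same result.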

In view of the postulated reciprocal relationship between psychological and 
physiological functioning (10), it seemed reasonable to assume that such a 
relationship could also obtain when physiological malfunctioning is under 
scrutiny. Within such a theoretical framework the higher incidence of illness 
demonstrated by the repressors can be viewed as a reflection of greater psychophysiologic
malfunctioning which was a reciprocation of the repression effects.
Further inspection of Table 1 not only shows that more repressors than sensitizers
were referred for medical treatment, but also reflects a greater number of
referrals for the repressors than sensitizers, 23:13. The fact that almost twice
as many referrals were made for repressors as for sensitizers lends further support
to the confirmation of Prediction 1 wherein a relationship between repression
and medical referral was hypothesized.

The second and third predictions were that there would be a significant
discrepancy between self and official reports of illness for the repressors, but
not for the sensitizers. The number of repressors who sought medical atten- 
tion as self-reported and as revealed by official medical records were four and 
15, respectively. The sensitizers reported seven visits, but official reports revealed
10. When subjected to chi-square tests, the discrepancy between self and
official reports of the repressors was found to be significant (χ² = 10.30; df =
1; p < .01); that of the sensitizers was not significant (χ² = .91). Therefore,
Predictions 2 and 3 were confirmed, indicating that repressors did not
accurately report medical visits, whereas the sensitizers did. This finding could 
be interpreted either as a failure on the part of the repressors to recall 
accurately the number of times they sought medical treatment, or the reflec- 
tion of the repressors’ view of the medical self-report as threatening, dis- 
posing them to avoidance, denial, and repression. Failure on the part of 
repressors to report the actual state of affairs is consistent with the findings of 
Parsons (14) and Weinstein (19) and their associates. Their results, as well 
as those of the present study, which pertain to repression phenomena, can be 
viewed as instances of avoidance training. As such, the significant discrepancy 
between self and official reports of illness for the repressors in the present 
study is consistent with the results of a recent investigation by Axtell and 
Cole (1). When their subjects were asked to discuss themselves, Axtell and 
Cole found that repressors, in contrast to sensitizers and neutrals, talked 
significantly less. In the present investigation the repressors reported less than 
was actually the case, whereas the sensitizers showed no significant discrepancy 
between self and official reports of illness. 

As predicted, more sensitizers, seven, than repressors, four, reported that 
they had sought medical services. When these data were subjected to chi-square 
analyses, however, no significant relationship was established (χ² = .82).
Although these results failed to confirm Prediction 4, they were in the pre- 
dicted direction with almost twice as many sensitizers as repressors having 
reported medical visits. Such a finding, although not significant, is compatible 
with an approach-avoidance view of repression-sensitization, as well as with 
the more traditional view that repressors are disposed to denial as a general 
trait (12) and are better adjusted than are sensitizers (3, 5, 6, 11). If in- 
cidence of illness can be considered a measure of adjustment, then it would 
appear that the higher incidence of illness as self-reported by the sensitizers 
provides some support for the earlier findings wherein it was established that 
on self-report measures of adjustment, sensitizers look psychologically less 
healthy than repressors (3, 5, 6, 11). On the other hand, when the official 
reports of 23 referrals for 15 repressors and 13 referrals for 10 sensitizers are 
considered, the earlier view that repressors are better adjusted is questioned.
If absence of illness is an appropriate indicant of adequate adjustment, then it
would appear that sensitizers rather than repressors are better adjusted. As 
Thelen (17) suggested, “Perhaps the R-S scale measures ‘adjustment’ as well 
as the tendency to approach or avoid stress” (p. 164). 

The possibility that sensitizers are better adjusted notwithstanding, there is 
an extraneous variable, social desirability, which may have contributed to the 
fewer reports made by repressors of medical services sought. With one ex- 
ception (7), measures of social desirability have been found to be consistently 
negatively correlated with the R-S scale (2, 9, 11, 13, 16). The possible con- 
founding effects of social desirability in the present study cannot be overlooked 
in view of the implicit negative sanctions that health service personnel placed 
on frequent requests for medical services. It is possible that repressors, who 
generally rank high on social desirability scales, delimited their reports of 
medical attention sought. Clarification of the role of social desirability in the 
reporting of illness by repressors and sensitizers awaits further research. 


D. SUMMARY


Self-reports of medical visits and official reports of medical referrals for 20 
repressors and 20 sensitizers were analyzed to test the following predictions: 

1. Official records will show a significant number of referrals for repressors, 
but not a significant number for sensitizers. 

2. Fewer repressors will report medical visits than will be revealed in 
official records.

3. There will be little or no difference between the number of sensitizers 
reporting medical visits and that revealed by official reports. 

4. More sensitizers than repressors will report medical visits.

The results provided confirmation of Predictions 1, 2, and 3. Prediction 4 
was not supported, although the results were in the predicted direction with 
almost twice as many sensitizers as repressors having reported medical visits. 
The higher incidence of officially established illness for the repressors was 
discussed in terms of the psychophysiological principle. Discrepancy between 
self and official reports of illness made by the repressors and accurate reporting 
by the sensitizers were viewed as instances of approach-avoidance phenomena. 
The repressors’ self-official discrepancy was also considered from a general 
defensive trait point of view. 

An important implication of the results as they pertained to Prediction 4 
is that sensitizers, rather than repressors, appeared better adjusted when 
absence of illness was used as an indicant of adjustment. Earlier reports (3, 
5, 6, 11) of sensitizers looking psychologically less healthy were cast in a new 
light.

PREDICTIVE VALIDITY OF “BONUS” SCORING FOR
COMBINATIONS ON REPEATED FIGURES TESTS 
OF CREATIVE THINKING* 


Department of Educational Psychology, The University of Georgia 
E. PAUL TORRANCE


A. INTRODUCTION


One of the more effective nonverbal tasks of the Torrance Tests of Crea- 
tive Thinking (2) has been the repeated figures task. One form of the test 
confronts the subject with two pages of circles, and alternate forms use 
parallel lines, squares, and triangles. The subject is asked to make as many 
objects or pictures as he can from these repeated figures and to think of 
objects or pictures that no one else will think of. Nowhere in the instructions 
is he forbidden to combine two or more figures to make an object or picture; 
neither is he instructed to do so. In fact, the mental set produced is against 
making such combinations. Thus, to make combinatory responses, a subject 
has to be free to go beyond the expectations generated by the instructions and 
to exploit other permissible approaches. Even during the early stages of the 
development of this task in its alternate forms, the author and his associates 
recognized such behavior as characteristic of creative persons. Yet we were 
uncertain about how to represent the behavior numerically in the form of a 
score. 

For a while the problem was evaded, since relatively few subjects combined 
two or more of the circles, squares, triangles, or parallel lines in this test task. 
As researchers began using this test task in experiments designed to increase 
skills in creative thinking, this combinatory or synthesizing kind of behavior 
became more frequent. A critical point was reached when a first grade 
teacher presented the author with the record of a six-year-old boy who had 
combined the circles on the first page to make an excellent Army tank and 
had used all of the circles on the second page to make a honeycomb with a 
beautiful and whimsical bee buzzing above it. She asked, “How do you score 
this?” The author’s immediate impulse was to say, “Who needs to score it? 
Can’t you tell that this boy has a powerful creative mind?” This did not, 
however, solve the researcher’s problem of providing objective, yet meaning- 
ful, quantification. 
During the early stages of the development of the test, it was difficult to 
obtain really meaningful validity criteria. All of the observations of the 
author and his associates told them that the children and young people who 
made combinatory or synthesizing responses behaved more creatively than 
their peers. On the basis of this conviction, the scoring manuals were revised 
in 1970 (3) to provide a system of bonus scoring for combinations of two 
or more figures. Two bonus points were awarded for combining two figures 
(circles and parallel lines); 5, for three to five figures; 10, for six to 10 
figures; 15, for 11 to 15 figures; and 20, for more than 15 figures. It has 
not been possible to validate this procedure on the basis of meaningful, long- 
range predictive validity data until the present time. The author will present 
in this paper the data for evaluating this scoring procedure. 
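
The bonus schedule described above can be written as a small lookup. The following Python sketch is illustrative only: the function name `bonus_points` is hypothetical and does not come from the scoring manual, and the zero bonus for a response using fewer than two figures is an assumption.

```python
def bonus_points(figures_combined: int) -> int:
    """Return the bonus for one response combining the given number of figures.

    Thresholds follow the schedule stated in the text. The function name and
    the zero bonus for fewer than two figures are assumptions of this sketch.
    """
    if figures_combined < 2:
        return 0    # no combination, so no bonus (assumed)
    if figures_combined == 2:
        return 2
    if figures_combined <= 5:
        return 5    # three to five figures
    if figures_combined <= 10:
        return 10   # six to 10 figures
    if figures_combined <= 15:
        return 15   # 11 to 15 figures
    return 20       # more than 15 figures


# Print the bonus at each boundary of the schedule.
for n in (1, 2, 3, 5, 6, 10, 11, 15, 16):
    print(n, bonus_points(n))
```

Checking the boundaries (2, 5, 10, 15) is the main point of the loop, since those are the values where a transcription slip in a scoring key would be easiest to make.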


B. METHOD 


The author initiated a major predictive validity study of the Torrance 
Tests of Creative Thinking in September, 1959, by administering a battery 
of seven of the test tasks to the total enrollment of the University (Min- 
nesota) High School (grades 7-12). Included in this battery was the Circles 
form of the repeated figures task. The specific instructions for the task were 
as follows: 


In 10 minutes see how many objects you can sketch which have a 
circle as the main element in their design. Just use a few lines on the 
circles below to identify your ideas which might start: wheel, tire, 
steering wheel, etc. 


A 12-year follow-up of the subjects was conducted in 1971, and completed 
questionnaires were obtained from 250 of the original 392 subjects. The 
questionnaire obtained information concerning a variety of achievements, 
including post-high school creative achievements and activities, by means of a 
checklist, descriptions of three most creative post-high school achievements, 
and a statement of future aspirations. Generally, the subjects supplied rather 
complete data. 

A measure of Quantity of Creative Achievement was obtained by adding 
the number of creative achievements checked and/or listed by each subject. 
These included unpublished poems, stories, songs; published poems, stories, 
songs; unpublished book manuscripts; books published; radio and television 
scripts performed; original research designs developed; awards for leadership; 
research grants received; scientific papers published; scientific papers 
presented at professional meetings; business enterprises initiated; patentable 
devices invented; awards received in music, visual art, creative writing, etc.; 
films produced; instructional materials created; etc. 

A measure of Quality of Creative Achievement was derived from the 
ratings of five expert judges (mature researchers and doctoral candidates with 
concentrations on creative studies) of the descriptions of the three most crea- 
tive achievements reported by each subject. The judges were instructed to 
rate these data on a 10-point scale in terms of the level of creativeness 
reflected by the achievements described. 

A measure of Creativeness of Aspirations was based on a similar set of 
ratings by the five expert judges of the statements each subject made con- 
cerning what he would most like to do in the future, assuming the necessary 
talent, training, and opportunity. The mean reliability coefficient of the five 
judges for these two sets of ratings was .91. 

Scoring according to established procedures was executed shortly after the 
testing in 1959. However, there was no scoring of combinatory responses. 
Records of 144 of the subjects (52 twelfth graders, 28 eleventh graders, 32 
ninth graders, and 32 seventh graders) returning questionnaires were avail- 
able in 1972 for rescoring for this purpose. Two scores were obtained. The 
first score is the number of different combinations or syntheses made. The 
second is the total number of circles used in these syntheses. For example, one 
subject made the following combinatory responses: four circles in a railway 
train engine; two circles in a pair of glasses; two circles in a toy wagon; four 
circles in a stop light (one each for stop, caution, and go, and another for 
the mechanism at the bottom); three circles in a cluster of cherries. His 
record was scored 5 for number of syntheses and 15 for total number of 
circles used. To facilitate correlation analysis and eliminate zero scores, the 
constant 1 was added to all scores. 
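
The two scores for the example record can be reproduced with a few lines of Python. This is a minimal sketch, not the author's scoring procedure: the function name `synthesis_scores`, the dictionary layout, and the rule that a response needs at least two circles to count as a synthesis are assumptions made for illustration.

```python
# Circles used in each combinatory response of the example subject in the text.
example_responses = {
    "railway train engine": 4,
    "pair of glasses": 2,
    "toy wagon": 2,
    "stop light": 4,
    "cluster of cherries": 3,
}


def synthesis_scores(responses, constant=1):
    """Return raw and adjusted (raw + constant) scores for one test record."""
    # Assumed rule: only responses combining two or more circles are syntheses.
    combos = [n for n in responses.values() if n >= 2]
    n_syntheses = len(combos)   # first score: number of syntheses
    n_circles = sum(combos)     # second score: circles used in the syntheses
    return {
        "syntheses": n_syntheses,
        "circles": n_circles,
        "syntheses_adjusted": n_syntheses + constant,
        "circles_adjusted": n_circles + constant,
    }


scores = synthesis_scores(example_responses)
print(scores)
```

For the example record the raw scores are 5 syntheses and 15 circles, which become 6 and 16 once the constant 1 is added.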


C. RESULTS 


Since the coefficients of correlation between grade level and the criteria are 
insignificant (.09 for Quantity of Creative Achievement, —.03 for Quality 
of Highest Creative Achievements, and —.02 for Creativeness of Aspira- 
tions), the data across grade levels were pooled. 

The product-moment coefficients of correlation between the number of 
combinations or syntheses on the circles task and the criterion scores are as 
follows: Number of Creative Achievements, .54; Quality of Creative Achieve- 
ments, .47; Creativeness of Aspirations, .42. For 143 cases, all three coeffi- 
cients of correlation are significant at better than the .01 level of confidence. 

The product-moment coefficients of correlation for number of circles used 
in the syntheses with the criterion variables are as follows: Number of Creative 
Achievements, .50; Quality of Creative Achievements, .45; Creativeness 
of Aspirations, .39. Again, all of these coefficients of correlation are signifi- 
cant at better than the .01 level of confidence. 
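
The significance claims can be checked with the usual t test for a product-moment correlation, t = r * sqrt(n - 2) / sqrt(1 - r^2) with n - 2 degrees of freedom. The sketch below is an illustration rather than the author's computation: the helper name `t_for_r` is hypothetical, and the critical value 2.62 is an approximate two-tailed .01 cutoff for roughly 140 degrees of freedom, entered by hand as an assumption.

```python
import math


def t_for_r(r: float, n: int) -> float:
    """t statistic for testing a Pearson r against zero (n - 2 df)."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


N_CASES = 143          # number of cases reported in the text
T_CRITICAL_01 = 2.62   # approximate two-tailed .01 cutoff near 140 df (assumed)

# Coefficients reported in the text for the two predictors.
reported = {
    "syntheses vs. number of achievements": 0.54,
    "syntheses vs. quality of achievements": 0.47,
    "syntheses vs. creativeness of aspirations": 0.42,
    "circles vs. number of achievements": 0.50,
    "circles vs. quality of achievements": 0.45,
    "circles vs. creativeness of aspirations": 0.39,
}

t_values = {label: t_for_r(r, N_CASES) for label, r in reported.items()}
for label, t in t_values.items():
    print(f"{label}: t = {t:.2f}, exceeds cutoff: {t > T_CRITICAL_01}")
```

Even the smallest coefficient, .39, gives a t of about 5, well beyond the cutoff, which is consistent with the statement that all six coefficients are significant at better than the .01 level.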


D. Discussion 


Although the relationships between the predictor variables (number of 
syntheses and number of circles used in the syntheses) and the criteria (Quan- 
tity of Creative Achievements, Quality of Creative Achievements, and Crea- 
tiveness of Aspirations) are impressive and consonant with expectations, they 
must be interpreted with caution. 

The subjects of the study came from homes that could afford to give them 
opportunities for attaining their potentialities. It is not likely that as favor- 
able results would have been obtained in a disadvantaged population. It would 
be interesting, however, to determine whether disadvantaged young people 
who make syntheses on this test task are also able to see their way out of the 
disadvantaged circumstances in which they were reared. Such a tendency would 
be predicted. 

It must also be recognized that syntheses on the circles task may mean 
something different from syntheses on the parallel lines, squares, or triangles 
test. However, one would expect similar results. 

The context within which the subjects of this study lived and were tested 
may also have influenced the results. A higher frequency of combinations on 
the repeated figures task is observed on posttests than on pretests in studies 
where deliberate efforts have been made to facilitate creative thinking. 

One may also argue that different results may have been obtained with 
the full sample of 250 subjects. This is not likely, however, since the original 
scoring for such other variables as fluency, flexibility, originality, elaboration, 
and the like did not yield different results for the 144 subjects in this study 
and for the 250 subjects in the full sample. 

The theoretical support for the findings is quite strong. A large number of 
the studies of creative persons (1) have emphasized the fact that a key 
characteristic of such persons has been their ability to see beyond where others 
have seen, to get away from the obvious and commonplace, to test their limits 
and those of the situation in which they function. Certainly the empirical 
support provided by this study and the theoretical support derived from a 
large number of empirical studies are sufficient to justify present practices 
regarding bonus scoring for combinations on the repeated figures test task. 
E. Summary 


It was hypothesized that scoring for number of combinations of circles and 
number of circles used in combinations on the repeated figures task of the 
Torrance Tests of Creative Thinking is valid for predicting creative achieve- 
ment. The subjects were 144 young adults (aged 25 to 31) tested 12 years 
prior to their follow-up in 1971. 

Original test records were scored for number of syntheses and number of 
circles used in these syntheses on the circles test. Indexes were developed for 
number of post-high school creative achievements, quality of peak creative 
achievements, and creativeness of future aspirations. 

Product-moment correlations between the predictor and criterion variables 
ranged from .39 to .54 and averaged .45, all significant at the .01 level. 
MARIJUANA: ITS MEANING TO A COLLEGE POPULATION* 


University of Detroit 


M. LaVerne LaDriere and Thomas R. Szczepkowski 


A. History 


As a result of the burgeoning use of drugs among American high school and 
college-aged young people, recent literature has been filled with studies 
investigating the motivation for drug use and the meaning of drugs to these young 
people. Some (4, 5) attributed the appeal of drugs to a need pattern involving 
desire for independence, thrill and adventure seeking, and rebellion. Tension 
release, escape from frustration, psychedelic and mind expanding experiences, 
and the search for a better world were other motives commonly related to 
drug use (1, 2, 5). Curiosity and the need to be “in” socially and to “go 
along with the group” were recurrent themes when the dynamics of drug use 
among young persons were considered (2, 4). Keeler (2) and Scher (8) 
emphasized the achievement of the “high” as the fundamental motive in continued 
drug use. The cumulative effect of such studies made very apparent the 
complexity and multiplicity of the dynamics underlying drug use. 

Understanding the motivation behind drug use was further complicated by 
the fact that there seemed to be no single “drug personality” (3). The user 
population was not a single homogeneous group, but rather consisted of many 
very different subgroups tentatively defined across such variables as age, sex, 
race, educational level, and even degree of psychiatric deviation and pathology. 
Scher (8) stressed the direct relationship between different user subgroups 
and specific drug motivation and drug choice. Consideration of the dynamics 
within carefully delineated samples representing these subgroups within the 
user population appeared to be the logical next step in attempting to clarify 
the motivational complex involved in drug use. Finally, Popoff (1) 
emphasized the need to establish a kind of “baseline” or “frame of reference” 
against which to evaluate user dynamics, by pointing out the meaningfulness 
of comparisons between attitudes of users with those of appropriate nonuser 
samples. 

The present study represented an attempt to follow these guidelines. It 
proposed to study the implications and motivation underlying marijuana use 
with similar material derived from a related nonuser sample drawn from the 
same college population. 


B. METHOD 
1. Selection of Marijuana as Focal Drug 


Marijuana was selected as the pivotal drug concept to be studied for the 
practical reason that, as a result of its widespread popularity and availability, 
securing subjects for the user population was greatly simplified. Also, this 
determination was considered fortunate, since examination of the literature 
suggested that marijuana possessed the power to represent most of the dynamic 
kaleidoscope involved in drug use motivation from production of the “high,” 
through risk and thrill seeking and rebellion, to social conformity. Therefore, 
it was felt that the data would not only be specific to marijuana itself but 
would have implications for motivation basic to use of other drugs. 


2. Sample 


The variables of chronological age, sex, and educational level constituted 
the major sampling restrictions. One hundred subjects were drawn from a 
university student population. All subjects were between 18 and 22 years of 
age; 50 were male, 50 female. Of these, 50 (25 male and 25 female) were 
marijuana users, and 50 (again 25 male and 25 female) were nonusers. The 
fact that all subjects were active students in good standing at the university 
was taken as an operational indication of minimal personality deviation. Selec- 
tion of individual subjects was randomized within the sampling restrictions 
imposed by the selection variables. 


3. Testing Techniques 


All subjects completed both a 12 dimension semantic differential rating scale 
on the meaning of the concept of marijuana and a questionnaire concerned 
with the motivation involved in drug use and drug nonuse. 

a. Semantic differential. The semantic differential provided the basic com- 
parative data on the meaning of marijuana to each of the four student groups 
tested. This task consisted in locating the concept marijuana on a seven point 
scale across the 12 dimensions, or sets of polar adjectives, in terms of which 
the concept was to be defined.1 The scale was so constructed that positions 1 


1 It should be noted that nine of the 12 dimensions were selected not only for 
applicability to the concept of marijuana but also in such a way that sets of three 
dimensions represented heavy loadings on each of Osgood’s three major factors (6). 
It was originally intended to combine these dimensional sets into factors and analyze 
the data at that level. Subsequent factor analysis performed on the data supported the 
and 7 represented extreme values of each adjective pair, and position 4 
constituted the neutral or indifferent point for each dimension. The 12 dimensions 
and polar values have been indicated in the footnote to Table 1. 

b. The questionnaire. Questionnaire data were collected to supplement 
semantic differential data in two ways. First, many of the questions served to 
define the subject groups operationally in terms of degree, frequency, and 
recency of drug use. Secondly, it was hoped that some of the questions would 
clarify the motivational dynamics governing the use or nonuse of the drug. 
Therefore, two multiple-choice questionnaires were constructed, one for users 
and one for nonusers. The users were queried on frequency and recency of 
use, use of other drugs, and reasons for use of marijuana. The nonusers were 
asked about reasons for abstinence, possibility of “trying” drugs in the future, 
and attitudes towards users. More than one alternative might be checked by an 
individual on questions where this was appropriate. 


C. RESULTS 


The response locations for the concept marijuana on each of the 12 semantic 
differential dimensions individually were tabulated on each of the four subject 
groups separately. Mean position values for each dimension for each group 
were then computed, and these data have been presented in Table 1. 

From the data in Table 1, it was obvious that the outstanding differences 
in the concept of marijuana occurred between user vs. nonuser groups regard- 
less of sex. The dimensions upon which the largest user vs. nonuser differences 
appeared were moral-immoral, pleasant-unpleasant, good-bad, intelligent-un- 
intelligent, and reality-fantasy. When the median test was applied to the com- 
parisons between the groups on each of the dimensions (refer to Table 1), 
this pattern of data was confirmed. Significant differences occurred on five 
dimensions (i.e., moral-immoral, good-bad, pleasant-unpleasant, intelligent- 
unintelligent, and reality-fantasy) where responses of male users vs. those of 
male nonusers were compared. Similarly, significant differences appeared be- 
tween the responses of female users vs. those of female nonusers on these same 
dimensions, excepting only intelligent-unintelligent which failed to achieve 
significance. Comparisons between male users vs. female nonusers and between 
female users vs. male nonusers merely served to underline the above. No 
significant differences on any of these dimensions occurred between the two 
sexes within either the user or the nonuser group. This fact underscored the 


choice of these dimensions relative to Osgood’s factors. However, an overview of the 
data suggested that the procedure of averaging into factors might have hidden some 
of the more informative dimensional differences between group responses. Hence, 
analysis was based on all dimensions separately considered. 
assumption that the group differences that attained significance were directly 
related to the variable of drug use or abstinence per se. 
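
The median test used for the group comparisons can be sketched as follows. This is a hedged illustration, not the authors' analysis: the ratings are invented and are not taken from Table 1, the function name `median_test_2x2` is hypothetical, and the handling of ties (scores equal to the grand median are counted as "not above") and the omission of a continuity correction are assumptions, since the text does not say how these were treated.

```python
from statistics import median


def median_test_2x2(group_a, group_b):
    """Median test for two groups: return the 2 x 2 counts and chi-square (1 df)."""
    grand_median = median(list(group_a) + list(group_b))
    # Count scores above the grand median; ties count as "not above" (assumed).
    a_above = sum(x > grand_median for x in group_a)
    b_above = sum(x > grand_median for x in group_b)
    a_below = len(group_a) - a_above
    b_below = len(group_b) - b_above
    n = len(group_a) + len(group_b)
    denom = (a_above + b_above) * (a_below + b_below) * len(group_a) * len(group_b)
    counts = ((a_above, b_above), (a_below, b_below))
    if denom == 0:
        return counts, 0.0   # degenerate table: no evidence of a difference
    chi2 = n * (a_above * b_below - b_above * a_below) ** 2 / denom
    return counts, chi2


# Invented 7-point ratings on one dimension, for illustration only.
users = [1, 2, 2, 1, 3, 2, 1, 2, 3, 2]
nonusers = [4, 5, 4, 3, 5, 4, 6, 4, 5, 4]

counts, chi2 = median_test_2x2(users, nonusers)
CHI2_CRITICAL_05 = 3.841   # chi-square cutoff for 1 df at the .05 level
print(counts, round(chi2, 2), chi2 > CHI2_CRITICAL_05)
```

Because the test uses only whether each rating falls above the grand median, it suits ordinal seven-point ratings, where the distance between scale positions cannot be assumed equal.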

As noted above, questionnaire data served two purposes, providing both 
empirical description of sample groups and subsidiary data regarding motiva- 
tion for drug use. Responses to the first few questions operationally defined 
the user group, male and female, as a homogeneous group of consistent users. 
This was based on the fact that all except one girl reported use of marijuana 
many times, usually at least once weekly, over the past two or more years. 
Over 70 percent of them had used it within the two weeks preceding taking 
the questionnaire. The majority (72 percent of the females and 84 percent of 
the males) reported use of other drugs as well. The mean number of other 
drugs used was four, with a range of one to nine depending upon the individual. 
Speed, LSD, and downers were those most frequently used. 

In responding to the question concerning motivation for marijuana use, the 
majority of the users (72 percent females and 76 percent males) cited the 
pleasure of the high as the motivation. Desire for expanded awareness was the 
only other reason consistently chosen (12 percent females and 40 percent 
males). Escape, social acceptance, curiosity, excitement, and so on, were hardly 
indicated at all as motives for marijuana use by this group. Of the six students 
who were considering giving up marijuana, the single reason for doing so 
given by all was that they “didn’t need it any more.” “Too much expense,” 
“too much trouble,” and “too much bad stuff” provided second reasons in some 
cases. 

Nonusers were operationally defined as those who had never used marijuana. 
The major reasons given by nonusers for abstinence from marijuana usage 
revolved around personal and practical considerations, rather than around 
religious or moral restraints. That the drug could be physiologically harmful 
(36 percent female and 60 percent male) was the major deterrent. Family 
considerations, risk, and expense were other reasons cited with some frequency. 
Forty percent of the males and 16 percent of the females considered them- 
selves open to the possibility of “trying” the drug someday, but the majority 
of both groups felt they would continue to refrain from use. Over half of the 
nonuser group had friends who were users and for the most part expressed a 
kind of tolerant acceptance concerning them. 


D. Discussion 


From these data, two dynamically distinct profiles were defined within the 
college population tested. 
1. The User 


The data of Table 1 indicated that users, regardless of sex, viewed mari- 
juana as very pleasant as well as moral and good, while nonusers saw it as 
hedonistically and morally indifferent. Female users especially tended to 
describe marijuana as smooth, soft, and light: certainly experiential aspects 
of the high. Questionnaire response confirmed the fact suggested by semantic 
differential data that the primary implication of and motivation for marijuana 
use was the pleasant experience of the high. This substantiated the conviction 
of Scher (8) that the high was the single major motive for continued drug use. 
It supported Keeler’s (2) finding that tension release and attainment of the 
high constituted the major reason for marijuana use. The fact that marijuana 
was also seen by users as good and moral suggested that they would admit 
nothing “wrong” in taking pleasure from its use. Many verbalized that 
marijuana use should be an accepted social fact (just as the use of alcohol and 
beer is) and such use should suggest nothing intrinsically evil. 

Questionnaire data further indicated that the only other consistent motivation 
for marijuana use reported was the desire for “expanded experience and in- 
creased sensitivity.” These data, too, closely paralleled Keeler’s findings that 
the search for psychedelic and/or expanded experience was the outstanding 
motive, second only to the high itself, cited by his marijuana user group. 
Although his subjects were all psychiatric referrals, his sample consisted of 
predominately male college users, closely approximating the male user sample of 
the present research. It was interesting, therefore, that this motivation was 
primarily expressed among the male user group in the present study. When 
this desire for increased awareness was taken in combination with the semantic 
differential data which revealed male users rating marijuana as intelligent, it 
provided a basis for hypothesizing the operation of the mechanism of intellec- 
tualization as part of the expressed rationale for drug use, especially among 
male users. It was very possible that marijuana use to them represented an 
intelligent and sensitive means of discovery of self and reality. In this context, 
its use might even have defined the limits of a kind of elite intelligentsia 
within the college group, with marijuana use operating in a manner similar 
to its cohesive and initiate function within the hippie group (1). It would not 
be surprising to discover this type of dynamics among a college sample where 
value has often been placed on intellectual pursuits. 


2. The Nonuser 


In marked contrast was the profile of the nonuser drawn from the same 
college population. Nonusers saw marijuana not so much as unpleasant or 
immoral, but rather as unintelligent and bad. Questionnaire responses of 
nonusers indicated that the outstanding reason for abstinence was possible 
physiological harmfulness. Could this be the “bad” on the semantic differential? 
Nonusers emphasized the aspect of drug involvement related to potential 
danger and harm much more than that of potential pleasure. Further, they 
saw marijuana as fantasy-related, involving aspects of escape, dependence, and 
withdrawal from reality. As both a danger and an escape, marijuana use was 
to the nonuser unintelligent, unrealistic, and impractical. This was strongly 
supported by questionnaire data which asserted the personal and practical bias 
of this group in contrast to the pleasure-oriented dynamics observed in the user 
group. A final characteristic of the nonuser sample was the tolerance and/or 
indifference in their attitude toward marijuana use by others. Although they 
had not tried it themselves (indeed, they seemed not even sure of its pleasure 
giving aspects), nonusers did not condemn or reject others for using it. They 
saw marijuana use as morally indifferent, and expressed accepting, often 
positive, attitudes towards users. Indeed, many nonusers remained open to 
possible future use themselves. They apparently viewed the drug and its use 
today as an existing social fact to be freely exploited by those who wish to do 
so. 


E. SUMMARY 


One hundred college students, 50 users and 50 nonusers, responded to a 
semantic differential rating scale and a questionnaire regarding the meaning 
and motivation of marijuana use. Results indicated that marijuana usage was 
primarily related to seeking the pleasure of the high, and secondarily to the 
mind expanding potential of the drug. Nonusers abstained for reasons of 
danger to personal health and for practical reasons associated with use. Neither 
group emphasized the moral implications of drug use, but rather both groups 
agreed that such indulgence should be considered a socially accepted fact. 
Further, escape, risktaking and thrill seeking, desire for independence, re- 
bellion, social conformity, and the several other motives assigned in the 
literature to drug use did not seem to be dynamically operative in this college 
sample. It seemed probable that many of these motives, which involved 
independence, social acceptability, and adventure seeking, characterized adoles- 
cent dynamics and might have applied more meaningfully to samples younger 
and less mature than the college group studied. Finally, the alternative dynamic 
pattern of escape seeking, helplessness, and frustration delineated by Allen and 
West (1) in their intensive review of the hippie subculture also did not seem 
to operate meaningfully within an active college sample. 
PRAISE, EVALUATIVE DEPENDENCE, AND THE 
EXPERIMENTER AS FACTORS IN A 
FREE-LEARNING TASK** 


DePaul University and Ohio State University 


William Terris and Thomas W. Milburn 


A. INTRODUCTION 


Motivational theorists have suggested that the presence of such learned 
social motives as the need for achievement or the need for social approval can 
facilitate performance in many situations. For example, Crowne and Marlowe 
(1) have reported that verbal conditioning is faster for subjects scoring high 
on the Marlowe-Crowne Social Desirability scale than for those scoring low. 
Enhanced performance for high Marlowe-Crowne subjects has also been found 
for simple motor tasks (6). More recently, Milburn, Bell, and Koeske (3) 
have investigated how subtle praise and censure interacted with evaluative 
dependence (Marlowe-Crowne Social Desirability scores) in affecting per- 
formance in a free-learning situation. In their first experiment they found that 
only the subjects high in evaluative dependence showed a reliable improvement 
in performance over trials, and for these subjects improvement occurred to an 
equal degree under both approval and censure conditions. In their second 
experiment no praise or censure was given, and any possible interaction be- 
tween the experimenter and subject was minimized by having the instructions 
automated and presented by tape recorder while the experimenter was kept 
out of subject’s view. Under these conditions, there was no difference be- 
tween the high and low Marlowe-Crowne subjects’ performance. 

Milburn et al. concluded that the improvement in performance for the 
high Marlowe-Crowne subjects in the first experiment was due to the 
interaction of the censure-approval factor with the evaluative dependence factor, 
rather than to either factor alone. At least four theoretical explanations can 
account for this interaction. The first (threshold hypothesis) is that 
evaluative dependence reflects the threshold for the detection of evaluation, with 
subjects high in evaluative dependence more likely to detect and therefore react 
to subtle praise, censure, or other kinds of evaluation than are subjects scoring 
low on this scale. The second explanation (responsiveness hypothesis) holds that 
subjects high in evaluative dependence show no special ability to detect 
evaluation, but rather they are more responsive to any given amount of evaluation 
than are the subjects low in evaluative dependence. The third explanation 
(audience hypothesis) holds that the Marlowe-Crowne test may measure a 
person’s sensitivity to the presence of an audience. The mere presence of an 
audience has been found to facilitate performance in many situations (e.g., 7), 
and it would be reasonable for high Marlowe-Crowne subjects to show the 
greatest amount of facilitation in these situations. For the “audience 
hypothesis,” no praise, censure, or overt evaluation would be necessary to produce the 
effect, since the mere presence of an audience and perhaps the potential for 
evaluation could be sufficient to facilitate the performance of the high 
Marlowe-Crowne subjects. The fourth explanation (experimenter hypothesis) 
holds that the results of Milburn et al. may represent the effect of only one 
experimenter’s subtle censure or praise. In consideration of the wide individual 
differences in both the expression and the interpretation of facial expressions, 
it is important to be able to demonstrate the effect with more than one 
experimenter. 

* Received in the Editorial Office on March 23, 1972, and published immediately 

The major purpose of the present experiment was to provide a test of the 
relative merits of these four hypotheses. This was accomplished by essentially 
replicating the study by Milburn et al., but including three levels of praise 
which differed essentially in the obviousness of the evaluation (i.e., obvious, 
subtle, or no praise). In addition, the entire experiment was performed 
independently by three different experimenters. 

Each of the four hypotheses would predict a different pattern of results in 
regard to performance of subjects for the various levels of evaluative 
dependence and amount of praise. If the audience hypothesis is correct, then all 
high Marlowe-Crowne subjects should perform better than low Marlowe-Crowne 
subjects regardless of amount of praise. In addition, there should be no 
reliable differences among the three praise groups. If the responsiveness hypothesis 
is correct, the high Marlowe-Crowne subjects should perform better than the 
low Marlowe-Crowne subjects regardless of amount of praise. In addition, 
performance of all subjects (high and low Marlowe-Crowne) should 
improve with increasing praise. If the threshold hypothesis is correct, then 
performance should generally increase with more praise, but the differences, if 
any, between the high and low Marlowe-Crowne groups would depend upon 
the amount of praise. When the praise is clearly below or above the threshold 
level (i.e., no praise or obvious praise), there should be little difference be- 
tween the high and low Marlowe-Crowne subjects. However, when the praise 
is approximately at the threshold level (subtle praise), the high Marlowe- 
Crowne subjects should perform better than the low Marlowe-Crowne sub- 
jects. Of course, if the experimenter hypothesis is correct, the pattern of 
outcomes for the various treatment groups should vary with the experimenters. 
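
The qualitative predictions of the first three hypotheses can be set side by side. The Python sketch below is only an illustrative encoding and is not part of the original study: the dictionary layout and the function name `consistent_hypotheses` are hypothetical, and the experimenter hypothesis is left out because it predicts only that the pattern will differ from one experimenter to the next.

```python
PRAISE_LEVELS = ("none", "subtle", "obvious")

# For each hypothesis: does performance rise with more praise, and at which
# praise levels should high Marlowe-Crowne subjects outperform low ones?
PREDICTIONS = {
    "audience": {
        "praise_improves": False,
        "high_beats_low_at": set(PRAISE_LEVELS),
    },
    "responsiveness": {
        "praise_improves": True,
        "high_beats_low_at": set(PRAISE_LEVELS),
    },
    "threshold": {
        "praise_improves": True,
        "high_beats_low_at": {"subtle"},
    },
}


def consistent_hypotheses(praise_improves, high_beats_low_at):
    """Return the hypotheses whose predicted pattern matches an observed one."""
    observed = set(high_beats_low_at)
    return [
        name
        for name, pattern in PREDICTIONS.items()
        if pattern["praise_improves"] == praise_improves
        and pattern["high_beats_low_at"] == observed
    ]


if __name__ == "__main__":
    # A hypothetical outcome: praise helps, and the groups differ only
    # under subtle praise.
    print(consistent_hypotheses(True, ["subtle"]))
```

An observed pattern that matches none of the three entries returns an empty list, which is the situation in which the experimenter hypothesis, or some other account, would have to be considered.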


B. Method
1. Subjects and Materials 


The subjects were 144 male and 144 female students enrolled in intro- 
ductory psychology courses at DePaul University, who participated in the 
experiment as part of the course requirement. The three lists of words used 
in this experiment were identical with those used by Milburn, Bell, and 
Koeske (3). Each list consisted of 24 one- or two-syllable nouns of AA fre- 
quency on the Thorndike-Lorge (5) frequency count. The words were pre- 
sented on a Ralph Gerbrands Co. Model MIA memory drum with each word 
exposed to the subject for one second. 


2. Design 


The experiment consisted of a 2 × 2 × 3 × 3 factorial design with the
following factors: (a) sex of subject (male and female); (b) Marlowe-Crowne
scores of subject (high and low evaluative dependence); (c) praise
treatment (none, subtle, and obvious); and (d) experimenter (experimenter
1, 2, and 3).
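
The arithmetic of this design can be checked with a short Python sketch. It assumes equal cell sizes, which the text does not state outright (the 144/144 split by sex suggests it); the names `FACTORS` and `design_summary` are illustrative only. Under that assumption the error degrees of freedom agree with those reported in the Results (252 between subjects, 504 for effects involving Lists, and 84 within a single praise condition).

```python
from itertools import product

# Between-subject factors of the 2 x 2 x 3 x 3 design described above.
FACTORS = {
    "sex": ["male", "female"],
    "marlowe_crowne": ["high", "low"],
    "praise": ["none", "subtle", "obvious"],
    "experimenter": [1, 2, 3],
}
N_SUBJECTS = 288  # 144 male + 144 female
N_LISTS = 3       # each subject learned three lists


def design_summary(factors=FACTORS, n_subjects=N_SUBJECTS, n_lists=N_LISTS):
    """Count cells and derive error degrees of freedom (equal cells assumed)."""
    cells = list(product(*factors.values()))
    n_cells = len(cells)
    per_cell, leftover = divmod(n_subjects, n_cells)
    between_error_df = n_subjects - n_cells
    n_praise = len(factors["praise"])
    # Error df when only one praise condition is analysed (simple effects).
    simple_error_df = n_subjects // n_praise - n_cells // n_praise
    return {
        "n_cells": n_cells,
        "subjects_per_cell": per_cell,
        "leftover_subjects": leftover,
        "between_error_df": between_error_df,
        "lists_error_df": (n_lists - 1) * between_error_df,
        "simple_effect_error_df": simple_error_df,
    }


if __name__ == "__main__":
    for key, value in design_summary().items():
        print(f"{key}: {value}")
```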


3. Procedure


Prior to testing in the free-learning situation, all subjects took part in a
“social opinion poll” in which the Marlowe-Crowne test was administered.
The social opinion poll was conducted as if it were a separate study
completely independent of the free-learning experiment. The administration of the
Marlowe-Crowne scale and the free-learning testing situation took place at
different times, in different locations, and with different experimenters. In
addition, the subjects were contacted separately, by different people, for each
part of the experiment. For the purposes of analysis, subjects were assigned to
one of two evaluative dependence groups (high and low) on the basis of their
scores on the Marlowe-Crowne Social Desirability scale. In order to minimize
experimenter bias (e.g., 4) the experimenters were not made aware of the
subjects’ Marlowe-Crowne scores.

The subjects were tested in the free-learning situation individually, with
the subject sitting facing the memory drum and the experimenter located to 
the front and to the side of the subject. The experimenter and the subject 
were always in full view of each other. The initial instructions given to the 
subject were as follows: “This is a learning task. There will be three sets 
of words to learn. In each set the words will be briefly presented in the 
window here [pointing]. After all the words in the set have been presented, 
I shall ask you to write down all the words you can remember in any order 
you like. Are there any questions? All right, when I turn the machine on, 
there will be a few blank spaces, and then the words will appear. Ready?” 

After all the words in the list were presented, the experimenter turned off 
the machine and asked the subject to write down as many of the words as 
possible in any order. When the subject finished writing, the experimenter 
briefly looked at the lists, quickly counted the words, and displayed one of 
three conditions of approval after each list: (a) in the no praise condition,
the experimenter would say or do nothing to indicate approval (or
disapproval); (b) in the subtle praise condition, the experimenter barely nodded
his head and raised his eyebrows slightly so as to indicate slight pleasure; (c)
in the obvious praise condition, the experimenter would accompany the
gestures with a statement to the effect that his or her performance was “very
good.”

At the end of the experiment a questionnaire was given in which the sub- 
jects were asked to rate (on seven-point scales): (a) the friendliness of the 
experimenter, (b) the competence of the experimenter, (c) the interestingness 
of the experiment, and (d) the difficulty of the experiment. The subjects were 
instructed on how to use the scales and informed that the experimenter him- 
self would not see the responses. The subject then filled out the questionnaire, 
sealed it in an envelope, and dropped it into a box which was used especially 
for this purpose. 

The experimenters, three male graduate students in psychology, received 
training on how to make the gestures for approval from the person who
had served as the experimenter in the Milburn, Bell, and Koeske experiment. In
addition, the three experimenters worked together in perfecting the approval 
techniques so as to maximize the uniformity of behavior within each approval 
condition. 


C. Results
1. Number of Words Recalled 


An analysis of variance was performed on the number of words recalled 
as a function of Sex of subject (male or female), Experimenter (1, 2, or 3),
Marlowe-Crowne scores (high or low), Praise (none, subtle, or obvious),
and Lists (1, 2, or 3). The F ratios for the main effects for the Sex (F =
8.87; df = 1/252; p < .01) and the Lists (F = 20.00; df = 2/504; p <
.01) factors were statistically reliable, with female subjects recalling more 
words per list (8.94) than male subjects (8.23) and the number of words 
recalled varying on lists 1, 2, and 3 (9.11, 8.16, and 8.48, respectively). 
Three of the F ratios for the interactions were also reliable. The mean 
number of words recalled for the main treatment groups in the Sex by 
Marlowe-Crowne by Praise interaction (F = 4.25; df = 2/252; p < .025) 
is shown in Figure 1. Inspection of the figure suggests that the major portion 
of the three-way interaction can be accounted for by the interaction of Sex 
and Marlowe-Crowne which appears to occur mainly under conditions of
subtle approval. A simple effects analysis of each praise condition yielded
results consistent with this interpretation. Under the condition of subtle
approval the Marlowe-Crowne by Sex interaction was reliable (F = 6.96;
df = 1/84; p < .01), but under the conditions of either no praise or obvious
praise the Marlowe-Crowne by Sex interaction did not approach statistical
reliability (F < 1 in both cases). For male subjects tested with subtle praise,
those subjects high in evaluative dependence recalled more words than the sub- 
jects low in this regard, while for female subjects tested with subtle praise 
the difference was in the opposite direction. These results are not fully con- 
sistent with those of Milburn, Bell, and Koeske (3) who found that subtle 
praise (and censure) facilitated the performance of both male and female 
subjects scoring high on the Marlowe-Crowne. 

The Sex by Marlowe-Crowne by Lists interaction was also statistically 
reliable (F = 3.15; df = 2/504; p < .05), and a simple effects analysis 
indicated that the three-way interaction could mainly be attributed to a 
Marlowe-Crowne by Sex interaction which occurred primarily on the second 
list. As in the previously mentioned interaction, males high in evaluative de- 
pendence recalled more words than did males low in this respect, but the 
difference between the females high and low in evaluative dependence was 
in the opposite direction. 

The mean number of words recalled for the main treatment group com- 
binations for the statistically reliable Experimenter by Praise by Lists inter- 
action (F = 2.69; df = 8/504; p < .01) is shown in Figure 2. As can be 
seen, the effect of varying the obviousness of praise across the three lists is 
different for each of the three experimenters. With the exception of a strong
Lists effect there is no apparent similarity of pattern common to all
experimenters.


2. Rating Scales 


A four-way analysis of variance was performed on each of the four rating 
scales as a function of Praise, Marlowe-Crowne, Sex, and Experimenter. A 
summary of the analyses of the four scales is shown in Table 1. For the Dull- 
Interesting scale, the F ratios for the Praise by Sex interaction, the Praise by 
Marlowe-Crowne interaction, and the Marlowe-Crowne main effect were all 
statistically reliable. None of the F ratios for the other main effects or
interactions reached the .05 level of significance. For the male subjects, increasing
praise (none, subtle, and obvious) resulted in a less interesting experiment 
(Means = 5.7, 5.6, and 5.1, respectively; Dull = 1, Interesting = 7), while 
for the females increasing approval increased interest (Means = 5.1, 5.9, and
5.9, respectively). While overall, the subjects scoring low in evaluative
dependence had lower interest scores than the subjects scoring high in evaluative
dependence (Means = 5.4 and 5.7, respectively), the two groups were 
affected differently by increasing amounts of praise. For the low Marlowe- 
Crowne subjects, increasing praise resulted in an initial increase in interest 
followed by a subsequent decrease (Means = 5.2, 5.8, and 5.1 for the none, 
subtle, and obvious groups, respectively). For the high Marlowe-Crowne
subjects, increasing the amount of praise resulted in higher interest scores 
(Means = 5.6, 5.7, and 5.9, respectively). 

For the scale dealing with the ease of the experiment, only the Sex main 
effect and the Sex by Experimenter interaction were reliable. While the
females rated the experiment as easier than the males (Means = 4.3 and 3.8,
respectively; Difficult = 1, Easy = 7), the difference between the male and
female groups varied among the three experimenters. The mean ease ratings
for the male and female subjects, respectively, were (a) for experimenter 1,
3.6 and 4.6; (b) for experimenter 2, 3.7 and 4.1; and (c) for experimenter 3,
4.2 and 4.2.

For the ratings of the friendliness of the experimenter, the Sex main effect 


TABLE 1
SUMMARY OF ANALYSES OF VARIANCE OF RATINGS PERTAINING
TO EXPERIMENT AND EXPERIMENTER

                                       Rating scales
                            Experiment                Experimenter
                         Dull-      Difficult-   Unfriendly-  Incompetent-
Source             df  Interesting     Easy        Friendly     Competent
Experimenter (E)    2     2.00          <1            <1          5.49**
Sex (S)             1     1.08      [illegible]      4.37*        3.35
E × S               2      <1          3.09*          <1           <1
Praise (P)          2     1.62         2.75           <1          2.02
P × E               4     1.51          <1            <1          1.38
P × S               2     7.06**        <1           2.26          <1
[row illegible]
MC                  1     6.31*         <1           3.53         2.33
MC × E              2      <1          1.63          2.35          <1
MC × S              1      <1          2.57           <1          6.50*
MC × S × E          2     1.20          <1            <1          1.04
MC × P              2     3.25*        2.12           <1          1.47
MC × P × E          4      <1           <1           2.41*         <1
MC × P × S          2      <1           <1            <1          2.11
MC × P × S × E      4     1.03          <1            <1           <1
Error (MS)        252   (1.724)      (2.357)       (.5964)      (.6516)

and the Marlowe-Crowne by Praise by Experimenter three-way interaction
were statistically significant. The female subjects rated the experimenter as 
more friendly than did the male subjects (Means = 6.6 and 6.4). The main 
treatment group combinations for the three-way interaction are shown in 
Figure 3, and again it can be seen that the pattern of the Marlowe-Crowne 
by Praise interaction was different for each experimenter. 

Analysis of the competence ratings yielded reliable F ratios for the Experi- 
menter main effect and for the Marlowe-Crowne by Sex interaction. The 
mean ratings of competency for experimenters 1, 2, and 3 were 6.61, 6.45, 
and 6.22, respectively (Incompetent = 1, Competent = 7). For male subjects
the difference in ratings was small between the subjects high and low
in evaluative dependence (6.29 and 6.39), while for the female subjects,
those high in evaluative dependence rated the experimenter as more competent
than did those low in this measure (6.71 and 6.32).

FIGURE 3
[Figure not reproduced: mean friendliness ratings for the Marlowe-Crowne by Praise by Experimenter treatment combinations.]
See Figure 1 for definitions of terms.


D. Discussion 


The results of this experiment were only partially consistent with those of 
Milburn, Bell, and Koeske. Since Milburn et al. did not use obvious praise
or no praise (at least as used in the present study), the important comparison 
would be for the subtle praise conditions in both experiments. While Milburn 
et al. found that subtle praise resulted in facilitation of performance of both 
male and female subjects high in evaluative dependence, the present study 
found that only males high in evaluative dependence showed the facilitation 
effect. For the female subjects in the present study the opposite was true, with
female subjects low in evaluative dependence superior to the females high in 
this respect. 

The results of the present experiment were not consistent with the audience, 
threshold, or responsiveness hypothesis. These hypotheses would not predict 
that the effects of praise would be limited to only the subtle praise treatment 
conditions and would not predict the type of Marlowe-Crowne by Sex inter- 
action found in this experiment. The results suggest that the subjects may not 
have been responding to praise but to the ambiguity of the situation. Under 
the conditions of subtle praise the gestures made by the experimenter were 
ambiguous and could be interpreted in a number of different ways (i.e., the
E is displeased, impressed, etc.). The no praise and obvious praise treatment 
conditions were less ambiguous and subject to fewer and less diverse inter- 
pretations. While the Marlowe-Crowne by Sex interaction in this experiment 
is difficult to interpret, the fact that it occurred only under conditions of 
subtle praise suggests that the interaction may be at least partially due to the 
interpretation by male and female subjects of the behavior of the male experi- 
menter in different ways. The present experiment did not include male and 
female experimenters because of the absence of any Sex by Treatment
interactions in the Milburn et al. study. However, the present results indicate the
need to vary the sex of both the subject and experimenter in order to clarify 
the meaning of the Sex (of subject) by Marlowe-Crowne interaction which 
was found in this experiment. 

The results of this experiment do lend partial support for the experimenter 
hypothesis. Several Experimenter main effects and Experimenter by Treatment
interactions were found, and the nature of these interactions shows that
different experimenters may obtain drastically different results even while
apparently replicating the same experiment. This is illustrated in Figure 3,
where the pattern of ratings for each experimenter, by itself, could probably
be interpreted with little difficulty, and the problem of interpretation only
became apparent when the results of more than one experimenter were
observed.

The presence of strong experimenter by treatment interactions indicates 
the importance of using more than one experimenter in psychological studies 
in which the experimenter interacts with the subject. While social scientists 
have traditionally stressed the importance of standardizing and validating the 
experimental apparatus and testing materials, much less emphasis has been 
placed on the experimenter. Yet the experimenter interacting with a subject 
has much the same function as a piece of apparatus or test material and should 
be evaluated in much the same way (e.g., 2). The presence of experimenter 
by treatment interactions could account for the failure of this present
experiment to replicate fully the results of Milburn et al. In general, the presence
of strong experimenter by treatment interactions could be responsible for the 
often reported difficulty in replicating social-psychological experiments. 

Finally, the subjects used in the present study were somewhat different in 
social and economic background from those used in the Milburn et al. study. 
While both studies selected subjects from introductory psychology subject 
pools from schools in the Chicago area, and while Milburn et al. did not
consider that their results were applicable only to students of the type found 
at Northwestern University, it is undoubtedly true that Northwestern and
DePaul University students have different social, religious, economic, and 
ethnic backgrounds. It is possible that the differences in the results of the 
Milburn et al. and the present study are due to the response to praise varying 
as a function of the social class and background of the subjects. 


E. Summary 


Previous research has shown that subtle praise or censure facilitates the 
performance of subjects who are high in evaluative dependence (Marlowe- 
Crowne Social Desirability scores). The main purpose of the present experi- 
ment was to determine which of four hypotheses could best account for this 
facilitation. Two hundred and eighty-eight subjects were used in a 2 × 2 ×
3 × 3 factorial design with the following factors: (a) Sex of S (male,
female), (b) Evaluative dependence (high, low), (c) Praise (none, subtle,
obvious), and (d) Experimenter (1, 2, 3). While praise did interact with
evaluative dependence and sex in determining performance, the results were
only partially consistent with the previous research. The effect of increasing
the obviousness of praise upon performance was found to depend upon which 
experimenter administered the praise. In addition, several other instances of 
experimenter and experimenter by treatment interactions were found. The
results suggest the importance of the experimenter interaction effects in
psychological research and indicate the need for more than one experimenter to
be included in such research.


REINFORCEMENT HISTORY AS A DETERMINANT OF
SELF-REINFORCEMENT


University of Tennessee 


Barry A. Bass


A. INTRODUCTION 


Self-reinforcement (S-R) as a method of self-control is attracting the 
interest of growing numbers of investigators (e.g., 4, 5, 6). The concept of
S-R holds particular appeal for clinicians, since it seems to offer an individual 
the opportunity to alter his own behavior by programming the consequences of 
his behavior without having to rely upon additional external support. Un- 
fortunately, because of the ambiguity of the concept, S-R has come to denote 
any of a number of different terms and operations. 

The definition of self-reinforcement proposed here, although no less arbi- 
trary than others (1), emphasizes what this experimenter considers to be the 
crucial aspect of the concept—complete control by the subject over the ad- 
ministration of reinforcement. This same point is emphasized by Skinner when 
he states that “the individual may at any moment drop the work in hand and 
obtain the reinforcement” (11, p. 238). The individual who actually admin- 
isters the reinforcement (be it the experimenter, a machine, or the subject 
himself) is irrelevant. What is important is an awareness by the subject that 
he alone possesses total control over the administration of reinforcement to 
himself. It is crucial in the S-R paradigm that the subject understand and 
believe that he truly does have complete control over all contingencies and can 
thereby “cheat” or reward himself even when the schedule does not permit.
If this condition is not met, then the individual is externally controlled rather 
than self-controlled. 

The rationale for defining S-R in this limited way relates to the manner in 
which the concept is employed in clinical practice. It is often an explicit 
assumption of certain behavior modifiers that with a thorough knowledge of
psychological principles—especially reinforcement theory—any given patient
or client can administer treatment to himself without the supervision of a 
therapist. Kanfer (5) refers to just such a goal as “autotherapy.” 

Thus a totally self-administered treatment program should presumably
owe its effectiveness to the self-administered treatment and not to the nonspe- 
cific social reinforcement dispensed by the therapist to the client for complying 
with instructions. In reviewing the literature in this area, this investigator 
has been unable to find any cases in which a patient administered a self- 
control technique to himself without the concurrent supervision of a therapist. 
In each case it was impossible to determine whether it was the self-control 
operations themselves, the social reinforcement of the therapist, or the
interaction of the two that was responsible for any behavior change (cf. 2, 3, 9, 13).

The purpose of this study is to determine under what conditions administering
contingent rewards to oneself is reinforcing. It is hypothesized that the
behavior maintaining properties of S-R are a direct function of either external
reinforcement or the external reward anticipated by a subject. When all ex- 
ternal reinforcement (either actual or anticipated) is removed from a self- 
rewarded task, it is hypothesized that S-R operations will possess no behavior 
maintaining properties. Restated in terms of the Premack Principle (8), there 
is no reason to believe that individuals will switch from emitting high
probability behavior to emitting low probability behavior in order to administer to
themselves rewards that they already possess and can administer to themselves
in any event.


B. Method


1. Subjects 


Sixty-four fifth grade children were randomly assigned to four groups of
eight boys and eight girls each. The mean age of the Ss was 10.8 years with a
range of 10.1 to 12.3 years.

Group 1 (SRC). Self-reinforcement in which, in a precondition phase,
compliance with instructions was consistently reinforced.

Group 2 (SRN). Self-reinforcement in which, in a precondition phase,
noncompliance with instructions was consistently reinforced.

Group 3 (SRR). Self-reinforcement in which, in a precondition phase, both 
compliance and noncompliance with instructions were each intermittently 
(randomly) reinforced. 


Group 4 (SRNP). Self-reinforcement with no precondition.


2. Apparatus


The apparatus was a plywood console (dimensions 22” × [remaining dimensions illegible]). A
marble placed into a hole three inches from the bottom rolled down a chute 
to an opening one-half inch from the bottom. It took approximately one second 
for the marble to roll down the chute. The console was also equipped with a 
second chute through which pennies, which served as reinforcers, were delivered 
at the appropriate time. Ten holes running horizontally along the top of the 
console, each numbered from 5 to 50 (in multiples of five) into which a one- 
half inch dowel could be placed, served to indicate the criterion for reinforce- 
ment on any given trial. A bell, rung prior to the delivery of reinforcement, 
was not connected to the console. 


3. Procedure 


Ss in all groups received the following tape-recorded precondition
instructions.


I will show you how you can get some pennies. You can keep all of the
pennies that you get. Your job is to try to get as many pennies as you can.

To get a penny, you first put the marble into this hole a certain number 
of times. [Demonstrate putting marble into hole.] To find out how many
times to put the marble into the hole, look up at these holes. [Pointing to 
horizontal row of holes.] If the stick is in this hole, then you will get a 
penny when you put the marble into the hole five times. If the stick is in 
this hole, then you will get a penny when you put the marble into the hole 
25 times. If the stick is in this hole, then you will get a penny when you 
put the marble into the hole 50 times. After you put the marble into the 
hole the right number of times, a bell will ring and then a penny will roll 
down here. [Pointing.] All the pennies that roll down are yours to keep. 

Sometimes you might get the penny before you put the marble in the 
right number of times. Don’t worry. Just remember that your job is to
get as many pennies as you can. If you get a penny before you are sup- 
posed to, you can keep putting the marble into the hole, or you can stop and 
wait for the next number. It’s up to you. When I take the stick out of a
hole and then I put it into another hole, that means that you can start 
working for another penny. But remember to look up at these holes that 
tell you how many times to put the marble into the hole before you get 
your next penny. If you have any questions, I want you to ask me now 
because once we start, I cannot answer any questions at all. 


After listening to the instructions, Ss in groups SRN, SRR, and SRC
participated in a series of 10 precondition trials. Each trial began when the
experimenter placed a dowel into one of the 10 holes along the top of the
console. This indicated to the S the number of times he was to put the marble
into the hole. A trial was considered over after the S had not responded for
at least five seconds. At a preselected point during each trial, a bell was rung
and then, one-half second later, a penny was delivered to the child. The point
in the sequence at which the penny was delivered was determined by the S’s
membership in one of the three precondition groups.

Ss in SRC heard the bell and received a penny immediately upon putting
the marble down the hole the previously indicated number of times (as
determined by the dowel placement). Ss in this group were consistently
rewarded for complying with the initial instructions of the experimenter. It
was assumed that these Ss would most resemble Ss in a no precondition
group; that is, it was assumed that most Ss participating in a psychological
experiment would expect reinforcement for complying with the instructions
of the experimenter (cf. 10).

Ss in SRN heard the bell and received a penny before responding the
previously indicated number of times (as determined by the dowel placement).
Here Ss were rewarded for noncompliance with experimenter instructions.
That is, the expectancy presumably induced by this procedure was that
compliance with instructions would not be rewarded by the experimenter.

Ss in SRR received reinforcement at the previously indicated time on five
random trials and received reinforcement before the previously indicated
time on the remaining five trials. Here it was assumed that Ss would come to
expect that compliance with instructions would sometimes, but not always, be
rewarded by the experimenter.

Ss in SRNP did not participate in the precondition phase although they
heard the same instructions as those given to the Ss in the other three groups.
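
The four precondition treatments can be sketched as a rule for when the penny arrives on each trial. The Python below is a minimal illustration under stated assumptions, not the author's procedure: the source does not say how far ahead of the criterion an early penny was delivered (the `early_fraction` parameter is hypothetical), nor whether each of the ten criterion values was used exactly once per child (assumed here). The function name `reward_points` is likewise illustrative.

```python
import random

# Ten criterion holes, numbered 5 to 50 in multiples of five (from the text).
CRITERIA = list(range(5, 55, 5))


def reward_points(group, criteria, rng, early_fraction=0.5):
    """Return, for each trial, the response count at which the penny arrives.

    SRC : penny exactly at the indicated criterion on every trial.
    SRN : penny before the criterion on every trial.
    SRR : penny at the criterion on a random half of the trials, early on the rest.
    SRNP: no precondition trials at all.
    `early_fraction` is an assumed value; the source gives no figure.
    """
    n_trials = len(criteria)
    if group == "SRNP":
        return []
    if group == "SRC":
        on_time = set(range(n_trials))
    elif group == "SRN":
        on_time = set()
    elif group == "SRR":
        on_time = set(rng.sample(range(n_trials), n_trials // 2))
    else:
        raise ValueError(f"unknown group: {group}")

    points = []
    for trial, criterion in enumerate(criteria):
        if trial in on_time:
            points.append(criterion)
        else:
            # Early delivery: some point strictly before the criterion.
            points.append(max(1, int(criterion * early_fraction)))
    return points


if __name__ == "__main__":
    rng = random.Random(0)
    for group in ("SRC", "SRN", "SRR", "SRNP"):
        print(group, reward_points(group, CRITERIA, rng))
```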

The precondition phase served as the independent variable for this study.
Since it was predicted that the efficacy of self-administered reward was a
function of the expectancies of the S, the four precondition treatments served
to control for the varying expectancies of the Ss. The efficacy of self-administered
reward, the dependent variable, would presumably differ between
groups depending upon Ss’ precondition experiences.

After participating in the precondition phase, all Ss immediately heard
the following tape-recorded instructions. (Ss in SRNP listened to the following
instructions immediately after hearing the precondition instructions.)


Now I will show you how you can get some more pennies. You can 
keep all of the pennies that you get. Your job is to try and get as many 
pennies as you can. 

To get a penny you first put the marble into the hole a certain number
of times. You decide how many times to put the marble into the hole
before you get a penny. If you put this stick into this hole, then you will
get a penny when you put the marble into the hole five times. If you put
the stick into this hole, then you will get a penny when you put the
marble into the hole 25 times. If you put the stick into this hole, you will
get a penny when you put the marble into the hole 50 times.

Whenever you want a penny, just ring this bell. [Demonstrate.]
Remember your job is to get as many pennies as you can as quickly as you
can. You can even cheat if you want. Just remember that whenever you
want a penny, ring this bell and I will roll down a penny to you. After
you get a penny remember to check this stick. You can leave it where it
is, or you can move it to another hole. Then when you want another 
penny, just ring the bell again and a penny will come out. You will get a 
penny every time you ring the bell. I will not be checking you, so re- 
member you can get a penny whenever you like. Just ring the bell, and a 
penny will come out. Try to get as many pennies as you can. You can
stop anytime you feel like it. If you have any questions I want you to ask
me now. Once we start I cannot answer any questions at all. 


All Ss were permitted to work at the console until they had earned 30 
pennies or for a total of 45 minutes, whichever occurred first. At all times 
the experimenter was in the room with the S, seated behind the console out
of view. 

5. Dependent Measures 


The following dependent variables were recorded for each S:

1. Self-selected criterion level (SSCL). This is the number chosen by the
S to indicate for each trial the predicted number of responses he would emit
before he rewarded himself.

2. Total responses per reward (TR). This is the actual number of
responses emitted by the S before he rewarded himself.

3. Discrepancy score (DS). This is the discrepancy between the predicted 
schedule of reinforcement preselected by S (SSCL) and the reinforcement 
schedule actually observed (TR). 
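
As a concrete illustration of the third measure, the mean discrepancy score for one child can be computed as the summed SSCL values minus the summed TR values, divided by the number of trials, which is the calculation described in the Results. The Python sketch below uses made-up trial values and a hypothetical function name, `mean_discrepancy`.

```python
def mean_discrepancy(sscl, tr):
    """Mean DS for one subject: (sum of SSCL - sum of TR) / number of trials.

    `sscl` holds the self-selected criterion on each trial and `tr` the
    responses actually emitted before self-reward on the same trials.
    """
    if len(sscl) != len(tr):
        raise ValueError("SSCL and TR must cover the same trials")
    if not sscl:
        raise ValueError("at least one trial is required")
    return (sum(sscl) - sum(tr)) / len(sscl)


if __name__ == "__main__":
    # Hypothetical data for three trials, not taken from the study.
    selected = [10, 20, 30]
    emitted = [10, 5, 15]
    print(mean_discrepancy(selected, emitted))
```

A score of zero means the child emitted exactly the number of responses he had set for himself; a positive score means he took pennies before reaching his own criterion.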

6. Hypothesis 


It was hypothesized that Ss in a compliant precondition group (SRC) and 
in a no precondition group (SRNP) would show significantly less discre- 
pancy between predicted and actual number of responses (DS) than would 
Ss in either a noncompliant (SRN) or a randomly compliant (SRR) precon- 
dition group. 

Groups SRC and SRNP had been rewarded in the laboratory or in the 
natural environment for complying with previously selected criteria. Ss in SRN
and SRR had been rewarded for failing to comply and would presumably be 


200 JOURNAL OF PSYCHOLOGY 


less likely to anticipate reinforcement for compliance. Ss in SRN and SRR 
would therefore, it was predicted, show greater discrepancies between antici- 
pated and observed reinforcement schedules. 


C. Results and Discussion


A mean DS score was obtained for each S by subtracting the sum of his
individual TR scores from the sum of his SSCL scores and then dividing by
the number of trials that comprised each measure. Table 1, which presents a 
4 X 2 analysis of variance of DS, indicates that the main effect of the four 
precondition treatments was significant (p < .001). No significant differences 


TABLE 1
ANALYSIS OF VARIANCE OF DISCREPANCIES (DS) BETWEEN PREDICTED
AND ACTUAL NUMBER OF RESPONSES (SSCL-TR) AS A FUNCTION
OF PRECONDITION AND SEX OF SUBJECT

Source of variation      df       MS         F          p
Between
  Precondition            3     398.480    7.015     p < .001
  Sex                     1      57.949    1.020     p > .30
  Precondition × Sex      3      28.211     .497     p > .60
Within                   56      56.804
Total                    63


could be attributed to the sex of the subjects. Duncan’s New Multiple Range 
Test indicates that the mean DS of group SRN was significantly greater 
than the mean DS of groups SRR, SRC, and SRNP (p < .05). Although 
the mean DS of SRR was greater than either SRC or SRNP, this difference 
failed to reach significance.
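
The F ratios in Table 1 follow directly from the tabled mean squares, since each is the effect mean square divided by the within-groups mean square. The Python sketch below reproduces that arithmetic from the values printed in the table; the variable and function names are illustrative, and no p values are computed.

```python
# Mean squares and degrees of freedom as printed in Table 1.
MS_WITHIN = 56.804
EFFECTS = {
    "Precondition": {"df": 3, "ms": 398.480},
    "Sex": {"df": 1, "ms": 57.949},
    "Precondition x Sex": {"df": 3, "ms": 28.211},
}
DF_WITHIN = 56


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by the error mean square."""
    return ms_effect / ms_error


def total_df(effects, df_within):
    """Effect and within-groups df should add up to N - 1."""
    return sum(effect["df"] for effect in effects.values()) + df_within


if __name__ == "__main__":
    for name, effect in EFFECTS.items():
        print(f"{name}: F = {f_ratio(effect['ms'], MS_WITHIN):.3f}")
    print("total df:", total_df(EFFECTS, DF_WITHIN))
```

The total of 63 degrees of freedom matches the 64 children in the study.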

Since the DS score is actually a derived score obtained by subtracting TR 
from SSCL, it was possible that most of the variance could be attributed to 
the levels of either SSCL or TR. This was not found to be the case, since 
the correlations between DS and SSCL (.61) and between DS and TR 
(—.49) were not significantly different from each other (p > .10). 

It was demonstrated that in groups in which compliance with instructions 
had been previously rewarded (SRC and SRNP) little or no discrepancy was 
observed between the criteria predicted by Ss and the observed self-reward 
schedule of these individuals. That is, children in these groups apparently 
exhibited “self-control.” On the other hand, individuals in a group in which 
compliance with instructions was not rewarded (SRN) did not comply later 
with the schedules of reinforcement that they themselves had previously 
selected. These children showed little or no “self-control.”

These results strengthen the position that individuals in SRN saw nothing
to gain by complying with previously selected criteria. That is, they anticipated
no subsequent approval for compliance with high self-imposed performance
demands and consequently construed the situation as one in which they would
“get what they took.” Conversely, groups SRC and SRNP probably resembled
that population that comprises most of the subjects of psychological
experiments. That is, they wanted to be “good subjects” and as such they
were willing to work hard and not “cheat.” They thereby hoped to receive
the praise and acknowledgement of a “good job well done” from the
experimenter.

The epitome of this approval-seeking attitude was observed in subjects who 
continued to comply with instructions even when they were required to 
perform obviously trivial tasks and then told to destroy their work (7).
Although in that study it was, in fact, impossible for the experimenter to know
how a particular subject performed, the subjects themselves were convinced 
that the experimenter could somehow find out. 

Although this study was no more than a laboratory analogue of S-R
which analyzed the performance of children on a relatively nonmeaningful
task, the results nevertheless possess relevance for self-directed therapy
programs. First, this study supported the hypothesis that S-R, without the
concurrent social reinforcement dispensed by an external agent (therapist), is unable
to maintain low probability behavior. Any attempt at this time to eliminate
the therapist completely from a behavior therapy program must be considered
premature.

It appears to this experimenter (who is also a behavior therapist) that the 
recognition of this fact can lead applied psychologists to pursue one of two 
courses. They can continue in the direction in which they now appear to be
headed; that is, toward developing more sophisticated and elaborate self- 
treatment programs. Provided that provisions are made for concurrent ex- 
ternal reinforcement of changes brought about through the programming of 
self-administered consequences, progress in the treatment of disorders result- 
ing from the apparent loss of self-control (e.g., alcoholism, sexual deviations) 
will continue to be realized. 

The other alternative is to return to the strategies that self-administered 
therapy programs initially served to replace. We can devote our energies to 
devising better ways to control the situational determinants of problem be- 
haviors (12). This endeavor would ultimately lead to the engineering of 
societies in which problem behaviors would have little or no opportunity to
develop.


D. Summary


It was hypothesized that the behavior maintaining properties of self-rein- 
forcement (S-R) were a direct function of either external reinforcement or 
the external reward anticipated by a subject. 

Sixty-four fifth grade children were given an opportunity to comply with a 
self-selected schedule of reinforcement by dropping a marble through a 
chute. For purposes of this study, compliance with a self-selected criterion was 
defined as S-R. It was predicted that subjects previously rewarded during a 
precondition phase for complying with predetermined criteria would sub- 
sequently comply with criteria they themselves selected; i.e., they would ex- 
hibit S-R. Conversely, it was predicted that subjects previously rewarded for 
not complying would subsequently not comply with self-selected criteria. Both 
predictions were supported. The relevance of these data for self-directed 
therapy programs was discussed.


VERIDICALITY OF INTERPERSONAL PERCEPTIONS BASED
UPON PHYSIOGNOMIC CUES*


Department of Psychology, Hanover College 


ROGER L. TERRY AND WILLIAM G. SNIDER


A. INTRODUCTION 


In the course of daily social interactions people must often form immediate 
judgments or impressions of other persons on the basis of extremely restricted 
information. There are two components of such judgments; viz., (a) con- 
sensus or agreement in the perceptions of the stimulus person, which is a 
prerequisite for (b) veridicality or accuracy of the perceptions. Since the 
interaction process is affected by the perceptions of the actors, it is important 
for the perceptions to be not only consensual, but veridical. Because most social
interactions do not cause difficulties for the actors, it is apparent that inter- 
personal perceptions are typically veridical. 

However, systematic investigations of person perception in conditions of 
limited cue information have generated conflicting results (2, 3, 4, 10), indi- 
cating considerable consensus in the perceptions, but relatively little veridical- 
ity. For example, the research of Secord and his associates (1, 5, 6, 7, 8, 9) 
indicates not only that persons show little reluctance in forming judgments 
of personality from physiognomic cues, but also that there is great consensus 
in these judgments; however, adequate tests of veridicality were not
conducted. Intuitively, judging another person’s personality is a more difficult
task than judging his social position. That is, judges may first categorize the
stimulus person and then attribute to him personality traits that are congruent
with the social category (4). Informal observations show that college students
can predict with surprising accuracy the social category to which a fellow
student belongs. The present study was designed to investigate this phenomenon
under controlled conditions by examining the veridicality in assigning
opposite-sex persons to socially significant categories when only physiognomic cues are
provided.


B. METHOD

1. Subjects 
The subjects (Ss) were 24 college males and 24 college females. They
participated in one of four identical testing sessions without monetary
compensation, although they did receive credit toward their introductory
psychology course. To control for the biasing effects of prior familiarity, Ss were
instructed to admit acquaintance with any of the stimulus persons; none of
them did so.

* Received in the Editorial Office on March 27, 1972, and published immediately at
Provincetown, Massachusetts. Copyright by The Journal Press.


2. Stimulus Persons 


Head-and-shoulder, black and white photographs of 48 males and 48
females were selected from recent college yearbooks. The photographs of each
sex group were assembled into 12 sets of four pictures each. As determined
from resumes provided in the yearbooks, one male in each set was described
as inactive in extracurricular events (no activities listed), one as an athlete
(varsity football player), one as an honor student (membership in at least
one scholastic honor society), and one as active in extracurricular events
(participation in at least five activities). Similarly, one photograph in each
female set pictured a coed who was inactive in extracurricular events, one
who was a beauty queen (homecoming queen or candidate, fraternity
sweetheart, or cheerleader), one who was an honor student, and one who was
active in extracurricular events. Pictures were selected which unequivocally
fit these descriptions.


3. Procedure 


Male Ss examined the 12 sets of female photographs, and the female Ss
examined the 12 sets of male photographs. Each S’s task was to match each
picture in a set with one of the four possible descriptions by checking an
answer to the question, “Which picture best fits the stereotype of (a specified
social category)?”


C. RESULTS AND DISCUSSION


The significance of the difference between the observed number of Ss
who accurately matched a picture and the expected number was determined
by the chi-square test. These analyses are presented in Table 1. A chi-square
result was treated as a measure of perceptual veridicality. It is clear that both
males and females were able to match a photograph with the correct
description above a chance level. The total chi square for the males was 260.74
(df = 12; p < .001), and the average was 21.73 (df = 1; p < .001). For
the female Ss the total chi square was 100.63 (df = 12; p < .001), and
the average was 8.39 (df = 1; p < .01). While both groups of subjects had
more veridical perceptions than would be expected by chance, male Ss were
more accurate than female Ss. However, our male Ss and female Ss did not


TABLE 1
CHI-SQUARE ANALYSES OF THE DIFFERENCE BETWEEN OBSERVED AND
EXPECTED VERIDICAL RESPONSES (df = 1)

            Male Ss             Female Ss
Set       χ²       p          χ²       p
 1        .50    <.50        4.50    <.05
 2        .89    <.50         .50    <.50
 3      12.50    <.001      14.22    <.001
 4      37.56    <.001      10.89    <.001
 5       2.00    <.20       20.06    <.001
 6      56.89    <.001        .06    <.50
 7       3.56    <.10        1.39    <.30
 8       5.56    <.02       16.06    <.001
 9      20.06    <.001       2.00    <.20
10      40.50    <.001       9.39    <.01
11      80.22    <.001       3.56    <.10
12        .50    <.50       18.00    <.001

examine the same photographs, so a direct comparison of these two groups
is not possible.
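
The article does not spell out how each chi-square was computed. One reading that is consistent with the tabled values is a goodness-of-fit test with one degree of freedom on correct versus incorrect matches, taking chance accuracy as 1/4 and counting 96 judgments per set (24 Ss times 4 pictures). Both figures are assumptions inferred from the design rather than statements from the source, and the function name below is hypothetical.

```python
def chi_square_vs_chance(correct, total, p_chance=0.25):
    """One-df goodness-of-fit chi-square for correct vs. incorrect matches.

    Assumes each judgment is correct with probability p_chance under the
    null hypothesis (1/4 for a four-alternative matching task).
    """
    if not 0 <= correct <= total:
        raise ValueError("correct must lie between 0 and total")
    expected_correct = total * p_chance
    expected_incorrect = total - expected_correct
    incorrect = total - correct
    return ((correct - expected_correct) ** 2 / expected_correct
            + (incorrect - expected_incorrect) ** 2 / expected_incorrect)


# Assumed design: 24 Ss x 4 pictures = 96 judgments per set, 24 expected correct.
JUDGMENTS_PER_SET = 96

if __name__ == "__main__":
    for n_correct in (27, 39, 56):  # illustrative counts, not reported data
        chi2 = chi_square_vs_chance(n_correct, JUDGMENTS_PER_SET)
        print(f"{n_correct} correct of {JUDGMENTS_PER_SET}: chi-square = {chi2:.2f}")
```

Under these assumptions, 27 correct matches out of 96 gives a chi-square of .50 and 56 correct gives 56.89, both of which appear in Table 1. The observed counts themselves are not reported in the article.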

Males were best able to identify those females who were active in extracurricular
events (χ² = 112.67; df = 1; p < .001), less able to identify those who
were noted for their physical attractiveness (χ² = 73.50; df = 1; p < .001),
even less able to identify those who were inactive in extracurricular events
(χ² = 32.67; df = 1; p < .001), and least able to identify those who were
honor students (χ² = 14.52; df = 1; p < .001). Females were more able
to identify athletes (χ² = 112.67; df = 1; p < .001) than persons who were
active in extracurricular events (χ² = 14.52; df = 1; p < .001), but they
were not able to identify honor students (χ² = .67; df = 1; p < .50) or
those who were inactive in extracurricular events (χ² = .01; df = 1;
p < .95).

Beyond chance expectations, our male Ss were able to identify
photographed females correctly as involved or not in extracurricular activities, as
beauty queens, or as honor students. Female Ss had difficulty correctly
identifying photographed males who were not involved in extracurricular
activities and those who were honor students. This last result could have been
due to the fact that eight of the photographed males and only four of the
photographed females wore glasses. Although there was no relationship in fact
between social category and wearing glasses, three of the females and seven
of the males were identified by most of the opposite-sex Ss as honor students.
This tendency to perceive persons who wear glasses as intelligent, scholarly,
etc., was highly significant (χ² = 21.78; df = 1; p < .001), thus supporting
previous research (11, 12).


D. Summary


From yearbook photographs 24 college males and 24 college females attempted to
identify opposite-sex persons who were involved or not in extracurricular
activities, honor students, athletes (judged by females), or beauty queens
(judged by males). Male Ss had higher veridicality scores than female Ss,
although both groups scored beyond chance expectations, supporting the
assumption that consensual and veridical interpersonal perceptions can be
formed from physiognomic cues.


CLINICAL PSYCHOLOGY IN THE PSYCHOLOGICAL
ABSTRACTS, 1927-1970*


Washington University


H. MELTZER


A. INTRODUCTION


“For man, the unexamined life is, indeed, not worth living.” This is the
way Bertrand Russell ends his book, The Wisdom of the West (32, p. 313).
According to Dr. Watson (38, p. 321), clinical psychologists have not
profited from this contribution to wisdom and have been “surprisingly
ahistorical.” He expresses the state of affairs today as being reminiscent of the
situation found by Kimball Young in 1923 in tracing the history of mental testing,
wherein he said, “Making history on every hand as we are, we have a notion
that we somehow have escaped history” (40, p. 1). On the other hand, Dr.
Garfield’s impression is quite to the contrary; namely, that clinical
psychologists have been “a particularly introspective, soul searching and a self-critical
group” (14, p. 353). In a more negative way he describes them as having
“the appearance of being chronically dissatisfied.” In an historical paper
considering some of the same features (13, p. 14) he says:

Whatever one’s view, it is apparent that clinical psychologists are 
deeply concerned about their professional role in society, the social 
demands placed upon them for professional services, the dynamic state 
of the society in which we live, and the implications of such matters for 
professional training in our field. Such concern, although potentially 
morbid, is not necessarily bad. Although a young science, and an even
younger profession, clinical psychology has now attained some personal
history and, perhaps, there is need for self-appraisal and the charting of
pathways for the future.


Actually, although clinical psychology can profit from a knowledge of the 
“there and then,” in the light of which it can confront the “here and now,” 
it has led a relatively much-examined if not always a well-examined life. 
When there was not enough work in the field of clinical psychology worth
mentioning, it was already being reviewed. Brotemarkle (4) dates the beginning
of the field to the autumn of 1896 when Witmer submitted a paper for the
American Psychological Association which he called “The clinical method in
psychology and the diagnostic method of teaching.” The history of the clinic,
we are advised, was well known. The first decade was clearly and concisely 
recorded by Witmer in Volume I of the journal, The Psychological Clinic, 
which he founded in 1908; Fernberger brought the record to date in 1931.
Brotemarkle ends his review of the first 50 years with the statement (p. 4)
that “The rapid development of clinical psychology could not have occurred
had not its fundamental methods, procedures and techniques, and point of
view been so clearly and concisely stated by its founder.” He epitomizes the 
history of clinical psychology in the first 50 years as “1896, ‘a case of bad 
spells’—1946, ‘a spell of bad cases,’” and then advises us that “from the case 
of a ‘chronic bad speller,’ we have progressed to the most prodigious demand
by the Veterans Administration for clinical psychologists to aid with America’s
worst spell of neuropsychiatric cases.” In spite of Witmer’s predominant role
in the early pioneering days, it was not Dr. Witmer but Dr. Healy who was
responsible for influencing the role of psychologists in child guidance and
mental hygiene work as described in a number of sources (14, 26, 38). 

The story of the growth of the American Association of Applied Psychology
can be learned about from two sources. First is the presidential address by
Donald G. Paterson (30, p. 1) in 1939, wherein his contention was that
“the birth labor of applied psychology began in April of 1917, and was
completed on the day of the armistice in 1918.” He considered the completion of
the organization of the American Association of Applied Psychology in 1938
to be the time when it could be considered that applied psychology had reached
its majority on scheduled time. In that article, he named people who had
achieved in the various areas of applied psychology. There are 30 names
under clinical and a much smaller number in other areas of applied
psychology. The second source is The Handbook of Applied Psychology, edited by
Douglas H. Fryer and Edwin R. Henry (12), which describes how the
whole field of applied psychology appeared to the founders of the AAAP. The
role of orthopsychiatry and its influence on the private practice of psychologists
is described in several articles written by the author (21, 22, 23, 25). The
nature of private practice, as being responsible for the lack of empirical
research by people in practice, and suggestions for increasing their productivity
are comprehensively considered in a recent study in the Journal of Consulting
and Clinical Psychology (39).

An historical survey of the field is readily available in two books: Clinical
Psychology: Science and Profession by Dr. Mensh (28) and The Clinical
Psychologist by Lubin and Levitt (19). In Dr. Mensh’s book, particularly
in the section on selected readings, are included contributions by Kelly (17)
on the postwar decade, by Meehl (20) and Holt (16) on clinical vs.
statistical predictions, and by Hathaway (15) on the clinical psychologist. The volume
ends with an evaluative article by Cook (7) on “The scientist-professional:
Can psychology carry it off?,” which presents several possible courses of
action open for carrying it off. This is very unlike what was expressed in the
1969 APA presidential address by Dr. Albee (2, p. 1071) in which he
advised us that, “Clinical psychology has entered a paradoxical phase in its
development where its problems of identity and relevance threaten it with
extinction at the same time that its opportunities seem boundless.”

In Lubin and Levitt’s The Clinical Psychologist we have supplementary 
material of an historical nature, including such topics as training and back- 
ground, roles and functions, interprofessional relations and communications, 
and international aspects of clinical psychology. There also, of course, is 
David Shakow’s story in Clinical Psychology as Science and Profession: A 
Forty Year Odyssey (37). Research in clinical psychology has also been 
surveyed, particularly by a series in the Journal of Clinical Psychology by 
William Schofield (33, 34, 35, 36), and later continued in the same journal 
by W. Grant Dahlstrom (8, 9). 

The story of clinical psychology through the years, from its beginning until 
the present time, is one that has been examined, and knowledge about it is 
available in the aforementioned reference sources. The purpose of the present
article is to contribute to the history of clinical psychology in a limited but
tangible manner by the study of contributions classified as clinical psychology
in the Psychological Abstracts from the time of its inception in 1927 up to and
including 1970.


B. PURPOSE OF THE PRESENT STUDY


The availability of the Psychological Abstracts as a source makes it possible 
to compare yearly output in a given field from the time of its beginning in 
1927. The articles are abstracted in brief and do not lend themselves to 
evaluative study of the nature, comprehensiveness, or quality of the
publications; but they do lend themselves to a count study, making possible a
comparison of output over the years. By comparing the number of articles reported
in any given year with any other given year, it is possible to compare
productivity considered in terms of output per year and to observe increases and
decreases over the years. By so doing, it is possible to discover in what years
the gains were made, the extent to which they have been sustained over the
years, and the general growth trend as far as output per person is concerned.
To measure the extent of growth considered on a year by year basis as well as
to compare this with output per person in proportion to membership in
Division 12 of the APA is the purpose of this study. More specifically, what
is included is an investigation of growth trends considered in terms of (a)
increase in total productivity as recorded in the Psychological Abstracts on
a year by year basis, interpreting productivity merely to mean the number of
contributions; (b) growth in membership; (c) productivity trends considered
in terms of output per person; (d) comparison of productivity trends,
particularly during the past 12 years, of clinical with industrial; and (e) growth
trends in doctorate degrees granted over the years.


C. CONTRIBUTIONS REPORTED IN THE Psychological Abstracts, 1927-1970


All the articles appearing in the Psychological Abstracts under captions 
signifying clinical psychology were tabulated for the purpose of making the 
comparative growth study over the years. From 1927, when the Psychological 
Abstracts first appeared as a separate monthly, through 1933, the articles 
signifying clinical psychology were classified under two main captions: 
“Special Mental Conditions” and “Nervous Mental Disorders.” In 1934, 
“Personality and Character” appeared as a caption and also included articles 
pertaining to clinical psychology, and this continued through 1936. In 1937, 
and continuing through 1946, the captions used were “Psychoanalysis, 
Dreams, Hypnosis,” “Functional Disorders,” “Personality and Character,” 
and “Crime and Delinquency.” In 1947, the same large categories were used, 
but “Functional Disorders” was subdivided into seven areas, including 
Clinical Neurology, Psychoses, Psychoneuroses, Psychosomatics, Speech Dis- 
orders, Therapy and Rehabilitation, and Clinical Psychology and Mental 
Hygiene. 

In 1948, a major change took place in the organization of articles
pertaining to clinical psychology. All articles appeared under two main captions:
“Clinical Psychology, Guidance and Counseling” and “Behavior Deviations,”
each of which had several subdivisions. These two categories continued until
1966, when for the first time the main caption of “Clinical Psychology” as
such appeared. From 1966 through 1970 the selection of articles was greatly
simplified for the reason that they are all classified under the same caption,
Clinical Psychology, representing one of the major areas within the field of
psychology as a whole. In 1966 there are 16 subcaptions, and in 1970 there are
19 subcaptions. Greater consistency in classification of articles signifying clinical
psychology from 1966 onward is assured by the present system.

The number of articles abstracted annually from 1927 through 1970 is
indicated in Table 1. What does the study of Table 1 show about the growth,
extent of productivity over the years, gains sustained, and the general trends
considered in the light of increase in the present number of contributors who
are identified with the field of clinical psychology?


TABLE 1
CLINICAL PSYCHOLOGY ARTICLES IN THE Psychological Abstracts, 1927-1970

Year   Number     Year   Number     Year   Number
1927     636      1942    1269      1957    3026
1928     884      1943    1116      1958    1928
1929    1013      1944    1014      1959    3640
1930    1198      1945     902      1960    2730
1931    1011      1946    1144      1961    1695
1932    1207      1947    1427      1962    1934
1933    1424      1948    1838      1963    1842
1934    1204      1949    2095      1964    1892
1935    1296      1950    2089      1965    2686
1936    1194      1951    2567      1966    3427
1937    1492      1952    2524      1967    4202
1938    1607      1953    2723      1968    5032
1939    1568      1954    2917      1969    4381
1940    1604      1955    3116      1970    6255
1941    1414      1956    2645


The number of contributions as reported in Table 1 indicates marked increases over the years from
636 in 1927 to 6255 in 1970. From 1927 until 1929 the increase was from
636 to 1013. Thereafter, with but one exception (902 in 1945) there were
always more than 1000 contributions abstracted each year. The average for
the Twenties (1927-29) was 884. The average for the Thirties was 1250,
ranging from 1011 in 1931 to 1607 in 1938. The average for the Forties was only
1201, ranging from a low of 902 in 1945 to a high of 2095 in 1949. There
was a definite lull in productivity during the early 1940s; the most obvious
reason seems to be that it was influenced by World War II with the greatest
effect being in 1945. It is also apparent that some material accumulated during
the war years was published during the first few years following the war;
1948 and 1949 showed that kind of increase. The Forties definitely had
peculiarities related to the war; psychologists in the service apparently were
not in position to publish as usual.

In the 1950s the trend seemed to be again normal as far as contributions
reported in the field were concerned. The average for the Fifties was 2629, ranging from
1928 in 1958 to 3640 in 1959. The low was 1928 in 1958, a year that had the
special capacity of being low in contributions in clinical as well as in industrial
psychology, and probably in other areas as well. The reason for 1958 being an
extremely lean year will be considered when both clinical and industrial are
compared later in this study. That it was an unproductive year, preceded and
followed by relatively more productive years, is apparent. There were 3026
contributions in the preceding year of 1957, and 3640 in the succeeding year
of 1959.

In 1960 the number of contributions again went down to 2730 and receded 
further for the next four years. The membership (as indicated in Table 3) 
did not decrease during this period, but the productivity per person was 
reduced considerably. This may be related to two factors in the national 
scene: a lower national economy and the beginnings of greater involvement 
in Vietnam. In 1966 the number of contributions rose to 3427, and it stayed 
high, not going below 4000, thereafter. By far the most productive year in 
the past several years was 1970. 

A glance at the data in Table 1, then, shows that there were peak years
and lean years and that there is no sustaining influence that contributes to
continuing productivity. There are peak and lean years in every decade; in
the early 1930s this was probably due to the lingering effects of the
Depression; in the 1940s it apparently was the effect of World War II. In the
1960s there was a substantial drop from 1960 (2730 publications) which
continued for four years; in 1965 the number approximated the 1960 position
but was still lower than 1959. After 1965 the number climbed again until it
reached its pinnacle in 1970. Some of the contributing factors accounting for
these fluctuations apparently included the national economy, war circumstances,
and also the effect of the management of the Psychological Abstracts in its
editorial staff. This is further explored to evaluate the role of changing
editorship as a contributing factor.


D. PRODUCTIVITY IN CLINICAL PSYCHOLOGY, 1948-1970


So far we have reported the output of contributions on a yearly basis from
1927 to 1970, considering some of the reasons for the fluctuations between
peak and lean years that appear. One way of obtaining a perhaps more
meaningful index of productivity is to compare the extent to which increased
membership made a difference in terms of total productivity and in terms of
productivity per person. If membership in Division 12 (Clinical) of the American
Psychological Association is considered as an index of potential contributors,¹
we can compare the gains made in membership to the amount of productivity.
The writer is aware of the fact that, strictly speaking, only the contributions
of members in Division 12 would yield an index that is unreservedly a valid
measure for determining productivity per person by members of Division 12.
However, the difficulty of the task makes checking Division membership
against the authors of articles inadvisable. Because all contributors were not
necessarily members of Division 12, the indices indicated in Table 2 are
higher than they would be if nonmembers were excluded. Since the general
trend reported in this study shows that with increased membership there was
not the proportional increase in productivity, but that productivity per person
was lowered, exclusion of nonmembers would show a more marked decrease
in productivity than is reported in this study. The number of articles per
year, paralleled with the increase in membership from 1948-1970, is indicated
in Table 2, which also shows productivity per person.

¹ Data concerning membership in the Clinical Division, which began to organize
in 1946, are available from 1948 onward from the APA. For this information
we are gratefully indebted to Mrs. Jane D. Hildreth.

In 1948 there were 787 members, and the increase in membership from 
then on was continuous to 1970 with 3640 members. In 1955 there were 1711 
members composed of 1123 members and 588 fellows. In 1964 the 2883 
members included 2119 members and 764 fellows. In 1970, the 3640 mem- 
bers included 742 fellows. If productivity per person were constant, there 
would have been a continuous increase in productivity, and that certainly is 
not evident. Increased membership quite obviously did not consistently result 
in more contributions. The availability of more contributors made possible 
more total production in many years, but not so considered in terms of produc- 
tivity per person. Actually, the year with the greatest productivity per person 
was 1948, the year for which these data were first recorded, with 2.34. This 
was the peak year not only for clinical, but also for industrial psychology (24, 
27), where in 1948 the output per member was 2.45, but in 1958 it was only 
.60. As a matter of fact in industrial (as seen in Table 3) it did not reach 
even 1.0 after that. The growth of membership in the Industrial Division 
had ups and downs and slow years, but in clinical the increase has been con- 
tinuous. The level of productivity, however, particularly when considered 
per person, is definitely not determined by the size of membership alone. 
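
The index used in this section is simply the number of abstracted articles divided by Division 12 membership. A minimal sketch of that arithmetic follows, using only the 1948 and 1970 figures quoted in the text and in Table 1; the function and variable names are hypothetical.

```python
def output_per_member(articles, members):
    """Productivity index: abstracted articles per division member."""
    if members <= 0:
        raise ValueError("membership must be positive")
    return articles / members


# (articles abstracted, Division 12 members) as quoted in the text and Table 1.
FIGURES = {
    1948: (1838, 787),
    1970: (6255, 3640),
}

index = {year: round(output_per_member(articles, members), 2)
         for year, (articles, members) in FIGURES.items()}

# Growth factors between 1948 and 1970 for membership and for output.
membership_growth = FIGURES[1970][1] / FIGURES[1948][1]
articles_growth = FIGURES[1970][0] / FIGURES[1948][0]

if __name__ == "__main__":
    for year, value in index.items():
        print(f"{year}: {value:.2f} articles per member")
    print(f"membership grew {membership_growth:.2f}x, "
          f"articles grew {articles_growth:.2f}x")
```

The computed indices agree with the 2.34 reported for 1948 and the 1.72 reported for 1970. Because membership grew by a larger factor than output over the period, the per-person index fell even though total output rose.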

In clinical psychology from 1949 onward, the ratio of membership to output 
never went beyond the 1.99 of that year. In 1958, the productivity per person 
dropped down to .95, which was somewhat higher than the figure for indus- 
trial which was only .60. The years preceding and following for clinical were 
almost identical: 1.59 in 1957 and 1.60 in 1959. The 1960s were markedly 
lower until 1970. The years from 1966 through 1970, when the editorial 
policy was consistent, did not result in constancy in productivity. 

During the 1960s there were three different editors for the Psychological 
Abstracts, each of whom may have had a different manner of classifying 
articles. That there was some difficulty in the editing of the publication is 
indicated in correspondence with Dr. Horace B. English who was the editor 
from April 1959 to December 1960. [For details concerning editors and the
editorial policy of Psychological Abstracts from 1927 through 1970, see
Meltzer (27, pp. 129-130).]

Since we have available data concerning productivity considered in terms 
of total output per year as well as productivity per person for industrial, some 
light might be thrown on the reasons for the various fluctuations in various 
years by comparison of the productivity of Division 12 and Division 14 
(Industrial), particularly for the years from 1959 through 1970. These facts 
are presented in Table 3.

The general conclusion in analyzing the productivity per person in indus- 
trial psychology is that the trend was for more producers to produce less; in 
1959 the ratio was 1.36, and in 1970 it was .81, with all the years in between 
below 1.0. In clinical the trend was not that steady, and there was a marked 
difference, particularly an increase during the last few years. It began with 
1.60 in 1959, then went down to 1.15 in 1960, and for the next five years was 
below 1.0. In 1966 it rose to 1.12 and thereafter increased to 1.23; 1.48 in 
1968, the highest for the decade; and 1.24 in 1969. It rose to 1.72 in 1970. 
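The per-person figures discussed above are simple ratios of abstracted contributions to division membership. A minimal sketch of that arithmetic follows; the function names are illustrative, and the membership value is only what the quoted 1970 output (6255) and ratio (1.72) imply, not a figure read from Table 3.

```python
def productivity_per_person(total_output: int, membership: int) -> float:
    """Abstracted contributions per division member, rounded to two decimals."""
    if membership <= 0:
        raise ValueError("membership must be positive")
    return round(total_output / membership, 2)


def implied_membership(total_output: int, ratio: float) -> int:
    """Approximate membership implied by an output total and a per-person ratio."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return round(total_output / ratio)


# 6255 and 1.72 are the 1970 clinical figures quoted in the text; the
# membership computed here is an inference from them, used for illustration.
members_1970 = implied_membership(6255, 1.72)
ratio_1970 = productivity_per_person(6255, members_1970)
print(members_1970, ratio_1970)
```

Because the ratio depends on both terms, a year with rising output can still show falling productivity per person if membership grows faster.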

From 1961 until 1966 the trends were relatively similar in the two divisions, 
and the underlying causes are probably similar. The upward trend in 
clinical, which did not manifest itself in industrial, can inferentially be due 
to the fact that an increasing number of people in industrial are connected with 
large consulting organizations, a number of consulting psychologists have 
moved into business schools, and there is documented evidence that people 
who work in industry definitely produce less than those connected with 
universities. An article on “Productivity rate in research scientists” (11) reported 
the number of publications per year in the years 1959-1964 for a group of 
40 research scientists. The number was 4.81 for those who worked in 
universities, 1.05 for those who worked in industry, and 4.5 for those who 
worked in government. That this was significantly larger than the productivity 
for the psychology group is understandable. However, there are distinct 
differences between trends in industrial and clinical; whereas, in industrial, 
productivity was consistently less, in clinical there was the same general trend 
for more contributors to produce less, but it was not as consistent or generalized 
as in industrial. One possible difference between the two psychology 
groups is that more people from clinical are affiliated with universities, and 
this usually means more publications and more emphasis on research, particularly 
in those universities that have a “publish or perish” policy. 


2 In this study, as well as other studies on skew of distribution in productivity, 
only productivity in terms of numbers of contributions made is studied, 
without regard to size, quality, or comprehensiveness. 


JOURNAL OF PSYCHOLOGY 




TABLE 3 
PRODUCTIVITY IN CLINICAL AND INDUSTRIAL PSYCHOLOGY, 1959-1970 
[Rows: membership, total productivity, and productivity per person for clinical and industrial; doctorate degrees granted for clinical, industrial, and experimental. Columns: each year, 1959 through 1970.] 


H. MELTZER 


In interpreting data of this kind the fact must be remembered that the 
skew reported in all studies of research holds true for clinical, as indicated by 
Levy (18), in the distribution of contributions; 28.9 percent of Rabin’s 
group had no publications to their credit, 19.2 percent had only one publication, 
and 10 percent of the group accounted for 45 percent of the output. 
Barclay (3) presented some evidence that indicates that the skew characterizes 
psychologists of every division, not just clinical psychology. It is not surprising 
to find that kind of skew also characterizing clinical and industrial; but in 
industrial the general trend seems fairly consistent in that more contributors 
produce less when productivity per person is considered. In clinical if all 
years are considered, a similar trend is found; however, there are inconsis- 
tencies particularly from 1966 on. There seems to be more consistency on a 
higher level than in previous years. The highest year for clinical and indus- 
trial, as reported in Table 3, was 1959. The relative gains in clinical, as com- 
pared to industrial, from 1966 onwards may be due to a number of factors, 
such as the system of classifying articles in the Psychological Abstracts which 
has been more consistent since 1966, and the ratio of members connected with 
universities. All evidence indicates that university contact definitely makes 
for more research and publication, whereas an increase in private practice 
definitely lowers per unit productivity (39). Identification with industrial 
organization has the most marked effect in reducing research and publication. 
Some evidence of change in authorship, affiliation, and research support is 
reported in a recent article in the Journal of Applied Psychology (5). 
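The skew described in this paragraph (a sizable share of people with no publications and a small group accounting for much of the output) can be summarized with two simple statistics. The sketch below assumes a plain list of per-person publication counts; the counts are hypothetical and are not data from any study cited here.

```python
def top_share(counts, top_fraction=0.10):
    """Share of total output produced by the most productive `top_fraction` of people."""
    ranked = sorted(counts, reverse=True)
    n_top = max(1, round(len(ranked) * top_fraction))
    total = sum(ranked)
    return sum(ranked[:n_top]) / total if total else 0.0


def share_with_count(counts, k):
    """Proportion of people whose publication count equals k."""
    return sum(1 for c in counts if c == k) / len(counts)


# Hypothetical publication counts for 20 people (illustration only).
counts = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4, 5, 6, 12, 17]

print(f"no publications: {share_with_count(counts, 0):.1%}")
print(f"output from top 10 percent: {top_share(counts):.1%}")
```

With a distribution this skewed, the mean output per person says little about the typical member, which is why the per-person ratios need to be read alongside the distribution.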

Data are available concerning doctorate degrees granted in three subfields 
of psychology from 1940 to 1970, which may throw some light on potential 
contributors moving into each field and thus being in a position to contribute 
and increase productivity in the field. Both clinical and industrial had slow 
starts, but from 1953 onward clinical spurted as compared to industrial. In 
industrial there were no doctorates recorded for 1953, but there were 23 for 
clinical in that year; in 1954 there were 125 in clinical and only 9 in indus- 
trial. There were 11 industrial doctorates in 1955, 59 in 1967, 27 in 1968, 
and 75 in 1970 (see Table 3). 

In clinical there were 125 Ph.D.s awarded in 1954, 194 in 1955, but only 
121 in 1956. From 1957 through 1967, the number rose steadily from 236 to 
439; it went down to 313 in 1968 and up to 549 in 1970. In terms of possible 
contributors determining productivity index per person, it is understandable 
that clinical would have advantages over industrial. Interestingly enough, 
clinical compares very favorably with the number of doctorates awarded in 
experimental, physiological, and comparative psychology combined. In 1946, 
when clinical had no doctorates granted, experimental, physiological, and 
comparative already had 56. This number climbed to 139 in 1948 and then to 
281 in 1953; thereafter it had its ups and downs with a general trend toward 
increasing doctorates awarded, with a high of 504 in 1970. Clinical is 
surpassing experimental, physiological, and comparative in sheer numbers of 
doctorates awarded, but not necessarily in productivity per person. More 
comparable data for the experimental, physiological, and comparative group 
are not available to the writer at the present time, but the odds are in favor 
of more productivity in that group, since more of them are connected with 
universities or in settings where they primarily do research. 

Thus, clinical, more than industrial, is producing so many Ph.D.s that it 
should be in a better position to be productive in total output per year, as 
reported in the Psychological Abstracts, and even more so in productivity per 
person each year. Both divisions would gain if more of the people doing applied 
work full time do more publishing, and the academic people publish relatively 
less. There is a certain trend in that direction at the present time as far as 
the nature of subject matter being covered is concerned. The present emphasis 
of industrial in organizational psychology makes it less susceptible to 
consideration by academicians because it calls for study of actual organizational 
and managerial situations not available to academicians. 


E. SUMMARY AND DISCUSSION 


The purpose of this study was to examine the productivity of clinical 
psychology considered in terms of contributions reported in the Psychological 
Abstracts from its inception in 1927 through 1970. By use of this kind of 
analysis it has been possible to compare productivity to membership in Division 
12 over the years. This study concerns itself with an analysis of Division 12 
considered from the point of view of (a) total productivity per year as 
recorded in contributions to the Psychological Abstracts; (b) increase in 
membership; (c) productivity per year considered per person from 1948 through 
1970, 1948 being the first year for which information of this kind was 
available; (d) comparison of productivity per person per year with industrial 
(Division 14) over a 12-year period; and (e) trend in doctorate degrees 
granted in clinical, industrial, and experimental, physiological, and comparative 
psychology. 

Clinical psychology was described by Dr. Watson (38) as being ahistorical. 
Included in this study is a report of some of the many examinations, descriptions, 
and evaluations that have been made over the years, and they indicate 
that by and large the field has been much-examined if not always well-examined. 
The findings of the present study may be summarized as follows: 

(a) There has been a general trend toward an increased number of articles 
abstracted in the Psychological Abstracts over the years. In the beginning in 
1927, there were 636 articles, which increased to 1198 in 1930. In 1940 
there were 1604 articles, 2089 in 1950, and 6255 in 1970. 

(b) There have been peak years and lean years. Quite often peak years 
were preceded or followed by lean years. The leanest year in the 1940s was 
1945, and it was preceded and followed by somewhat larger numbers which 
were also relatively small as compared to earlier and later years. The out- 
standing lean year in the 1950s was 1958, which was preceded and followed by 
markedly higher output. The leanest year of the 1960s was 1961, but 1961, 
1962, 1963, and 1964 were all relatively lean years as compared to 1960. From 
1965 through 1970 the number increased, but was significantly lower in 1969 
than in 1968, and there was a sharp increase in 1970. 

(c) The productivity trend considered in terms of output per person was 
highest in the very first year in which membership in Division 12 was reported, 
1948, when the ratio was 2.34, a ratio never reached again. In the 
1950s there was one extraordinarily low year, 1958, in which the ratio of 
productivity was only .95. All other years in the 1950s had productivity rates 
ranging from 1.49 in 1956 to 1.99 in 1951. In the 1960s, the four years 
beginning with 1961 had extremely low rates, well below 1.0, but in 1960 it 
was 1.15. It went up again in 1965 to .92 and thereafter was always above 
1.0, the highest being 1.72 in 1970, which was much higher than the 1.48 of 
1968 and the 1.24 of 1969. 

(d) The number of doctorates granted by the various universities which 
offered Ph.D.s increased over the years. The doctorates in the clinical area 
began to increase significantly in 1954, when there were 125, whereas in 
1953 there were only 23. In 1952 there were seven, preceded by five and three 
in the two preceding years, with none in 1948 and 1949. The number of 
doctorates in clinical psychology from 1957 onward was above 200 per year, 
and from then on clinical surpassed experimental, physiological, and comparative 
in doctorates awarded, while during this period industrial definitely 
did not show that kind of increase. Several factors may have contributed to 
the lowering of doctorates in industrial as compared to clinical. A number 
of business schools have taken over organizational psychology, and students 
formerly registered as taking industrial psychology in psychology departments 
have probably moved into organizational psychology in schools of business. 
Clinical has a tempo all its own that is significantly higher than industrial as 
far as production of doctorates in the psychology department is concerned. In 
the larger schools the number of people in clinical is larger than in experimental, 
physiological, and comparative. 
According to Wolff (39, p. 281), “private clinical-consulting settings have 
produced remarkably influential observations and theory but minimal empirical 
research. Output over the past 25 years is limited to 15 empirical studies.” 
[For a fuller discussion of skew of distribution in productivity in various 
fields of psychology and in different settings as considered by Albee (2), Clark 
(6), and Eiduson (11), see Meltzer (27, pp. 135-136).] 
The findings of this study certainly do not indicate that the field is dying. 
Membership in Division 12 is on the increase, numbers of doctorates awarded 
annually are on the increase, and both are keeping up a relatively high level. 
Productivity per person is lower than one would like it to be, but it is still 
somewhat productive even though productivity is uneven and skewed. 
It would be well to supplement the facts about productivity with a study of 
the nature of the content of the material reported, in terms of both research 
and applied articles. Information about research in clinical psychology is 
available in studies reported by Schofield (33, 34, 35, 36) and follow-up studies 
by Dahlstrom (8, 9). A study of the nature of the publications in terms of 
topics considered and studies made may not give what Dr. Rotter (31) gives 
in his book on Clinical Psychology as the answer to the question, “What do 
clinical psychologists do?” but it may give some idea of what clinical psychologists 
publish or what is published in the name of clinical psychology. 
It might be well to end this article with a warning given by Martin 
Deutsch (10, p. 18): “Not only will it be necessary to develop the requisite 
knowledge and understanding, it will also be necessary to feed the new knowledge 
past the organizational barriers and into the structures of society's 
institutions.” 
A NOTE ON THE SEMANTIC STRUCTURES OF THE 
SCHOOL-RELATED ATTITUDES IN 
EXCEPTIONAL CHILDREN*1? 


University of Arkansas and Pennsylvania State University 


ELIZABETH C. THOMAS AND KAORU YAMAMOTO 


A. INTRODUCTION 


As a means of studying attitudes the semantic differential has been used 
frequently during the last decade (8, 12, 14). More recently the applicability 
of the semantic differential across cultures (13) and across age range (2) 
has come under systematic study. Di Vesta and his associates (3, 4, 5) pro- 
duced a series of investigations in which children from grades two to seven 
rated heterogeneous concepts. These studies indicate that (a) the children’s 
semantic structures are very much the same as those of adolescents and adults, 
the evaluation, potency, and activity factors being salient in both; (b) the 
semantic differential technique is acceptably stable (one- to two-day retest rs 
clustering around .60) with children as young as third graders; and (c) the 
reliability of concepts scores may approach the level obtained for adults (rs 
about .90) if at least 15 to 20 subjects are used in the rating process. A 
number of studies conducted in various settings have led most researchers to 
conclude that the semantic differential is a flexible and economic technique 
which possesses acceptable construct validity for psychoeducational use (1, 
OAT'S9% 

Husek and Wittrock (7) used a semantic differential composed of 80 
adjective pairs for rating the concept, teacher, with 259 college students in 
elementary and secondary education. Rather than finding separate factors of 
evaluation, potency, and activity, they found a large general evaluative factor 
which accounted for 30 percent of the common variance. It was accompanied 
by smaller factors of restraint, tenacity, predictability, and stability, each of 
which accounted for four to six percent of the common variance. More recently 
a version of the semantic differential was used by Neale and Proshek (11) 
with 350 culturally deprived elementary school students. A large evaluative 
factor emerged which accounted for over one-third of the total variance; 
secondary factors, activity and potency, were found to account for 
10 percent of the total variance. Yamamoto, Thomas, and Wiersma (18), 
using two categories of school-related concepts (people and curriculum), 
explored the semantic space of 400 seventh- and ninth-grade students. Three 
factors were identified related to people: Merit, Movement, and Security. 
These factors accounted for 20, 19, and 21 percent, respectively, of the 
common variance. The curriculum concepts also produced three factors, 
Vigor, Certainty, and Nameless, which accounted for 26, 26, and 13 percent, 
respectively, of the common variance. 

When one turns from normal children to exceptional children, 
there is little information on the semantic structure of their school-related 
attitudes. One exception is a study by Rybolt (16), who administered a 
modified semantic differential to 79 educable retarded adolescents. A factor 
analysis yielded results similar to those for nonretarded populations, yielding 
an evaluative factor accounting for 35 percent, a potency factor accounting 
for 23 percent, and an activity factor accounting for 26 percent, respectively, 
of the common variance. 

The present study investigated by factor analysis the semantic space of four 
groups of exceptional middle-school and high-school age children: namely, 
retarded, disturbed, blind, and deaf. 


B. METHOD 
1. Subjects 
a. The retarded. All 175 educable retarded students (IQ 60 to …) 
ranging in age from 10 to 17 years from a southern urban school system 
participated in the original testing. Of this pool, 35 boys and 35 girls were 
randomly selected from the 10 through 13 year age group, and likewise 35 
boys and 35 girls were randomly selected from the 14 through 17 year age 
group. Thus, a total of 140 educable retardates were involved. 
b. The disturbed. Two hundred seriously disturbed children from 10 to 
17 years who were patients in five psychiatric centers for children in a northern 
state participated in the testing. Seriously brain-damaged and multihandicapped 
children were excluded from the sample. All five institutions used a 
psychoeducational approach (10, p. 29) in their schools. From this pool, 35 
boys and 35 girls from the lower age group (10-13 years) and 35 boys and 
35 girls from the older age group (14-17 years) were randomly selected. 
Thus the total sample was 140. 
c. The blind. The blind subjects were 200 students from a southern 
state institution for the blind. Fifty boys and 50 girls each in two age groups 
(ages 10-13 and 14-17) participated in the investigation. 

d. The deaf. Two hundred deaf children from educational institutions in 
two southwestern states took part in testing. From this pool 35 boys and 
35 girls were randomly selected from the 10 to 13 year age group; likewise 
35 boys and 35 girls were randomly selected from the 14 to 17 year age 
group. Thus the total deaf sample was 140 students. 
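The selection described for the retarded, disturbed, and deaf groups (35 boys and 35 girls drawn at random from each of two age groups, 140 in all) amounts to stratified random sampling. A minimal sketch follows, assuming each tested child is a record with `sex` and `age_group` fields; the field names, the seed, and the equal-sized pool are hypothetical, since the source reports only pool totals.

```python
import random


def stratified_sample(pool, per_cell=35, seed=0):
    """Draw `per_cell` subjects at random from each sex-by-age-group cell."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    cells = {}
    for subject in pool:
        cells.setdefault((subject["sex"], subject["age_group"]), []).append(subject)
    sample = []
    for key in sorted(cells):
        members = cells[key]
        if len(members) < per_cell:
            raise ValueError(f"cell {key} has only {len(members)} subjects")
        sample.extend(rng.sample(members, per_cell))
    return sample


# Hypothetical pool of 200 tested children, 50 per cell (illustration only).
pool = [
    {"id": f"{sex}-{group}-{i}", "sex": sex, "age_group": group}
    for sex in ("boy", "girl")
    for group in ("10-13", "14-17")
    for i in range(50)
]

sample = stratified_sample(pool)
print(len(sample))
```

Sampling within cells keeps sex and age group balanced across the exceptional groups, so differences in factor structure are less likely to reflect differences in sample composition.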


2. Instrument 


The semantic differential was constructed to include nine seven-point, bi- 
polar scales for four people concepts (classmates, parent, teacher, and myself) 
and another, but overlapping, set of eight seven-point, bipolar scales for cur- 
riculum concepts (social studies, language, science, and mathematics). These 
17 scales had been identified in a previous study (18) with normal middle- 
school age children to represent three people factors (Merit, Movement, and 
Security) and two curriculum factors (Vigor and Certainty). 

In constructing the original instrument, the criterion of word and concept 
difficulty was established at the second-grade level by reference to a well- 
known listing (17). This was done for the specific purpose of using the in- 
strument with handicapped as well as normal children. The semantic dif- 
ferential was prepared in two separate forms. On each, people and curricular 
categories were alternated, while the sequence of concepts within each cate- 
gory was randomly varied. Individual scales were also randomized within 
each form, while the polarity of scales was randomly balanced in direction 
half and half on each form, and it was reversed between the two forms. 
Further, these forms were randomly distributed to subjects upon administration. 
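Because scale polarity was balanced within each form and reversed between the two forms, ratings have to be brought to a common direction before they are averaged. The sketch below assumes ratings coded 1 to 7 and a per-scale flag marking whether the scale was printed in reversed direction; the coding, the flag, and the sample ratings are hypothetical. It then averages one scale over the four concepts of a category.

```python
from statistics import mean

SCALE_POINTS = 7  # seven-point bipolar scales


def align_polarity(rating, reversed_on_form):
    """Recode a rating so the same pole of the adjective pair always scores high."""
    if not 1 <= rating <= SCALE_POINTS:
        raise ValueError("rating must lie between 1 and 7")
    return SCALE_POINTS + 1 - rating if reversed_on_form else rating


def scale_mean(ratings_by_concept, reversed_on_form):
    """Mean of one scale across the concepts in a category, after alignment."""
    return mean(align_polarity(r, reversed_on_form) for r in ratings_by_concept.values())


# Hypothetical ratings by one subject on one scale across the four people concepts.
ratings = {"classmates": 2, "parent": 1, "teacher": 3, "myself": 2}

print(scale_mean(ratings, reversed_on_form=True))
print(scale_mean(ratings, reversed_on_form=False))
```

Averaging without this alignment would mix the two poles of a scale across forms and pull every mean toward the midpoint.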

In addition, two forms of a practice instrument, using two concepts (doctor 
and spelling), were administered on two consecutive days prior to the main 
data collecting instrument. This provided an opportunity for the handicapped 
subjects to acquaint themselves with the task. 

In the braille form, the concept to be rated appeared at the top of the page, 
and the subject read down vertically the ratings written: for example, very, 
very good; very good; a little good; neither good nor bad; a little bad; very 
bad; and very, very bad. This enabled the blind subject to keep his place 
easily and maintain the concept in mind as he read through the choices. This 
vertical format was also used for practice braille forms in order to acquaint 
the subjects with the instrument and the procedures. The student was in- 
structed to mark his rating at the end of the line of his choice. 
3. Procedures 


The ratings were secured in small group settings by the classroom teachers 
between May and September of 1969. The students were asked to reveal their 
feelings about different school subjects and people by responding to these scales 
anonymously. 

For the purpose of the present study, a mean score for each subject was 
obtained on each scale across the four concepts in each category (people and 
curriculum). For each category and in each group of handicapped children, 
separately, the nine (people) or eight (curricular) scores for subjects were 
correlated, and the resultant matrix was analyzed by a principal component 
method, with unities in the diagonal. Those factors with an eigenvalue greater 
than one were then rotated by means of the varimax method. 


C. FINDINGS AND DISCUSSION 

1. People Ratings 


The results of factor analyses of people ratings are summarized in Table 1. 
Two factors each were identified among the retarded and the disturbed, four 
in the blind, and three in the deaf. In spite of this, further inspection revealed 
considerable similarities in factor structure among the four exceptional groups 
and the normal group here added for comparison from an earlier study (18). 
Thus, within each group, the rotated factors accounted more or less evenly 
for a total of 50 to 65 percent of the variance. The deaf group's semantic 
space was essentially identical with that of the normal group, and the blind 
also exhibited much similarity with the normal. The coefficients of congruence 
(6, p. 257) between the first factor in the normal (Merit) and the first 
factor of the deaf, the second factor of the retarded, and the second factor of 
the blind were, respectively, .9471, .9284, and .8767. The second factor in the 
normal (Movement) yielded congruence coefficients of .9244 with the second 
factor of the disturbed, .9235 with the second factor of the deaf, and .6648 
with the third factor of the blind. Finally, the third factor in the normal 
(Security) gave coefficients of .8586 with the fourth factor of the blind and 
.7500 with the third factor of the deaf. The first factor in the retarded 
appeared to be a cross between Security and Movement, while the first factor 
in the disturbed seemed one between Security and Merit. The first factor in 
the blind suggested a splinter of the Movement factor. 
All told, therefore, the semantic structures of the people ratings in these 
groups of exceptional children are similar to those revealed by normal children. 
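The coefficients of congruence reported above compare the loadings of one factor in one group with the loadings of a factor in another group on the same scales. A minimal sketch follows, assuming the usual definition (the sum of cross-products of the loadings divided by the square root of the product of their sums of squares); the source cites only a page reference for the formula, and the loadings below are hypothetical.

```python
from math import sqrt


def congruence(loadings_a, loadings_b):
    """Coefficient of congruence between two columns of factor loadings."""
    if len(loadings_a) != len(loadings_b):
        raise ValueError("both factors must be defined on the same scales")
    cross = sum(a * b for a, b in zip(loadings_a, loadings_b))
    norm = sqrt(sum(a * a for a in loadings_a) * sum(b * b for b in loadings_b))
    if norm == 0:
        raise ValueError("a factor with all-zero loadings has no direction")
    return cross / norm


# Hypothetical loadings of nine people scales on one factor in two groups.
normal_group = [0.81, 0.77, 0.70, 0.12, 0.08, 0.15, 0.20, 0.05, 0.10]
other_group = [0.75, 0.80, 0.62, 0.20, 0.02, 0.18, 0.25, 0.11, 0.04]

print(round(congruence(normal_group, other_group), 4))
```

Unlike a product-moment correlation, the coefficient does not center the loadings, so it reflects agreement in both the pattern and the sign of the loadings; values near 1 indicate nearly proportional factors.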
2. Curriculum Ratings 


Factor analyses of curriculum ratings yielded results summarized in Table 
2. Three factors each were extracted in the retarded and the deaf groups, 
while two factors each were found in the disturbed and the blind groups. 
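The number of factors extracted in each group follows from the rule given under Procedures: principal components of the correlation matrix with unities in the diagonal, retention of components with eigenvalues greater than one, and varimax rotation. The notation below is a sketch under standard definitions and is not taken from the source; in particular, the raw (unnormalized) varimax criterion is shown, and whether loadings were normalized before rotation is not stated.

```latex
% R: p x p correlation matrix of the scale means (p = 9 people scales or p = 8 curriculum scales)
R = V \Lambda V^{\top}, \qquad
\Lambda = \operatorname{diag}(\lambda_1, \dots, \lambda_p), \qquad
\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_p .

% Retain the k components with eigenvalue greater than one; unrotated loadings:
a_{ij} = v_{ij} \sqrt{\lambda_j}, \qquad j = 1, \dots, k, \qquad \lambda_j > 1 .

% With unities in the diagonal, \operatorname{tr}(R) = p, so the proportion of total
% variance accounted for by component j is
\frac{\lambda_j}{p} .

% Varimax: choose the orthogonal k x k matrix T that maximizes, for B = A T,
\sum_{j=1}^{k} \left[ \frac{1}{p} \sum_{i=1}^{p} b_{ij}^{4}
  - \left( \frac{1}{p} \sum_{i=1}^{p} b_{ij}^{2} \right)^{2} \right] .
```

Because the rotation is orthogonal, the total variance accounted for by the retained factors is unchanged; only its distribution across factors shifts.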

Correspondence among the groups was less convincing here than in the 
people ratings. However, the first factor in every group, except the retarded, 
accounted for more than 30 percent of the total variance and could be more or 
less identified with the first factor (Vigor) in the normal children (18). The 
coefficients of congruence with Vigor were .9027 (the first factor of the 
disturbed), .7636 (the first factor of the deaf), and .4413 (the first factor of 
the blind). The second factor in two exceptional groups, disturbed and deaf, 
showed a reasonable amount of correspondence with the second factor of the 
normal (Certainty), the coefficients being .7502 and .6858, respectively. The 
third factor of the deaf and the third factor of the retarded seemed like a 
splinter of Certainty, while the first and second factors in the retarded 
appeared to represent two different facets of Vigor itself. 

The normal-exceptional correspondence in semantic structure relative to 
curriculum concepts must be postulated with caution. The agreement between 
the normals and the retarded reported by Rybolt (16) could very well be the 
result of the particular concepts rated in his study: namely, Fire, Sword, 
America, Me, and Mother. Further investigations with a larger number and 
variety of concepts are indicated, particularly with retarded children, to 
clarify some of the ambiguities observed. 

It may nevertheless be suggested that, in general, the semantic space of 
the school-related attitudes in various exceptional groups of children is not 
structured in a manner radically different from that in the normal middle-school 
age children. In most cases, semantic structure did not seem to vary as 
a direct function of the form of exceptionality. 


D. Summary 


The semantic structures of school-related attitudes in exceptional middle- 
school and high-school age children—namely, the retarded, disturbed, blind, 
and deaf—were investigated by factor analytical techniques. The semantic 
differential was constructed to include nine seven-point, bipolar scales for four 
people concepts (classmates, parent, teacher, myself) and another, but over- 
lapping, set of eight seven-point bipolar scales for curriculum concepts (social 
studies, language, science, mathematics). Considerable similarities in semantic 
structure on people concepts among the four exceptional groups were found 
when compared with a normal group of subjects from an earlier study. The 
normal-exceptional correspondence in semantic structure relative to curriculum 
concepts was less convincing than on the people concepts. However, it ap- 
peared that the semantic space of the school-related attitudes of the various 
exceptional groups of children was not structured in a manner radically dif- 
ferent from that of normal children of the same age. 
CONTRAST EFFECTS IN ANIMAL AND HUMAN LEARNING: 
RECENT RESULTS AND INTERPRETATIONS* 


University of Melbourne, Australia 


LAWRENCE WEINSTEIN1 


A. INTRODUCTION 


Negative incentive contrast effects occur when a reduction in incentive 
magnitude produces a reliably lower level of behavior than the level of a con- 
trol group maintained on the smaller amount of reward throughout training; 
positive incentive contrast effects are defined by a significantly higher level 
of performance, consequent to an increment in reward quantity, than a control 
group level. While the recent literature is marked with frequent demonstra- 
tions of upward and downward incentive contrast with appropriate postshift 
control groups employed (8, 13, 28, 29, 34), it is not entirely clear which 
variables control (contribute to) the occurrence or magnitude of the phenom- 
ena. Some factors, presented in Table 1, have been examined. It would appear 
that occurrences and the size of contrast effects depend on specific independent 
variables some of which have been determined. 
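The definitions above turn on a comparison between a shifted group and a control group maintained on the postshift reward throughout. A minimal sketch of that comparison follows, using the mean postshift difference and a Welch t statistic as a rough index of reliability; the choice of statistic, the function names, and the running-speed values are assumptions for illustration and are not taken from the studies cited.

```python
from math import sqrt
from statistics import mean, variance


def contrast_difference(shifted_postshift, control_postshift):
    """Postshift mean of the shifted group minus that of the unshifted control."""
    return mean(shifted_postshift) - mean(control_postshift)


def welch_t(a, b):
    """Welch t statistic for two independent groups with unequal variances."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def contrast_direction(shifted_postshift, control_postshift):
    """'negative' if the shifted group falls below control, 'positive' if above."""
    diff = contrast_difference(shifted_postshift, control_postshift)
    return "negative" if diff < 0 else "positive" if diff > 0 else "none"


# Hypothetical postshift running speeds (arbitrary units).
shifted = [0.8, 0.9, 0.7, 1.0, 0.6]   # large reward reduced to small
control = [1.4, 1.5, 1.3, 1.6, 1.2]   # small reward throughout training

print(contrast_direction(shifted, control))
print(round(contrast_difference(shifted, control), 2), round(welch_t(shifted, control), 2))
```

The control group matters here: a drop in performance after a reward reduction counts as negative contrast only if performance falls below the level of subjects who received the small reward all along.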


B. THEORETICAL INTERPRETATIONS OF CONTRAST 


An extension of the Amsel-Spence frustration hypothesis has been used to 
explain negative contrast (1, 23). According to this hypothesis, a classically 
conditioned expectancy of reward evokes frustration, fear, or increased 
emotionality in general, when the reward is less than the expected amount 
(4). Frustration elicits a variety of internal stimuli which elicit competing 
responses that produce decrements in the instrumental response below the 
low reward level. Evidence for increased vigor of responding following a 
reduction in incentive size comes from Amsel’s (1) double runway studies. 
Amsel found that animals who experienced a reduction in incentive magnitude 
in the first goal box of the first runway ran faster in the second runway alley 
relative to Ss who continuously experienced the reduced amount in the first 
goal box. A variety of evidence exists that is consistent with such an 
emotional-motivational account of incentive contrast effects and will now be 
considered. 


TABLE 1 
VARIABLES STUDIED, RESULTS, AND INVESTIGATOR 

Independent variable             Results and investigator 

Age                              Older rats and humans showed contrast effects; 
                                 younger organisms did not (e.g., 20, 34). 

Amount of preshift training      Larger contrast accompanied increased preshift 
                                 training in rats and humans (e.g., 24, 25, 34). 

Magnitude of reward change       Larger changes in magnitude of reward produced 
                                 larger contrast effects in animals and humans 
                                 (e.g., 11, 15, 28). 

Drive level                      High drive (85% body weight) animals displayed 
                                 contrast, while low drive rats (98% body weight) 
                                 did not (e.g., 13). 

Shift and control preshift       In the absence of a difference in body weight, 
body weights                     contrast effects were obtained; so, the effects are 
                                 not due to body weight differences (e.g., 30). 

Pre-postshift interval (time     Contrast not found in 68-day pre-post rats, but 
between last preshift and        found in one-day rats (e.g., 14). 
first postshift session) 


Pharmacological agents have been widely used in the laboratory and the 
clinic to control fear and, more generally, to combat the heightened emotionality 
associated with noxious events (e.g., 2). A number of investigations 
(e.g., 26) have pointed out the aversive emotional similarities between fear, 
anxiety, and frustration, all of which appear to share certain properties. It has 
been demonstrated that injections of chlordiazepoxide (Librium) (2) and of 
amobarbital sodium (19) are effective in controlling shock-induced emotionality 
in the laboratory. These drugs are classified clinically as antianxiety 
agents. It follows therefore that if negative contrast effects are due to a state 
of frustration or heightened emotionality, these drugs should attenuate or 
eliminate downward contrast effects. Rosen, Glass, and Ison (22) reported 
that negative contrast found in control animals was not found in animals 
injected with amobarbital sodium. Rosen and Tessel (21) demonstrated 
negative contrast effects with saline control animals, but reported that downward 
contrast did not occur in Librium-injected animals. The hippocampus 
has been suggested as a physiological mediator of frustration (aversive 
motivational responses in general). Evidence for this position comes from studies that 
have shown that hippocampal lesioned animals showed deficits in frustration 
or fear mediated tasks, such as passive avoidance (17) and conditioned 
emotional response (5). On the assumption that there is hippocampal mediation, 
if negative contrast is caused by frustration, hippocampal lesioned animals 
should exhibit an attenuated downward effect. Brown and Murphy (6) have 
reported that hippocampal lesioned rats showed no contrast effect, while 
control Ss exhibited downward contrast. 

The aversive (frustrative) emotional account of negative contrast leads one 
to believe that contrast effects will be diminished in size or will not occur 
following training under partial reinforcement. Subjects trained under partial 
reinforcement presumably have frustration cues conditioned to the ongoing 
instrumental response when responses are not rewarded. Subsequent to a 
reduction in reward size, the frustration evoked by the lower reward should 
now produce familiar frustration cues; that is to say, Ss are now less sensitive 
to the disrupting effects of the emotional cues than Ss who are trained under 
continuous reinforcement. Evidence for the position of decreased sensitivity 
comes from studies that have demonstrated that experience with a noxious 
event renders behavior less sensitive to this or similar aversive situations (e.g.,
27). A reduction in reward should produce few, if any, competing responses
in partial reward Ss. Thus, little or no depression in performance after a 
reward reduction should be found in partial reward Ss as compared to con- 
tinuous reinforcement Ss. Mikulka, Lehr, and Pavlik (18) reported that 
negative contrast effects found with continuous reinforcement were not 
obtained with a partial schedule of reward in rats. Weinstein (35) found that 
humans rewarded for every correct response (continuous reinforcement) to 
mental multiplication problems displayed a depression effect, while people 
reinforced on a partial schedule of reward (fixed interval 10 seconds) did not 
exhibit contrast subsequent to a reduction in amount of reward. While most
studies have failed to obtain negative contrast under partial reward, some 
investigations have found the depression effect with partial reward (e.g., 30). 

According to an emotionality interpretation of downward contrast, Ss who 
experience a decrease in amount of reward have emotional cues conditioned 
to the ongoing response. If this is the case, a second reduction in incentive size 
should produce familiar frustration cues, fewer competing responses, and 
smaller, if any, downward contrast, as compared to Ss who experience this 
second reduction in reward quantity subsequent to no change in amount of 
reward. Capaldi and Lynch (7) have demonstrated that negative contrast, ob- 
tained in rats with a decrease in incentive size subsequent to no reduction in 
reward quantity, was not found in animals who experienced the decrease in 
incentive magnitude after a reduction in amount of reward. Weinstein (35) 
using humans reported that a prior reduction in incentive magnitude eliminated
downward contrast.

If downward contrast effects are caused by emotional factors, one might
expect a neutral stimulus that is paired with a reduction in incentive
magnitude and consequent negative contrast (a frustrative or aversive motivational
state) subsequently to produce more active avoidance in the typical two way 
shuttlebox situation than a stimulus that is initially associated with the smaller 
incentive amount. That is to say, if a once neutral cue comes to serve as an 
aversive (unconditioned) stimulus as a result of being associated with a 
reduction in incentive magnitude, these data would be compatible with the 
notion that a reduction in incentive quantity in infrahuman organisms results 
in an aversive motivational state which accounts for negative contrast effects. 
The experiment presented below examined the effects on active avoidance be- 
havior of an auditory stimulus paired with a reduction in incentive quantity, 
as compared to the stimulus when it was associated with the lower reward 
magnitude of the control group. 


C. METHOD
1. Subjects 


The Ss, 100 male, experimentally naive, albino rats, 90-110 days old at the
start of the experiment, were randomly assigned to four equal groups.


2. Apparatus 


The operant conditioning chamber had plywood side walls, Plexiglass top, 
back, and door, and a grid floor; it measured 305.8 × 254 × 254 mm. The
operant box is more fully described in the Weinstein (28) article. 

The auditory signal, a 1500-Hz, 69-db SPL, 3-second tone, produced by an 
audio oscillator, was presented by a hand switch through a 76.2-mm, 8-ohm 
speaker mounted 25.4 mm above the key. 

The shuttle box was 687.8 mm long, 177.8 mm wide, 305.8 mm high, 
painted gray, with a clear Plexiglass top and a floor of 3.10 mm stainless steel 
grids, 12.7 mm on center. The box, located in a dimly lit room, was illuminated 
from underneath by the light of two 25-watt bulbs covered by a layer of 
frosted glass. A hand switch permitted the 1500-Hz tone used in the operant 
chamber to be connected to either of two 76.2-mm, 8-ohm speakers, one
mounted at either end of the box. Another hand switch allowed a 4-Hz click 
to be presented through either of two 76.2-mm, 8-ohm speakers, one placed 
25.4 mm above each 1500-Hz speaker. 


3. Procedure 


Saccharine concentration treatments of .10% and 1.2% by weight were
used with two shift groups and two control groups. One shift group
experienced 1.2% saccharine for eleven 40-reinforcement (dipper presentation)
sessions followed by a reduction to .10% saccharine for the remaining seven
sessions of the study (1.2-.1); another group of Ss received .10% saccharine
throughout the study (.1-.1); some animals were exposed to 1.2% saccharine
followed by a shift to .10% plus a 3-second tone on session 12 and for the
remainder of the 18 sessions (1.2-.1T); finally, Ss received .10% through
session 11 and then experienced the tone plus .10% saccharine for the
remainder of the study (.1-.1T).
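
The four-group design can be summarized as a small lookup from group and session to the treatment in effect. The following is only an illustrative sketch: the names `GROUPS`, `Condition`, and `condition_for` are hypothetical, and the concentrations, the shift after session 11, and the 18-session length are taken from the description above.

```python
from typing import NamedTuple

LAST_PRESHIFT_SESSION = 11  # sessions 1-11 are preshift (from the text)
TOTAL_SESSIONS = 18         # sessions 12-18 are postshift (from the text)


class Condition(NamedTuple):
    saccharine_pct: float  # saccharine concentration, percent by weight
    tone: bool             # whether the 3-second tone accompanies the dipper


# Hypothetical encoding: (preshift %, postshift %, tone during postshift)
GROUPS = {
    "1.2-.1": (1.2, 0.10, False),
    ".1-.1": (0.10, 0.10, False),
    "1.2-.1T": (1.2, 0.10, True),
    ".1-.1T": (0.10, 0.10, True),
}


def condition_for(group: str, session: int) -> Condition:
    """Return the treatment a group receives in a given session (1-18)."""
    if not 1 <= session <= TOTAL_SESSIONS:
        raise ValueError(f"session must be 1..{TOTAL_SESSIONS}, got {session}")
    preshift_pct, postshift_pct, postshift_tone = GROUPS[group]
    if session <= LAST_PRESHIFT_SESSION:
        # No group hears the tone before session 12.
        return Condition(preshift_pct, False)
    return Condition(postshift_pct, postshift_tone)
```

Under this encoding the two tone groups differ only in their preshift concentration, which is the comparison the avoidance test relies on.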
Following several days of adaptation to a 23-hour food and water deprivation
schedule, Ss were trained on successive days to press the bar for continuous
reinforcement; with each depression the bar operated the magazine. Throughout
the study, body weights were recorded daily before each session, and Ss
had access to food and water for one hour in the home cage after each session.
For the 23 hours following home cage feeding and watering, Ss were food
and water deprived. Beginning with session 12, E activated the tone once
per dipper presentation for the 1.2-.1T and .1-.1T Ss, when the animal’s
tongue initially touched the dipper. The E was seated in the darkened room
beside the operant box.

Immediately following session 18 (Ss received no usual home cage food and
water, but received only the experimental saccharine solutions in the operant
box), within 10 minutes, the speakers in the shuttle box were connected, and
each S in the .1-.1T and 1.2-.1T groups was given 30 conditioning trials with
a constant 60-second intertrial interval (the time between tone offset, or an
avoidance response, and the next click onset). The S was placed into the
shuttle box, and 60 seconds later the click was activated at the end of the box
in which S was located. Ten seconds after click onset the postshift 1500-Hz
tone was continuously presented until S escaped to the opposite end of the box.
If S crossed to the opposite end of the box before tone onset, the click was
discontinued, no tone was given, and an avoidance was scored for the trial.
Intertrial interval crossings were not scored as avoidance responses, and S
had to avoid or escape from the end of the box where he was located at the
start of a trial (click onset). During the avoidance phase, Ss performed under
the same deprivation conditions as in the operant box phase.
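
The scoring rule for a single shuttle box trial can be sketched as follows. This is a minimal illustration, not the authors' scoring procedure: the `Trial` record, the `score_trial` function, and the use of `None` for a trial with no crossing are assumptions, as is the choice to treat a crossing at exactly 10 seconds as an escape. Only the 10-second click-to-tone interval comes from the text.

```python
from dataclasses import dataclass
from typing import Optional

CLICK_TO_TONE_S = 10.0  # the tone begins 10 s after click onset (from the text)


@dataclass
class Trial:
    # Seconds from click onset to the crossing into the opposite end of the box.
    # None is an assumed code for a trial on which S never crossed.
    crossing_latency_s: Optional[float]


def score_trial(trial: Trial) -> str:
    """Classify a trial as 'avoidance', 'escape', or 'no_response'."""
    if trial.crossing_latency_s is None:
        return "no_response"
    if trial.crossing_latency_s < CLICK_TO_TONE_S:
        # S crossed before tone onset: click ends, no tone, avoidance scored.
        return "avoidance"
    # S crossed only after the tone had started (assumed to include 10.0 s).
    return "escape"
```

Intertrial crossings never reach this function in the sketch, since latency is measured from click onset and only crossings made during a trial are recorded.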


D. RESULTS


Body weight data (not presented here) indicated no significant difference 
among the four groups in the preshift or the postshift period.

The average number of bar presses made per minute was examined in the
analysis of the bar press data. From Figure 1 it is clear that for the preshift
the shift groups performed at a higher level than their respective control
conditions, while 1.2-.1 and 1.2-.1T responded at about an equal level, as did
the control groups. The mean number of bar presses per minute from sessions
1-11 differed significantly among the four groups by an analysis of variance,
F (3, 96) = 3.24, p < .05. By Duncan’s multiple comparisons the differences
between 1.2-.1 and .1-.1 and between 1.2-.1T and .1-.1T were each reliable
(p < .05), while the differences between 1.2-.1 and 1.2-.1T and between .1-.1
and .1-.1T were each not significant (p > .05).

FIGURE 1
MEAN BAR PRESSES PER MINUTE (ARROW DENOTES POSTSHIFT)

Figure 1 indicates that on session 12 the shift groups reduced their rate of
bar pressing to about an equal level below their respective control groups
(equal negative contrast effects). The mean number of bar presses per minute
from sessions 12-18 differed significantly among the four groups by analysis of
variance, F (3, 96) = 4.12, p < .05. By Duncan’s comparisons the differences
between the shift groups and between the control groups were each not
significant (p > .05), while the differences between each shift group and its
respective control group were statistically reliable (p < .05).
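
As a check on the reported degrees of freedom, a one-way between-groups analysis of variance can be sketched in a few lines. The functions `one_way_anova` and `anova_df` are hypothetical helpers written for illustration; no raw data are given in the article, so the sketch shows only the computation. With 100 Ss in four groups the degrees of freedom are (3, 96), as reported above.

```python
from typing import Sequence, Tuple


def anova_df(n_groups: int, n_total: int) -> Tuple[int, int]:
    """Degrees of freedom (between, within) for a one-way ANOVA."""
    return n_groups - 1, n_total - n_groups


def one_way_anova(groups: Sequence[Sequence[float]]) -> Tuple[float, int, int]:
    """Return (F, df_between, df_within) for a one-way between-groups ANOVA."""
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Between-groups sum of squares: group size times squared mean deviation.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-groups sum of squares: squared deviations from each group's mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    df_between, df_within = anova_df(len(groups), n_total)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within
```

The same helper gives (1, 48) for a comparison of two groups totalling 50 Ss.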

One response examined in the analysis of the shuttle box data was mean
percentage avoidances in the 30-trial session (number of avoidance
responses/30 × 100). Figure 2 displays mean percentage avoidances as a function
of shift-control group. Examination of Figure 2 indicates that Ss who
experienced the tone paired with a reduction in concentration of saccharine
solution displayed a higher level of active avoidance to the tone than Ss in
the control condition, where the signal was paired with the lower concentration
of saccharine. Analysis of variance revealed a significant difference between
the two groups, F (1, 48) = 8.21, p < .01.

FIGURE 2
MEAN PERCENTAGE AVOIDANCES FOR THE EXPERIMENTAL (1.2-.1T) AND
CONTROL (.1-.1T) GROUPS
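
The percentage-avoidance measure defined above (number of avoidance responses/30 × 100) can be written out directly. The function names are hypothetical, and the counts used in any call are illustrative only, since the article reports group means rather than individual scores.

```python
from typing import Sequence

TRIALS_PER_SESSION = 30  # conditioning trials per S (from the text)


def percentage_avoidances(n_avoidances: int, n_trials: int = TRIALS_PER_SESSION) -> float:
    """Percentage of trials on which S crossed before tone onset."""
    if not 0 <= n_avoidances <= n_trials:
        raise ValueError("avoidance count must lie between 0 and the number of trials")
    return n_avoidances / n_trials * 100


def mean_percentage_avoidances(avoidance_counts: Sequence[int]) -> float:
    """Group mean of the per-S percentage avoidances."""
    percentages = [percentage_avoidances(c) for c in avoidance_counts]
    return sum(percentages) / len(percentages)
```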

Another dependent measure examined in the analysis of the shuttle box data 
was the mean latency of the first avoidance response in the 30-trial session. 
This measure (data not shown here) yielded similar effects to percentage
avoidances. That is to say, 1.2-.1T Ss took less time to make the first avoidance
response in the session than did the .1-.1T group, by analysis of variance,
F (1, 48) = 4.18, p < .05. The results indicate that animals avoided a tone
previously paired with a reduction in incentive magnitude (which resulted in
negative contrast effects) more often than Ss avoided the tone that was
initially associated with the lower, control group’s, magnitude of reward. As
already discussed, these data are consistent with the notion that downward
contrast effects in animal behavior are accounted for by some form of aversive
motivational state. The finding in the active avoidance situation that the tone
acquired aversive motivational properties (as a result of being paired with a
reduction in reward magnitude) agrees with a number of investigations (e.g.,
9, 10) that have demonstrated that a once neutral cue became aversive as a
consequence of being associated with a decrease in incentive magnitude.

The finding that for sessions 12-18 the control groups performed at the
same level indicates that the tone had no effect on behavior.


E. DISCUSSION


Using the same reasoning as in the above study, but with humans as Ss,
Weinstein (32) paired a tone with a reduction in quantity of monetary
reward (and consequent contrast effects) in a problem solving situation. He
then used the tone as an aversive unconditioned stimulus with an active
avoidance task. The Ss could avoid the tone, the onset of which was signalled
by a light, by pressing a button. Subjects who received the tone previously
paired with a decrease in amount of money actively avoided the tone more often
than did those individuals who avoided the tone when it was associated with
the lower, control group’s, amount of reward.

One further line of evidence relevant to a frustration account of negative
contrast comes from Weinstein’s (36) investigation. It was suggested that
if a reduction in incentive magnitude produced frustration responses that
impaired behavior (resulted in downward contrast), one would expect
individuals more sensitive to the impairing effects of frustration, or more
emotional in general, to exhibit larger negative contrast than individuals who
are less emotional. On the basis of scores from a test that measured
sensitivity to frustrating events, two groups of people were selected: highly
emotional and relatively less emotional. All Ss attempted mental multiplication
problems for monetary reward. A reduction from 50¢ to 5¢ produced negative
contrast in the highly emotional individuals, but not in the less emotional
people.

A separate hypothesis based on appetitive motivation can be used to account 
for positive contrast effects. According to this notion, a classically conditioned 
expectancy of reward evokes happiness, or an appetitive emotional response in
general, when reward is larger than the expected amount. Happiness, being a 
motivational state, produces internal cues which, different from the frustration 
internal cues, energize the instrumental response, thus resulting in an increase 
in performance, positive contrast. Independent evidence for the supposition 
that an increase in reward magnitude produces an increase in instrumental 
response strength comes from Weinstein’s double runway study (31). Wein- 
stein found that rats that received an increase in incentive size in the first 
goal box of the first runway (from zero grams to .5 gram) ran faster in the 
second runway alley relative to Ss who continuously experienced .5 gram in 
the first goal box. If the appetitive motivational account of positive contrast 
is correct, one would predict that a neutral stimulus associated with an increase 
in amount of reward (which produces positive contrast effects) will sub- 
sequently produce a higher level of performance than will a cue that is initially 
paired with the control group’s larger amount of reward. That is to say, as a 
result of occurring with an appetitive emotional state the cue should acquire 
secondary motivational properties. Weinstein (37) increased the concentration 
of a saccharine solution from .01% to 1.2% with rats in an operant box; 
with the increment, which produced positive contrast, he paired a tone. After 
the operant box experience, animals that experienced the tone paired with an 
increase in saccharine concentration ran faster in a runway when the tone 
preceded each trial, than rats that received the tone paired with the higher, 
control group’s concentration. Weinstein (33) found that people who re- 
ceived an increase in money (from 2¢ to 20¢), along with a 3-second tone 
administered whenever E handed § 20¢ in the postshift, performed at a higher 
level in a separate operant conditioning task when the tone preceded each 
operant trial, relative to postshift control Ss who initially received the tone 
paired with 20¢.

According to an emotional interpretation of positive contrast effects, Ss who 
experience an increase in amount of reward (and positive contrast) have 
appetitive emotional cues conditioned to the ongoing response. If this is so, a 
second increase in incentive magnitude should produce familiar emotional 
cues; that is to say, Ss are now less sensitive to the facilitating effects of the
emotional cues than Ss who do not experience a prior increase in amount of 
reinforcement. An increase in reward magnitude should result in a smaller 
increase in responding and little, if any, positive contrast, as compared to Ss 
who experience this second increase subsequent to no variation in
reinforcement magnitude. In the previously cited Weinstein (37) paper,
concentrations of saccharine solutions were increased in one group of rats
from .01%, to 1.1%, to 1.25%; positive contrast on the third increment was not
obtained. Another group received 1.1%, 1.1%, and finally 1.25%; with an increase to
1.25% upward contrast effects were found. Weinstein (33) using humans 
found that an increase in amount of money for solving mental multiplication 
problems from 2¢, to 20¢, to 50¢ produced no positive contrast on the last 
increment; other people who received 20¢, 20¢, and finally 50¢ exhibited 
positive contrast consequent to the increase. 

The frustration (aversive emotional) interpretation of downward contrast 
effects, and the happiness (appetitive emotional) account of positive contrast 
effects, both make the empirical prediction that the duration of contrast effects 
should be short. For example, following a shift from a large amount of reward 
to a small reward, the small reward should come to elicit less and less frustra- 
tion with successive trials (as discussed earlier) and, hence, less and less 
incentive contrast. This prediction is supported by the experimental evidence. 
In all of the Weinstein contrast studies cited previously the duration of 
positive and negative contrast effects was short, not lasting throughout the 
experiment. Ehrenfreund (13) and Dunham and Kilps (12) both demon- 
strated that the magnitude of negative contrast decreased over time. 

The emotional account of contrast makes the further empirical prediction
that the magnitude of incentive contrast effects should be a positive function
of the amount of preshift training and of the magnitude of the reward shift.
Regarding the length of preshift training, following a long preshift period the
expectancy of receiving the preshift amount of reward should be more firmly
established than following a shorter training period. The emotional
interpretation [...] producing larger contrast effects than a less well
established (based on a [...] difference between the quantity of reward an
animal expects and the actual amount he receives. A larger
expectancy (produced by a larger amount of reward) should result in a
greater amount of frustration or happiness consequent to a shift in incentive 
magnitude, and thus produce larger contrast effects than would a smaller 
expectancy. Both of the above predictions are borne out by experimental 
results. Vogel, Mikulka, and Spear (24, 25) and Weinstein (34) have
reported that the magnitude of positive and negative contrast was a monotonic 
function of the amount of training; Dilollo and Beez (11), Gonzalez, 
Gleitman, and Bitterman (15), and Weinstein (28) have demonstrated that 
larger changes in amounts of reward produced larger incentive contrast effects 
than smaller variations in incentive. 

While the adaptation-level model of Bevan and Adamson (3) and Helson 
(16) can handle some of the above data, it does not account for as many 
empirical findings as does the emotional interpretation. The perceptual account 
maintains that the effectiveness of a certain amount of reward depends upon 
the perceived magnitude of that quantity of reward; that is to say, when 
reward magnitude is changed, the size of the new amount is perceived in con- 
trast to the previous amount and thus appears larger or smaller than the 
control Ss’ perceived postshift amount. This model implies that the magnitude 
of positive and negative contrast should decrease with time and increase with 
amount of training. The model also implies that large shifts in incentive size 
should produce larger contrast effects than smaller variations in incentive 
magnitude. The existing data (e.g., 11, 34, 38) support these notions. It is not 
readily apparent, however, how this perceptual model accounts for hippocampal 
lesioning studies, high and low S emotionality studies, the results of pharma- 
cological intervention studies, the effects of partial reinforcement on contrast, 
repeated reduction and increment investigations, the Weinstein (32, 33, 37) 
studies, and, finally, the animal avoidance data reported herein. For example, 
it is not clear how the perceptual model would predict the heightened respon- 
sivity to the tone in any of the Weinstein tone investigations. The model would 
imply that the tone should not subsequently energize behavior any more than 
when the signal is paired in the postshift with the control group’s receipt of 
reward. As already discussed, however, the tone does subsequently activate 
behavior. 

While all the data reviewed in regard to an account of contrast effects are 
not entirely consistent with respect to supporting one interpretation (per- 
ceptual or emotional) and rejecting the other, the results nevertheless strongly 
suggest that the emotionality account of contrast effects is more useful than 
the perceptual explanation, as the former can explain a wider diversity of 
phenomena than the latter. 
F. SUMMARY


Both negative and positive incentive contrast effects appear to be empirically 
valid in human and infrahuman organisms. Current findings indicate that the 
downward and upward phenomena are controlled by specific variables. While 
it is not entirely clear how to explain depression and elation effects theoreti- 
cally, aversive and appetitive motivational accounts appear to fit best the 
variety of situations in which incentive contrast effects have been demonstrated. 
RELATIONSHIPS AMONG RISK PREFERENCES,
IMPORTANCE, AND CONTROL*


Brigham Young University 


KENNETH L. HIGBEE¹ AND TERENCE LAFFERTY


A. INTRODUCTION


Two questions were of interest in the present study. First, do people 
prefer greater risks in situations they judge as unimportant than in situations 
they judge as important? Second, do people prefer greater risks in situations 
where they feel that they have control over the outcomes of their decisions
than in situations where they feel that the outcomes of their decisions are due 
to forces beyond their control? 

The results of a study by Clark and Willems (1), using the Choice- 
Dilemma Questionnaire (CDQ), would suggest that the first question be 
answered in the affirmative. The CDQ is a measure of hypothetical decision 
making. Twelve situations are described in which the central person is faced 
with a choice between two alternative courses of action, one of which is more 
rewarding but also less likely to succeed. Ss indicate for each situation the 
lowest probability of success they would accept before recommending that 
the more rewarding alternative be chosen. These probabilities are the Ss’ risk 
preferences [the complete CDQ may be found in Kogan and Wallach (4)].
In addition to completing the CDQ, Ss in the Clark and Willems study 
indicated for each situation how serious they felt the consequences would be 
if the person failed. A negative relationship was found between risk pref- 
erences and the judged seriousness of the consequences of failure. The rela- 
tionship between risk preference and importance investigated in the present 
study is very similar to the Clark and Willems study and should provide 
results consistent with their finding; that is, risk preferences and judged 
importance should be negatively related.

There have been some conflicting findings relevant to the second question 
above. In some situations people who feel that they are in control of the 
situation may be more risky, while in other situations people who feel that 
they are not in control may be more risky (cf. 2, 3). Thus, the second purpose 
of the present study was to investigate the relationship between perceived
control and risk preferences on the CDQ, without any specific prediction as to 
the expected direction of the relationship. 


B. METHOD


Sixty females and 56 males enrolled in introductory psychology courses 
completed the CDQ, with the usual instructions to indicate for each of the 
12 items the lowest probability of success they would accept before recom- 
mending that the more rewarding alternative be chosen. In addition, each 
of the situations was followed by two questions. The first question said, 
“Situations may vary in their importance to the person making the decision. 
If you were faced with the situation described above, how important would 
this decision be to you?” Ss indicated their answers by checking a seven-point 
scale ranging from 1, “Very Unimportant,” to 7, “Very Important.” The 
second question following each item said, “Situations may vary in the extent 
to which you control the outcome of your decision as opposed to the out- 
come’s being caused by forces beyond your control. If you were faced with 
the situation described above, to what extent would the outcome of your 
decision be under your control?” Ss indicated their answers by checking a 
seven-point scale ranging from 1, “Outcome due to forces beyond my control,” 
to 7, “Outcome due to forces under my control.” 


C. RESULTS AND DISCUSSION


Since the interest of the study was in the characteristics of the CDQ situa- 
tions rather than of the Ss, the analysis was computed on three means cal- 
culated for each of the 12 items: (a) the mean risk preference for the item, 
(b) the mean importance score, and (c) the mean control score. These item
means were calculated separately for males and for females. 

Clark and Willems (1) found that the 12 items varied considerably in 
terms of the seriousness of the consequences of failure; the items had a range 
of about three points, on a five-point scale, from the lowest (item 7) to the 
highest (item 2). In the present study, items did not vary quite as much in 
judged importance. On a seven-point scale, they ranged from a low of 4.50
on item 3 to a high of 6.63 on item 12 for males, and 4.25 to 6.98 on the 
same two items for females. Items 7 and 2 were the next-lowest and next- 
highest items. 

Product-moment correlation coefficients were calculated among risk pref- 
erences, judged importance, and perceived control. In interpreting the cor- 
relations it should be remembered that a high score on risk preference indicated 
less riskiness, while high scores on importance and control indicated greater
judged importance and greater perceived control. There was a significant
negative relationship between riskiness and importance for males (r = .51,
p < .05), but there was no relationship for females (r = .07). There was
no relationship between risk and control either for males (r = .06) or for 
females (r = .03). 
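
The item-level analysis can be sketched as two steps: average each item's scores across Ss, then compute product-moment correlations across the 12 item means. The helpers `item_means` and `pearson_r` below are hypothetical and use only the standard library; the article reports no raw scores, so none are supplied. Because a high CDQ score means less riskiness, a positive correlation between CDQ scores and importance corresponds to a negative relationship between riskiness and importance.

```python
import math
from typing import List, Sequence


def item_means(scores_by_subject: Sequence[Sequence[float]]) -> List[float]:
    """Average each item across Ss; rows are Ss, columns are items."""
    n_subjects = len(scores_by_subject)
    return [sum(column) / n_subjects for column in zip(*scores_by_subject)]


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable has no variance")
    return sxy / math.sqrt(sxx * syy)


def riskiness(cdq_scores: Sequence[float]) -> List[float]:
    """Reverse CDQ scores so that higher values mean greater riskiness."""
    return [-score for score in cdq_scores]
```

Correlating `riskiness(cdq_item_means)` with the importance means simply flips the sign of the coefficient obtained from the unreversed CDQ means.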

The conclusions indicated by these results, relevant for the two questions 
investigated, are as follows: First, males tended to prefer greater risk in 
situations judged as unimportant than in situations judged as important, while 
there was no relationship between riskiness and importance for females. 
Second, there was no tendency for risk preferences to be related to the extent 
to which either males or females perceived the situation to be under their
control.


D. SUMMARY


Two questions were investigated. First, are people more risky in situations 
which are judged as unimportant than in situations judged as important? 
Second, are people more risky when they feel that they are in control over 
the outcomes of their decisions than when the outcomes are due to forces 
beyond their control? Correlations were computed among risk preferences, 
judged importance, and perceived control in 12 hypothetical decision situa- 
tions. There was a negative relationship between risk preferences and im- 
portance for males, but no relationship for females. No relationship was 
found between risk preferences and perceived control for either sex. 
CORRELATES OF THE GENERATION GAP*


Institute for Survey Research and Department of Psychology, Temple University 


LEONARD A. LOSCIUTO AND ROBERT M. KARLIN


A. INTRODUCTION 


Few discussions of adolescence have been based on any kind of data, and
fewer still on well-designed, large-scale survey research studies. Yet, if the 
current worrisome questions about contemporary youth are to be reliably an- 
swered, there would seem to be few alternatives to asking the adolescents 
themselves. 

Perhaps the most troubling and frequently asked questions about youth are 
concerned with the so-called “generation gap.” How widespread is it, and on
what issues are the conflicts greatest? Who are the student dissenters, and to 
what extent are these students characterized by indices of alienation from 
traditional and contemporary societal values and institutions? These are the 
kinds of questions to which the current study addressed itself in studying a 
probability sample of 2362 high school students in Pennsylvania. The study,
which was funded by the Pennsylvania Governor’s Council for Human
Services, formed the basis for one part of the Governor’s report at the 1970
White House Conference on Children and Youth.

While the psychological and sociological literature is capable of supporting 
diverse points of view on these questions, there seems to be some tentative 
agreement at least that the present “generation gap” presents some features 
unique in our history. To conserve time and space, only two examples will be 
given here. 

Friedenberg feels that contemporary young-old confrontations differ greatly 
from previous encounters, and he expresses the feeling that certain groups of 
young people are now rejecting society in a new fundamental way (2). He 
feels that the different values that guide today’s students are not likely to be 
changed, even by the social and legal coercion that purports to be in their best 
interests. In fact, efforts at coercion by adults may result in even greater 
alienation and isolation of youth from traditional social and familial institu- 
tions. 

Mead also sees the current generation gap as different from past youth 
movements, as those adolescents who are not satisfied with present philosophy
and practice seek to change or withdraw from the structures and institutions
of society (4). In her efforts to explain alterations in adult-youth
relationships, Mead emphasizes “pre-figurative” culture, one in which parents
learn from children.

With such rapid changes in all phases of life, and with no elders having 
been raised in this “new” environment, very seldom in the western world are
there adults who know what children know. Mead concludes that today’s
parents are uncertain and do not know how to teach their children, since 
their own childhoods were so different from those of their offspring. 

Quantitative studies, insofar as they exist, have tended to focus on college 
populations. Yet, high school student dissidence cannot be ignored. For 
instance, Westin in 1969 reported unrest to be as prevalent in the high schools 
as in the colleges, as more than half of the rural high schools and 56% of the 
junior high schools sampled reported student protests (5). Ominously, racial 
disorders seem to be the most prevalent form of high school disturbance,
since regardless of the issue, the triggering incident often unleashes
black-white animosities.

Another and slightly more optimistic quantitative study which focused upon
high school students was conducted in 1969 for Life magazine (3). The study
cited actual student responses to questions regarding satisfactions and
frustrations in their lives. Interviews were also conducted with parents and
teachers. Some questions clearly differentiated the generations; for example,
while the majority of students desired more participation in policy making,
both their parents and teachers thought that the students already had enough
power. However, other issues, such as student voice in curriculum matters,
provided less controversy. Lack of adolescent respect for authority was cited
by parents and teachers as most prevalent and aversive. However, overall,
77% of parents and 85% of teachers were optimistic about how the current
generation “will turn out.”


B. Method 


The central aim of the present study was partially to measure the genera- 
tion gap and to compare on a number of demographic and attitudinal variables 
the adolescents for whom the gap is relatively large and those for whom it is 
practically nonexistent; that is, we compared those who perceive themselves 
as farthest away from the values of their parents to those who find themselves 
in closest agreement. 


The design for this study employed a multistage cluster probability sample 
stratified by the six geographic “common regions” into which the state of 
Pennsylvania is divided, and by Catholic versus public schools. Sophomore, 
junior, and senior classes were then chosen within selected schools. Since 
selections were made from required course lists, each student had an equal 
chance of being drawn into the sample. 

The study yielded a total of 2362 questionnaires completed by students in 
54 high schools across the state of Pennsylvania. During the week of April 
28, 1969, these self-administered questionnaires were distributed and collected 
by volunteer workers who were recruited by the Governor’s Council for 
Human Services and trained by the Institute for Survey Research at Temple 
University in the standardized procedures to be followed in questionnaire 
administration. 


1. Degree of Dissidence 


A questionnaire dealing with perceived extent of agreement with parents on 
15 social issues was used as the basic criterion measure for the present study. 
Table 1 displays these items and the percentage of perceived agreement-dis- 
agreement for each. In looking at the items individually, it is apparent that 
only for three items (homework, spending money, and hair style) are there 
more students expressing total disagreement than there are expressing total 


TABLE 1 
“DO YOU AND YOUR PARENTS FEEL EXACTLY THE SAME, MUCH THE SAME, 
SOMEWHAT THE SAME, OR NOT AT ALL THE SAME ABOUT EACH 
OF THE FOLLOWING SUBJECTS?” 

                              Exactly    Much       Somewhat   Not at all No 
Social issues                 the same   the same   the same   the same   answer 
Religion                      16.2       38.6       30.4       11.6       .7 
Your homework                 16.3       32.1       31.6       17.0       .6 
Your choice of friends        18.9       42.4       25.2       10.7       .5 
How you spend your money      13.9       35.8       31.5       16.0       .4 
Your choice of clothing       18.5       42.3       25.7       10.7       .4 
The way you wear your hair    18.8       33.1       25.2       20.1       .4 
A college education           47.4       23.6       15.5       10.6       .4 
Living in your neighborhood   46.9       30.6       13.5       6.0        .5 
The war in Vietnam            25.9       32.6       26.3       11.9       [illegible] 
Smoking cigarettes            49.0       15.8       11.8       20.6       .3 
Smoking marijuana             69.2       7.9        5.2        14.8       .4 
Drinking alcohol              49.9       17.4       31.1       16.8       .3 
Driving cars                  34.2       36.4       18.3       8.3        .3 
Sex before marriage           41.3       18.9       17.8       18.7       .8 
Discipline                    22.0       35.5       26.9       12.8       .4 

Note: Data are given in percentages. Totals across Table 1 are not equal to 100%, 
since some respondents (2.4%) were not living with parents. 
agreement. Further, these three items may be categorized as rather innocuous. 
They are at least perennial parent-child conflict sources and may not reflect 
the depth of philosophical disagreement proposed by such authors as Frieden- 
berg. It is interesting that the greatest degree of total agreement is found for 
health-related issues, such as smoking marijuana and cigarettes and drinking 
alcohol. Also a substantial number claimed to agree totally with their parents 
on two issues reflecting more directly on the middle-class delayed gratification 
pattern—i.e., going to college and premarital sex. 

In general, then, it must be said that the students in the sample exhibited 
more harmony than discord with parental viewpoints, and that the questions 
on which there was most disagreement may be less important than those on 
which agreement was highest. However, it should also be pointed out that a 
small but consistent minority of students reported complete disagreement with 
parents over a variety of issues. It therefore remains to focus on the discord 
that does exist and to identify correlates and possible sources. 


2. Establishing the Dissidence Scale 


The 15 items were combined into a scale by totaling scores for each student 
on each item. The highest possible score would be obtained if the student 
answered “‘not at all the same” for all 15 items, while the lowest possible 
score would reflect complete perceived agreement on all items. The top- and 
bottom-scoring 27% of these students were then selected as criterion groups 
to represent the extremely dissident and extremely nondissident students. The 
split-half reliability of the scale was found to be .83, while the “odd” and 
“even” totals correlated with the total question .985 and .955, respectively. 
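
The scale construction described above can be sketched in code. The following is a minimal illustration, not the authors' procedure: it assumes each response is coded 1 (“exactly the same”) through 4 (“not at all the same”), uses simulated responses rather than the survey data, and computes an uncorrected odd-even split-half correlation (the source does not say whether a Spearman-Brown correction was applied). All function names are hypothetical. 

```python
import random
import statistics


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def criterion_groups(scores, fraction=0.27):
    """Return index lists for the bottom- and top-scoring fraction of students."""
    k = int(len(scores) * fraction)
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    return order[:k], order[-k:]


def split_half_r(data):
    """Correlate totals on odd-numbered items with totals on even-numbered items."""
    odd = [sum(row[0::2]) for row in data]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in data]  # items 2, 4, 6, ...
    return pearson_r(odd, even)


# Simulated data (an assumption): 200 students, 15 items, coded 1..4.
rng = random.Random(0)
data = []
for _ in range(200):
    tendency = rng.uniform(1, 4)  # each student's general level of disagreement
    row = [min(4, max(1, round(tendency + rng.gauss(0, 1)))) for _ in range(15)]
    data.append(row)

scores = [sum(row) for row in data]    # possible range: 15 to 60
low, high = criterion_groups(scores)   # nondissident and dissident groups
r = split_half_r(data)
print(len(low), len(high), round(r, 2))
```

Selecting only the extreme 27% at each end discards the middle of the distribution, which sharpens the contrast between groups at the cost of sample size. 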

The remaining analyses deal with the ability of a variety of demographic 
and attitudinal variables to discriminate the low dissidence from the high 
dissidence students, in an attempt to identify correlates of the generation gap 
insofar as it exists among these students. 


3. Hypothesized Correlates 


General fundamental hypotheses were tested in the present study, regarding 
both demographic and attitudinal variables. Specifically, it was predicted that: 

1. Males would tend to be more dissident than females. 

2. Students in higher grades would tend to be more dissident than those 
in lower grades. 

3. Blacks would tend to be more dissident than whites. 

4. Catholics would tend to be less dissident than other students, while 
Jewish students would tend to be more dissident than others. 

5. Those in the high dissidence group would tend to spend less time with 
their parents than those in the low dissidence group. 

6. Those in the high dissidence group would tend to feel more separated 
from members of their family and school personnel, and closer to other 
adolescents than those in the low dissidence group. 

7. High dissidence more than low dissidence students would tend to reject 
traditional sources of advice and comfort, such as religion. 

t tests for difference between proportions were used to check the significance 
of results. All findings that are reported are significant at the .05 level of con- 
fidence unless otherwise indicated. 
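
As a rough illustration of this kind of test, the sketch below compares two independent proportions using a pooled standard error. With groups of about 600 students each, a t test and the normal-approximation z test give essentially the same result, so the z form is used here. That substitution, the function name, and the use of the sex percentages from Table 2 (64.3% male in the high group of 602, 44.7% male in the low group of 590) as inputs are assumptions made for the example, not a reconstruction of the authors' computations. 

```python
import math


def two_proportion_z(p1, n1, p2, n2):
    """Two-sided z test for the difference between two independent proportions.

    p1, p2 are sample proportions (0..1); n1, n2 are the group sizes.
    Returns the z statistic and the two-sided p value.
    """
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided tail area of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Proportion of males: high dissidence group versus low dissidence group.
z, p = two_proportion_z(0.643, 602, 0.447, 590)
print(f"z = {z:.2f}, significant at .05: {p < 0.05}")
```

A difference is treated as significant at the .05 level when the two-sided p value falls below .05, which corresponds to an absolute z of about 1.96 or more. 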


C. Results 

1. Demographics 


The general notion that males tend more often to be in disagreement and 
conflict with parents, while girls tend to be more acquiescent and family- 
oriented, is substantiated in Table 2. As predicted in hypothesis 1, there was a 
significantly larger proportion of males in the high group than the low 
category. Conversely, females showed a significantly larger percentage falling 
into the less dissident group, as opposed to the high group. This seems to be 
in line with the findings of Douvan and Adelson who found males, as opposed 


TABLE 2 
DEMOGRAPHIC VARIABLES BY DISSIDENCE GROUP (PERCENTAGES) 

Variables              Low            High 
Sex 
  Male                 44.7           64.3* 
  Female               53.5           [illegible] 
Grade in school 
  Sophomore            38.0           30.9* 
  Junior               32.3           35.2 
  Senior               29.7           33.9 
Race 
  White                96.3           94.0 
  Negro                3.6            6.0 
  Oriental             [illegible]    - 
Religion 
  Catholic             46.2           50.2 
  Protestant           47.8           46.1 
  Jewish               6.0            3.7 

Note: The percentages reported here are based on sample sizes of 590 for the low 
dissidence group and 602 for the high dissidence group. Any column total less than 
100% was due to nonresponse to the item. 

* Significant at the .05 level or beyond. 
to females, to be less well integrated into, and more uncomfortable with, 
family structure and patterns of living (1). 

When the data are examined by grade of student, some support is given to 
hypothesis 2. Significantly more sophomores are in the low dissidence group 
than in the high dissidence group. Juniors and especially seniors, however, 
have greater representation in the high dissidence group than in the low, 
though the results for these two groups are not statistically significant. If a 
linear relationship holds for future development, one might expect to witness 
an extension of this trend as the same students progress through college. 
Rather than coming closer to agreement with parents’ perceived opinions and 
attitudes as these students grow older, they seem to find more basis for dis- 
agreement. It is also of interest that the two individual items in the scale 
showing the greatest increase in disagreement with year in school are two of 
the most controversial—smoking marijuana and premarital sex. (Those data 
are not presented here because of time and space limitations.) 

Data concerned with race of the student lend some support to hypothesis 3, 
although the results are not statistically significant. It was thought that white 
students would be concentrated, to a larger extent, in the low rather than high 
group, and blacks would be characterized by greater dissidence. It should be 
noted, however, that since blacks constitute a relatively small percentage of 
both the high and the low groups, the difference between the two proportions 
is, of course, restricted in potential size. 

Investigation of students’ religion yields a somewhat surprising finding 
which should be discussed, although the results are not statistically significant. 
Contrary to the trend predicted in hypothesis 4, Jewish students are more often 
contained in the low dissidence group, while a slight majority of Catholics 
appears to be discordant. The high percentage of Catholics in both groups is 
due to the survey design which called for an oversampling of parochial school 
students. It may be that the Catholic students drawn primarily from parochial 
schools are reacting at least verbally against the traditional staunch discipline 
imposed upon them, in light of the contrasting contemporary adolescent 
permissiveness. 


2. Behavioral and Attitudinal Variables 


The preceding data have served to distinguish between the two criterion 
groups on several standard antecedent, demographic dimensions. The remain- 
ing discussion will concern results and trends indicated by a set of inter- 
mediate factors, including students’ activities and associated attitudinal vari- 
ables. Through the years, but especially recently, a number of writers have 
spoken of the estrangement and alienation of contemporary youth. As will be 
seen, this idea is supported by a number of reported behaviors and expressed 
attitudes. 

First, in hypothesis 5, it was predicted that contact between parents and 
their children might be a crucial factor in determining concordance or dissent. 
It was thought that increased contact would foster, or at least reflect, better 
relations and acquiescence, whereas lack of contact would be indicative of 
dissidence. The data in Table 3 substantiate this hypothesis; that is, in every 
case, those students engaging in a particular form of recreational, or at least 
extra-domestic activity with their parents constitute a higher proportion of 
the low dissidence group than of the high dissidence group. Only the differ- 
ences found for camping were not statistically significant. Although the data 
are not presented here, those who did not participate in familial activities 
make up a significantly higher proportion of the high dissidence group in each 
case. Further, if large overall differences in the frequency of each type of 


TABLE 3 
ATTITUDINAL VARIABLES BY DISSIDENCE GROUP (PERCENTAGES) 

Variables                                            Low     High 

During the past month, did you do any of the following 
with one or both of your parents? 
  Gone camping                                       4.9     3.4 
  Gone on a weekend holiday (or a longer holiday)    21.0    13.3* 
  Gone to a park or museum                           12.8    [illegible] 
  Gone to a sports event                             25.4    11.4* 
  Attended a party                                   26.5    [illegible]* 
  Attended a concert, symphony, or play              20.3    6.1* 
  Eaten in a restaurant                              55.5    [illegible] 
  Attended religious services                        70.6    50.6* 
  Gone to the movies                                 16.1    [illegible] 
When you have a social or personal problem, from whom 
do you get good advice? 
  Your parents                                       76.4    37.9* 
  Your brothers and sisters                          23.4    15.8* 
  Teachers                                           12.5    6.8* 
  Your friends                                       45.1    52.7 
  Relatives and others                               3.8     1.3 
  No one                                             4.1     18.6* 
Do you find your religious beliefs helpful in dealing with 
problems of living? 
  Yes                                                83.7    50.4* 
  No                                                 15.4    43.6* 
  No religious preference                            .3      6.0* 

Note: The percentages reported here are based on sample sizes of 590 for the low 
dissidence group and 602 for the high dissidence group. Because of multiple responses 
to the first two questions, those columns sum to more than 100%. 

* Significant at the .05 level or beyond. 
activity are taken into account, the relative differences between the dissident 
and nondissident groups do not shift remarkably from activity to activity. It 
is not clear, however, whether the family activities lead to increased con- 
sonance between youth and adult, or merely reflect the agreement that is 
already present. It may well be that both are true to some extent, with basic 
agreement reflected in mutual activities which, in turn, reinforce parent-child 
understandings. 

Perhaps more directly indicative of the greater psychological isolation 
experienced by those in the high dissidence group are responses to a number of 
perceptual and attitudinal questions. For example, the question was asked, 
“From whom do you get good advice?” Data obtained from responses to this 
question lend some support to hypothesis 6. Table 3 shows that about 19% 
of the high dissidence group, compared to 4% of the low dissidence group, 
reported that they received good advice from no one. Also, parents, siblings, 
and teachers all are regarded as good sources of advice more often by the low 
than the high dissidence group. On the other hand, friends are more often 
regarded as good advisors by the high dissidence group than by the low group. 
This is perhaps indicative of the relative alienation of the more dissident 
youth from parental and general societal values, and of a concomitant close- 
ness and involvement with the adolescent society. 

Another item significantly distinguishing low from high dissidence students 
is one regarding the practical value of religion in helping to solve everyday 
problems. As predicted in hypothesis 7, the highly dissident youth are far 
more likely than the low dissidence group to say that religion does not help 
them. In addition, the high dissidence group members often answered that 
they had “no religious preference.” 

Both traditional sources of advice and a traditional source of comfort in 
adversity, then, tend to be rejected at a much higher rate by students who are 
in general disagreement with their parents on a variety of contemporary issues. 


D. Conclusions 


In general, less dissidence than harmony between parental and adolescent 
viewpoints was found in this study of the generation gap. Further, the issues 
involving most disagreement were traditional and perhaps relatively super- 
ficial. It is evident that these students were not categorically rejecting their 
parents’ values or their company, and that they seemed to have been enjoying 
some rapport with them. 

However, it is also true that students differing from perceived parental 
point of view may be strong in influence though limited in number, and, on 
every issue, a small but consistent minority reported extreme divergence from 
the feelings they ascribed to their parents. It was therefore of some importance 
to seek correlates of the disagreement that does exist. 

Regarding these correlations, it would seem that the dissident group could 
be characterized as largely male and upperclassmen. The increase of dissidence 
with year in high school indicates that the alienation and active dissent as- 
sociated with many college youths may germinate long before it is behaviorally 
manifested, and that efforts to reduce the alienation should be initiated early 
in the home and school educational processes. Blacks and Catholics tended 
to be somewhat, though not significantly, more dissident than whites and 
other religious groups, respectively. All but the findings concerning religion 
were in the direction of hypothesized relationships. The dissident group was 
also characterized by a significant degree of physical and psychological aliena- 
tion, especially from other family members. 

Most behaviors reported in the questionnaire perhaps reflect directly what 
may be termed a general alienation hypothesis. It was predicted, for example, 
that those students most in conflict over values with their parents would also 
spend least time with them, and this was confirmed on a number of recrea- 
tional activity dimensions. It was also predicted and confirmed that relative 
physical and attitudinal estrangement would be related to a feeling of isola- 
tion and avoidance of traditional sources of advice. 

Unfortunately, because of the dearth of existing empirical data, no concrete 
conclusions may be drawn concerning either the ultimate achievements and 
adjustments of these adolescents or their uniqueness compared to their pre- 
decessors. From a research point of view, it is to be hoped that a marked 
increase in empirical investigations will follow so that changes may be 
detected, and long-range effects assessed. 


E. Summary 


In 1969, self-administered questionnaires were completed by a sample of 
2362 high school students in the state of Pennsylvania. Questions about dif- 
ferent social attitudes and behaviors were asked in order to determine cor- 
relates of student dissent and alienation. A scale measuring the perceived 
amount of agreement with parents on 15 social issues was used to establish 
criterion groups of extremely high and extremely low dissidence students. 
These two groups were then compared on a number of demographic and at- 
titudinal variables. 

Sex and grade in school emerged as the strongest predictors of dissidence. 
Male students and students in higher grades were more likely to be highly 
dissident than females and students in lower grades. Differences found for 
race and religion were not statistically significant. 

It was found that among the most dissident students were more individuals 
who spend little time with their parents, who tend to feel more separated 
from adults, and who are more inclined to reject traditional sources of advice 
and comfort, such as religion, than in the low dissidence group. 
Nathan Rosenberg and Sidney Merlis 


A. Introduction 


There is theoretical significance and clinical utility in developing a 
personality measure predictive of a propensity toward heroin addiction. It can 
be theorized that drug abusers represent one instance of a personality that is 
vulnerable to addicting agents; that is, there is a similar underlying 
personality structure that gives rise to addictions. The particular agent 
initially used for addiction may be a function of its availability, psychological 
and economic costs, etc. If this is the case, we would hypothesize no differences 
on such a scale between alcoholics and heroin addicts. 

Because of the continued problem of drug abuse in our society, the clinical 
value of a scale that has utility in identifying potential heroin abusers and 
present users of heroin has obvious practical meaning which includes a basis 
for intervention. In this regard, we would hypothesize that heroin abusers 
and addicts would score higher on such a scale than nonheroin addicts or 
abusers. 

The first step in a process of developing such a scale has been provided by 
Cavior, Kurtzberg, and Lipton (1). Fifty-seven items were gleaned from the 
Minnesota Multiphasic Personality Inventory (MMPI) self-report which 
successfully discriminated male adolescent and adult heroin addicts from a 
sample of inmates from the New York City Department of Correction who 
were similar in age but had no history of drug abuse. 

The purpose of this study is to cross-validate and extend the use of the 
heroin addiction (He) scale while gathering data that may also reflect on the 
validity of the hypothesis of an addiction-prone personality type. Specifically, 
this study measures the ability of the He scale to differentiate a second larger 
sample of male heroin addicts from the normative samples. Secondly, it deter- 
mines the ability of the scale to discriminate between samples of male alcoholics 
and heroin addicts. 


B. Method 


Data are reported for 274 male heroin addicts committed to Central Islip 
State Hospital (CISH) by a probate court procedure for a 90-day treatment 
period.² Within five days of admission, a social history was taken for each 
patient to obtain biographical, educational, and occupational data and a history 
of drug use, and each patient was offered a battery of psychometric tests, one 
of which was the MMPI. In an effort to ensure validity of patient response, 
no one was coerced into testing. All patients had undergone physical with- 
drawal prior to their admission to CISH. Although lower cutting scores were 
indicated by Cavior, Kurtzberg, and Lipton (1) for Puerto Rican samples, 
for the purposes of this analysis heroin addict patient data were dichotomized 
on a Caucasian versus non-Caucasian basis. 

The alcoholic data were derived from a sample of 111 male veterans ad- 
mitted to the Fort Meade (South Dakota) Alcoholic Treatment Unit.³ All 
patients in this sample had been diagnosed as alcoholic and entered in a treat- 
ment program for alcoholism. None were experiencing symptoms of acute 
alcoholism or withdrawal at the time of testing. A second sample of 117 male 
alcoholics newly admitted to Rusk State Hospital (Texas) represented a 
continuum of drinking behavior ranging from moderate drinkers to heavy 
abusers of alcohol.⁴ 

He normative data were abstracted from the article by Cavior, Kurtzberg, 
and Lipton (1).⁵ All patient MMPI responses were key punched and scored 
for the He scale by computer.⁶ Tests of significance (t) were computed to 
identify the ability of the He scale to differentiate among these samples. 


C. Results 


Hypotheses regarding cross-validation, extension, and clinical utility of the 
He scale, rather than theoretical implications, are considered first. Did the He 


2 We thank Francis J. O'Neill, M.D., Senior Hospital Director of Central Islip 
State Hospital, for his cooperative efforts in making the patients available for testing. 

3 We thank Albert Uecker, Clinical Psychologist, Veterans Administration Hospital, 
Fort Meade, South Dakota, for providing the MMPI protocols of the South Dakota 
sample of male alcoholic patients. 

4 We thank Jerry H. Patrick, Assistant Professor, University of Texas Medical 
Branch, for providing the MMPI protocols of the Texas sample of male alcoholic 
patients. 

5 We thank Stanley Einstein, Editor of the International Journal of the Addictions, 
for permission to reproduce the Cavior, Kurtzberg, and Lipton data. 

6 We thank Charles Pautler, Programmer, Computer Service, National Institute 
of Mental Health. 
scale replicate; are there significant mean differences between the CISH 
samples of heroin addicts and the normative nonaddict groups? Tests of 
significance (t) were computed between the CISH Caucasian addicts and the 
Caucasian, Negro, and Puerto Rican normative nonaddict groups. Significant 
mean differences between CISH Caucasian and nonaddict Caucasian (t = 
4.20, p < .001), Negro (t = 6.88, p < .001), and Puerto Rican groups (t = 
7.15, p < .001) are reported in Table 1. Similarly significant mean differences 
between CISH non-Caucasian addicts and nonaddict Caucasian (t = 4.52, 
p < .001), Negro (t = 7.29, p < .001), and Puerto Rican samples (t = 8.05, 
p < .001) are seen in Table 1. The first aspect of the cross-validation pro- 
cedure demonstrates that the CISH samples of heroin addicts scored signifi- 
cantly higher than the nonaddict inmate normative sample. These data extend 
the ability of the He scale to separate addict from nonaddict samples on the 
basis of mean scores in a psychiatric sample. 
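
Comparisons of this kind can be approximated from the published summary statistics alone. The sketch below is an illustration under stated assumptions, not the authors' computation: it uses a pooled-variance t test for independent samples (the source does not say which variant was used), a hypothetical function name, and the CISH Caucasian (mean 35.3, SD 6.3, N 155) and inmate nonaddict Caucasian (mean 31.4, SD 6.0, N 60) figures as read from Table 1. Because the table values are rounded, the result should only be expected to land near the reported t of 4.20. 

```python
import math


def t_from_summary(m1, sd1, n1, m2, sd2, n2):
    """Pooled-variance independent-samples t statistic from means, SDs, and Ns.

    Returns the t statistic and its degrees of freedom.
    """
    df = n1 + n2 - 2
    # Weighted average of the two sample variances.
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se, df


# CISH Caucasian addicts versus inmate nonaddict Caucasians.
t, df = t_from_summary(35.3, 6.3, 155, 31.4, 6.0, 60)
print(f"t({df}) = {t:.2f}")
```

The same function can be applied to any other pair of rows in Table 1 by substituting the corresponding means, standard deviations, and sample sizes. 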

A related issue is the mean difference in scores between ethnic and racial 
addict groups on the He scale. Significant mean differences between the CISH 
Caucasian and normative Caucasian (t = 4.84, p < .001), Negro (t = 3.55, 
p < .001), but not Puerto Rican (t = 1.31, NS) addicts are seen by 
comparing row one with row three in Table 1. Similarly, significant mean 
differences between the CISH non-Caucasian and normative Caucasian (t = 
5.29, p < .001), Negro (t = 3.87, p < .001), but not Puerto Rican addicts 
(t = 1.49, NS) are seen in Table 1. These data show that both CISH 


TABLE 1 
MEANS, STANDARD DEVIATIONS, AND SAMPLE SIZES FOR THE VARIOUS NORMATIVE 
AND CROSS-VALIDATION SAMPLES ON THE MMPI HEROIN ADDICTION SCALE 

Sample types              Mean    SD             N 
CISH addicts 
  Caucasian               35.3    6.3            155 
  Non-Caucasian           35.3    5.3            119 
Inmate nonaddictsᵃ 
  Caucasian               31.4    6.0            60 
  Negro                   28.6    6.8            60 
  Puerto Rican            27.6    5.1            40 
Normative addictsᵃ 
  Caucasian               39.8    [illegible]    60 
  Negro                   38.6    5.5            60 
  Puerto Rican            35.7    4.8            40 
Alcoholic patientsᵇ 
  South Dakota            29.5    5.5            111 
  Texas                   28.9    6.6            117 

ᵃ Data from Cavior, N., Kurtzberg, R. L., & Lipton, D. S. The development and 
validation of a heroin addiction scale with the MMPI. Internat. J. Addict., 1967, 2, 
129-137. Printed with permission. 
ᵇ See footnotes 3 and 4 for source of data on alcoholic patients. 
addict samples have scores similar to the Puerto Rican inmate addict sample, 
and score lower than the Caucasian and Negro inmate addict samples (see 
Table 1). 

The theoretically relevant hypothesis predicted no significant differences 
between the samples of heroin addicts and the two samples of alcoholic 
patients. Tests of significance (t) were computed between the CISH Caucasian 
addicts and the South Dakota (t = 7.87, p < .001) and Texas (t = 8.24, 
p < .001) alcoholic patients. Similarly, significant mean differences between 
the CISH non-Caucasian addicts and the South Dakota (t = 8.23, p < .001) 
and Texas (t = 8.33, p < .001) alcoholic patients are seen in Table 1. The 
second aspect of the cross-validation procedure demonstrates that the CISH 
samples of heroin addicts scored significantly higher than the alcoholic samples 
on the He scale. While these data cannot be considered sufficient to reject the 
hypothesis of a similar underlying personality structure giving rise to addiction, 
they certainly raise some questions in this regard. 


D. Discussion 


The MMPI is a frequently used clinical instrument. There is adequate 
support for a wide variety of information with regard to personality, character 
structure, and psychopathology that can be gained from the MMPI self- 
report. The data reported in this study support cross-validation of the He 
scale. The development of the scale in a prison population has been partially 
extended to a psychiatric population. Presumptive evidence has been presented 
suggesting certain dissimilarities between patients addicted to heroin and 
alcoholic patients in their responses on the He scale. If both groups of patients 
represented homogeneous groups, we would have expected similar responses 
to the He scale. Measures of central tendency and variability were statis- 
tically similar for both alcoholic samples reported in this study on the He 
scale. These data would suggest that alcoholic patients may be similar, as 
viewed by this scale, but they are different from heroin addicts. 

There is need for further cross-validation and item analysis of the He 
scale. The racial and ethnic differences in response to He items indicate the 


1. General Emotional Tone (Depressed) 


Life is uninteresting, morale is low; they are inert. They lack meaningful 
goals and a strong system of values. They acknowledge anhedonia, feel weak, 
inadequate, and hopeless, and tend to focus on past losses and mistakes. They 
experience daily tension, are resentful, and feel as if they have been “dumped 
on.” 


2. Religious Attitudes (Ambivalent) 


Religious conflict seems apparent in these people. It is manifested in denial 
and rejection of religious attitudes and feelings, or a seeking for proof for 
religious existence. Their responses reflect ambivalence in this regard. 


3. Family Relationships (Rejecting) 


Family interactions can be characterized as marked with tension, irritability, 
resentment toward parents and siblings, and a general lack of support from 
family members. Parental conflict over choice of friends and a feeling that 
the family has been demanding of them are emphasized. This may drive them 
away to seek solace in peer activities. 


4. Peer Activities (Undermining) 

They are sensitive to others’ feelings and acknowledge the need to conform 
to peer pressures at an early age. They displace their families and become 
loyal to peer groups which, in fact, have no basic loyalty. This only reinforces 
their emptiness and loneliness. They conform to a code that is different from 
larger society and is supported by their strong feelings of resentment toward 
those they dislike who are symbols of the society and feelings they cannot 
regain. 


5. Authority (Resentment) 


They feel as if they are the object of a major injustice which is traceable 
to the family and supported by other institutions in society: i.e., school, etc. 
They are resentful of this, reject the major values and customs of society, 
and enjoy flouting the rules. They develop their own rules. They lack social 
interests and are rebellious toward authority. 


6. Psychosexual Development (Confused) 


They acknowledge preoccupation with sexual matters, chaotic sexuality 
represented in bisexuality, and/or homosexuality. 


7. Defense Mechanisms 


Denial, distortion, projection, role playing, and adopting grandiose attitudes 
are their major defense mechanisms. 
E. Summary 


A heroin addiction scale (He) gleaned from MMPI self-report was de- 
veloped on a sample of 63 prisoner addicts. It was later cross-validated on a 
sample of 160 prisoner addicts. One purpose of this study was to cross-validate 
the He scale on a larger sample of male narcotic addicts (N = 274) while 
extending the use of the scale to this sample drawn from a psychiatric installa- 
tion. Additionally, attempts were made to evaluate responses of narcotic 
addicts and two samples of male alcoholic patients (N = 117 and N = 111) 
on the He scale. 

On the basis of the data reported, the following conclusions seem justified. 
The He scale discriminates heroin addicts from alcoholics in samples treated at 
psychiatric installations. Whatever the attribute or attributes underlying the 
He scale, alcoholics and heroin addicts differ in intensity. Heroin addicts score 
significantly higher. There appear to be possible racial and ethnic response 
differences on the He scale which indicate the need for further item analyses 
and refinement of items. But, in view of the incidence and prevalence of drug 
abuse and addiction in the society, continued reporting of these data seems 
warranted. 
STRATEGIES IN ARGUMENT* 


Institute for Juvenile Research, Chicago 


Merton S. Krause 


A. INTRODUCTION 


An argument focused on resolving a difference of opinion is a kind of serious 
“game” of interpersonal conflict. It consists of a sequence of moves by which 
the parties take their alternate turns. Each move has two aspects, for it is 
both a response and a presentation (or stimulus). A series of moves concern- 
ing the same issue constitutes a functional phase of the argument, a discrete 
engagement. The plan of an engagement (with regard to the sequence of 
points and concessions to be made and the methods of making them) is the 
lowest or most concrete level of strategy in argument.² At this level there are 
a variety of both offensive and defensive strategies distinguishable. 

The class of argument engagement strategies we shall consider constitutes 
an aspect of rhetoric concerning (a) what beliefs of an opponent are to be 
altered, (b) which of his (and one’s own) beliefs are to be discussed, (c) 
what issues (or disagreements) are to be disclosed as such, (d) what role 
is the opponent to be induced to play in the engagement, and (e) how are his 
moves on one’s own various beliefs to be countered. Discrepancies between 
which opponent beliefs are to be altered and which are to be discussed indicate 
indirection, which is a first variable for characterizing such strategies. Dis- 
crepancies between which opponent beliefs are to be altered and which are to 
be permitted openly to define currently arguable issues indicate deception (or 
masking), a second “metastrategic” variable. 

It is possible to be indirect without deception, by openly attacking a belief 
that is only of interest because of its contribution to the belief one admittedly 
wants to alter. It is equally possible to be indirect with deception. And it is 
also possible to be direct with deception, by discussing a belief but not making 
an issue of it, as well as without any deception. Thus, indirectness and 
deception are logically independent of each other. 


* Received in the Editorial Office on April 10, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
2 The internal structure of a move (or message) is taken here to be a tactical 

A third “metastrategic” variable lies in attempting to induce the opponent 
into action such that he openly deploys his evidence, logic, defensive strategies, 
attitudes, affect, etc. in disclosing or defending his beliefs. An inductive 
strategy can be either direct (by attempting to get the opponent to discuss the 
beliefs one wants to alter) or indirect, and either deceptive (by attempting 
to avoid the opponent’s defining issues on target or central beliefs in his dis- 
cussion) or undeceptive. Noninductive strategies of these several sorts were 
already considered above; so induction is, then, logically independent of both 
indirection and deception. This implies that there are at least eight sorts of 
offensive argument engagement strategies for attempting to alter an opponent’s 
beliefs. 
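
The count of eight follows from crossing three independent two-valued variables (2 × 2 × 2). A minimal sketch of that enumeration is given here; the dictionary keys and value labels are illustrative assumptions chosen for readability, not the author's terminology for the individual strategies.

```python
from itertools import product

# The three "metastrategic" variables from the text, each treated as binary.
# The value labels are hypothetical names introduced for this illustration.
VARIABLES = {
    "indirection": ("direct", "indirect"),
    "deception": ("undeceptive", "deceptive"),
    "induction": ("noninductive", "inductive"),
}


def enumerate_strategy_types():
    """Return every combination of one value per variable."""
    names = list(VARIABLES)
    return [dict(zip(names, combo)) for combo in product(*VARIABLES.values())]


strategy_types = enumerate_strategy_types()
for strategy in strategy_types:
    print(", ".join(strategy.values()))
print(len(strategy_types))
```

The same crossing applies to the defensive case, which is why the text arrives at another eight strategy types there.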

Likewise, there are another eight defensive argument engagement strategies 
for attempting to preserve one’s own beliefs (see Table 1 below). Discussing 
beliefs other than those one intends to preserve indicates defensive indirection. 
Defining currently arguable issues only on beliefs one does not intend to 
preserve indicates defensive deception. Attempting to draw out the opponent’s 
complete attack and/or his grounds for selecting and mounting it indicates 
defensive induction. As in the case of offense, the three “metastrategic” vari- 
ables are logically independent in defense. In the following sections these 
variables will be used to define several strategies, most of which were originally 
adapted from Hart’s (7) analysis of military strategy. 


B. OFFENSIVE STRATEGIES 


The simplest and most common species of offensive strategy is the frontal 
assault (e.g., 9, 10). The belief that one wishes his opponent to adopt is 
openly advanced and supported. Incompatible beliefs are openly attacked. If 
the opponent holds no incompatible beliefs and is disposed to be persuaded by 
one, as in some classrooms, this strategy is most efficient. However, if these 
conditions are absent, the engagement will probably tend to settle into a 
static front or issue in which strategic advantage is expended, through 
self-disclosure, for one example. Then success depends upon tactical and logis- 
tical superiority: i.e., upon force of argument. Apparently, much of the satis- 
faction in verbal conflicts derives from such static battles of mutual display 
and attrition. The emotional release, social intimacy, and serious response to 
one’s moves often seduce the parties from some calm commitment to gaining 
advantage into an exhilarating commitment to expending advantage in battle for 
the promise of the opponent’s immediate, overt, and self-conscious capitulation 
in defeat. The anger, shame, or guilt induced by defeat and the continued 
existence of beliefs incompatible with those conceded to be true may jeopardize 
whatever persuasion was achieved, as well as the framework for subsequent 
engagements (e.g., 5, esp. pp. 84-122). 

The nearest relatives to frontal assault are much like it, but they involve 
an element of indirection. Outflanking openly attacks only beliefs that lend 
support to those actually intended for change.³ If these supporting beliefs are 
strongly held, decisive outflanking must risk becoming immobilized in 
disputing them. If the supporting beliefs are weak but numerous, decisive 
outflanking risks prolonging the engagement, exposing one’s objective (unless 
the deception is masterful), and so permitting the opponent either to force the 
underlying issue, exposing one’s indirection, or to perseverate on small issues 
until the strategy collapses from fatigue and exasperation. These risks, plus 
the great knowledge of the structure of the opponent’s beliefs needed for the 
proper engagements to be undertaken, and also the patience and skill required 
for its execution, are the disadvantages of outflanking. Its advantage is that 
if it fails, but is not exposed (not identified as that strategic move), the under- 
lying issue has not been exposed or become necessarily less accessible to persua- 
sion. If outflanking succeeds and the opponent’s central belief is cut off from 
enough of its support, one can frontally assault it or adopt one of the con- 
structive indirect strategies to replace it. 

Deception is a necessary part of outflanking when the opponent is unwilling 
to have the central belief at issue altered by one. However, if the basis for this 
unwillingness can be identified, it may be the appropriate objective of an 
initial masked outflanking move [see Fenichel (4, esp. pp. 36-38) and Katz 
(11, pp. 180-190)]. Thus, it might be best to make the opponent see how, in 
general, changing his mind can be more intellectually honest than blindly 
defending old beliefs. There are, of course, friendly sophisticated debates on 
sufficiently innocuous issues which permit strategic moves being made openly 
without disadvantage. 

The ultimate in both deception and indirection can be practiced in the 
subversion species of strategy. The existence of the critical issues is masked 
altogether, or it is agreed that they are to be handled without conflict, or a 
truce is declared on them. The opponent’s central belief and its grounds or 
cognitive structure are searched for flaws under this covering arrangement 
(and his other beliefs may also be searched for latent incompatibilities with 
the one at issue) to be exploited as needed. These may be attitudes, values, 
standards of logic or evidence, or other beliefs. This strategy fits naturally 
when one is apparently trying to understand, strengthen, learn, test, or 
evaluate the opponent’s beliefs. It can gain one intelligence advantages under 
such guises, even when it fails on the offensive. However, an indirect or 
flanking subversion designed to flaw supportive beliefs is also possible. The 
risks in subversion are of being unmasked and of inadvertent policy shifts due 
to the role-playing (such as falling in love or being persuaded one’s self). In 
open subversion, where issues are admitted to exist, the constraints of the 
role may be quite severe and even policed by the opponent (e.g., in the case 
of admittedly partisan discussion leaders, an impression of fairness must be 
maintained). 


3 This strategy is rare in the experimental literature [but see Kelley and 
Volkart (13) for an unsuccessful example]. Schein (16) discusses outflanking of 
political beliefs through attacks on the pe[...] 

Like subversion, feinting can be used either destructively or constructively. 
Here the opponent is induced to act (argue, think, or feel) in ways that either 
are incompatible with the central belief that is at issue or are more compatible 
with the belief one is interested in substituting for it. The sorts of flaws that 
might be exposed in subversion are created, developed, or proliferated with 
feinting. The feint is the inducement to the opponent to flaw his position 
unwittingly, to move himself into an outflanked or infiltrated position. What 
feints are both available and appropriate for the persuasion depends upon the 
situation and the opponent’s beliefs, but they may be indirect or not, and 
also deceptive or not. Of course, even a successful feint may not achieve the 
intended persuasion because the opponent may not recognize the flaw it 
creates as such. 

Just as outflanking is the paradigm of destructive indirect strategies, infiltra- 
tion is the paradigm of the constructive indirect strategies. Beliefs to which 
the opponent has little or no resistance, but which would tend to support the 
target belief one wants the opponent to hold, are advanced and supported 
(e.g., 12, 14). These implants must not be identified as such and so must not 
too directly imply any resisted beliefs, or they will tend to be resisted them- 
selves, as well as upset the framework of debate (5, pp. 131-176). This 
depends in part both on the naivety of the opponent and on one’s deception 
(often implicit in the distribution of implants). Once the implants are 
established, the opponent must somehow be induced to recognize them as flaws 
relevant to his central belief if they are to be effective. His own processes of 
dissonance reduction may achieve this result without further interventions 
(5, pp. 123-176). 


4 This is the standard strategy used in cognitive dissonance research [see Brehm 
and Cohen (3, pp. 21-50) ]. 
We contend that campaigns of persuasion in argument are necessarily built 
from these strategic elements and that unit messages or moves in argument 
engagements should be conceived as components of these elementary strategies. 
They are, of course, descriptive of only one side of an engagement, for an 
opponent under attack also has several strategic alternatives from which to 
build his campaign of defense or counteraction. 


C. DEFENSIVE STRATEGIES 


Analogous to the offense of frontal assault in its simplicity, ubiquity, un- 
deceptiveness, and directness is the defense of reinforcement. Knowledge of 
relevant facts and arguments (i.e., logistics) used in a strictly responsive 
manner comprises this defense (e.g., 1, 15). It requires little knowledge of 
the opponent and is often difficult to overcome, but it can so occupy the defend- 
ing party as to leave him open to indirect attacks. The cost of extensive rein- 
forcement against many possible lines of attack may well be prohibitive. 

Potentially more costly than reinforcement, especially because of its implicit 
claims of special privilege or violation of procedural rules (and so its 
abrogation of the framework of argument), is fortification to deflect attacks. 
Inattention to, ignoring or forgetting, misinterpretation or unrealistic inter- 
pretation of, emotional disvaluing or discrediting of, or failure to understand 
what the opponent says about (or flaws in) a belief at issue represent this 
rather deceptive and unresponsive strategy of defense (e.g., 8; 17, esp. pp. 
248-253). What one perceives or remembers consciously of an attack or 
flaw, if anything, after the deflection (or distortion) may be called a 
derivative [see Fenichel (4, pp. 15-22)]. A flaw concerning one’s own wishes may be 
resolved, e.g., by displacing the wish to a different object, projecting the wish 
onto the object as his wish for one’s self, or reacting with the substitute 
formation of an opposite wish. A flaw concerning one’s reasons for some 
thought, feeling, or act can be resolved by substituting a less troublesome 
rationalization. We would expect highly valued beliefs crucial to the person’s 
self-concept to be defended by this strategy. 

Camouflage aims by indirection to draw out the opponent’s offensive moves 
so that he will expend strategic advantages, unmask his strategies, expose 
[...] and so fail to attack or gain [...] advance beliefs that one does [...] 
induces the opponent to attack into an ambush (which may be simply heavy 
reinforcements, and then the entrapment is direct). This strategy necessarily 
unmasks itself if the induction is apparently intentional rather than the 
opponent’s misinterpretation or misjudgment, but the engagement is then 
jeopardized only with indirect entrapments. 

The several preceding defensive strategies are designed to preserve one’s 
beliefs for another engagement or argument. In order to prepare for them 
(with the exception of unconscious fortification), one must expect to be 
attacked. But for the best defense one must also adapt it to the structure of 
beliefs he plans to defend and to the most effective lines of attack. This re- 
quires careful self-analysis and, perhaps, some modification of one’s beliefs to 
make them more defensible. 

There are, besides the purely defensive strategies geared to preserving 
one’s beliefs, others geared either to diverting the opponent, by (indirect or) 
diversionary counterattacks, or converting him, by counteroffensives. A coun- 
terattack is uniquely defensive when it is pursued only until the opponent’s 
offensive collapses. A counteroffensive aims to end the argument once and for 
all. It, like the counterattack, is prepared for a specific opponent, whereas 
the other defenses can be more general. It is quite common as an impromptu 
response to a frontal assault (e.g., 6). 

Rather than attempting either conversion of the opponent or frustration of 
the opponent’s offensive, one can also defend one’s beliefs by rendering the 
opponent unable to attack, i.e., tactically disabled. He may be sabotaged with 
defective information or logic upon which to base his arguments, bombarded 
with attacks designed to destroy his will to argue or his ability to maintain 
his strategy, or disarmed by redefining the situation or altering the framework 
from that of argument by using one’s powers of coercion, reward, seduction, 
charisma, etc. 

D. EXAMPLES OF STRATEGIES 


In order to illustrate how arguments are framed in the several proposed 
strategies, some central belief that is attacked and some target belief, to which 
it is to be changed, must be stated. Then offensive and defensive strategies 
can be stated on the same topic. The central and target belief for this illustra- 
tion are as follows: 

Central: So long as government routinely sees to it that its citizens 
can achieve at least a reasonable standard of living, safety of person 
and property, and personal freedom, and so long as government does 
not make excessive demands on its citizens, they are adequately 
represented. 

Target: Government is properly an instrument of its citizens’ will. 
They are adequately represented only when government honestly tries 
to give them whatever they want. Of course, the diversity of and conflicts 
among their wants and the limitations of government resources may 
result in some compromises or failures. 


These two positions, then, demarcate the field of conflict. The strategies 
will be presented in the same order as they were introduced in the preceding 
two sections. 

Frontal assault: Getting what one needs to live reasonably well has no 
bearing logically on how well he is represented because it indicates nothing 
about the amount of say one has in how things are run. If men are free to 
influence government, and government is responsive, they can get both what 
they need and whatever else they want (and value enough) as well. 

Outflanking: How can government decide fairly what constitutes a “rea- 
sonable standard of living . . .” and “excessive demands” for its citizens 
without finding out these things from them? 

Subversion: I think that government’s providing certain essential services 
to and protections of its citizens is the major humanitarian reason for having 
representative government, but I don’t see yet how we can make people 
limit their demands on government to just these essentials. How will we be 
able to enforce such a limit without depriving those who disagree of these 
very essentials, and so depriving them of their representation? 

Feinting: On what grounds do you think citizens could argue most 
reasonably for more influence over government policies than they have at 
some given time? 

Infiltration: What is reasonable living, safety, and freedom for most people 
is an empirical matter. 

Reinforcement: Without assurance of the essential government services and 
protections, provisions for more elaborate mechanisms of representation are 
dangerous because needy or fearful people are susceptible to being influenced 
against their (long run) best interests. Allowing government to expand 
beyond the provision of essentials tends to make it too great an economic (and 
so also political) power in defending its own organizational interests, too 
complex to be evaluated by ordinary citizens (or any single person), and too 
involved in policies on which no substantial consensus can be reached. 

Fortification: That’s not to the point at all. Without his needs being met, 
no man is free. 

Camouflage: Actually, I’m just about convinced that it would be better to 
reduce government’s functions and size as much as would be consistent with 
minimal national security, and so reduce the range of (government) politics 
to a minimum. 

Entrapment: Needy frightened people cannot be reasonably expected to 
pursue their own (long run) best interests. [If the opponent agrees, then a 
counterstroke for a nonpolitical definition of interest has been acceded to; but 
if he disagrees, then deepen the trap by asking if there are any circumstances 
when the citizenry may be deflected from these interests.] 

Diversionary counterattack: People must be helped to assess their own 
needs. Left entirely on their own, most people could not construct or select 
government policies that would be likely to assure a reasonable standard of 
welfare for the most of their fellows. Those least powerful and most atypical 
would live badly: witness our own racial minorities. 

Counteroffensive: What government tries to do cannot define how 
representative it is, because only what it actually does can give the citizens what 
they want. Most people probably want a reasonable standard of living, safety 
of person and property, and personal freedom (including that from excessive 
government demands). 

Sabotage: It’s become rather clear from public opinion studies that people 
want a decent life and the opportunity to live it. Very few want to have any 
more say than necessary for that. 

Bombardment: You are raising a lot of issues for yourself that must be 
clarified before we can really get into this. What do you mean by “getting 
what one needs to live has no bearing logically on how well he is 
represented ...”? Who said anything about not letting citizens have their say in 
trying to influence government policies? On what grounds do you imply that 
people will get what they need through democratic politics? Are there to be 
no limits on the accumulation of government functions? And those are just a 
few of the matters needing clarification. 

Disarming: Your arguments are rather new to me and interesting, though I 
have discussed this matter with quite a few people. I’ll wager that if we really 
got into it deeply, we’d find that there’s no fundamental disagreement between 
us. Let’s try that next time we get a chance. 


E. PREPARING AN OFFENSIVE 


The first prerequisite to a successful campaign of persuasion is knowledge 
of the opponent: strategic intelligence. One would best know what his 
opponent believes on the issue of interest, on what grounds he holds these 
beliefs, how these beliefs are important to him, and how he will resist the 
conversion of these beliefs to one’s intended substitutes for them. The 
acquisition of this information, as well as one’s prior dealings with the opponent, 
constrain the framework of argument and give the opponent opportunities 
for his own preparations. Care must be taken very early to avoid being 
sabotaged, disarmed, or successfully counterattacked, by checking one’s own 
arguments and evidence, avoiding compromising relationships with the 
opponent, and exercising security (camouflage) precautions, respectively. Also, 
incidents that may precipitate reinforcement, or worse, fortification, must be 
avoided. 

Although it may seem premature, the prospects for the opponent’s defending 
the substituted beliefs, should one succeed in establishing them, must be 
explored. They may not be viable in the cognitive structure into which they are 
implanted, or they may be subject to attacks by other parties. If the prospects 
for their survival are not favorable, the effort required for establishing them 
may be wasted. Furthermore, some opportunity and resources must be avail- 
able for eradicating flaws in the substituted beliefs and reinforcing them 
where necessary. 


F. SUMMARY 


Several offensive and defensive strategies in a third person psychology of 
argument are presented and differentiated in terms of their indirection, decep- 
tion, and induction of action. 
ANAGRAM SOLVING AND SAT PERFORMANCE* 


Herbert H. Lehman College of the City University of New York 


Edward I. Gavurin 


A. INTRODUCTION 


An earlier study (1) investigating the relationship between anagram solving 
and mental ability revealed that the solution of anagrams was more strongly 
associated with nonverbal than with verbal skills. Among other things, the 
study reported a stronger relationship between performance on an anagram 
test and the Numerical Ability subtest of the Differential Aptitude Test 
(DAT), than between the anagram test and the Verbal Reasoning subtest of 
the DAT. More recently, Maier and Casselman (2) have made a similar 
comparison between the ability to solve verbal insight problems and 
performance on both the math and verbal forms of the Scholastic Aptitude Test 
(SAT). Their study yielded significantly positive correlations between 
problem solving ability and SAT math performance, while the SAT verbal 
scores were found to be unrelated to problem solution. Both anagram solving 
and the solution of verbal insight problems have therefore been shown to be 
more closely related to nonverbal numerical skills than to verbal ability. 

To evaluate further the relationship of verbal and numerical skills to 
problem solving, and to determine the consistency of the earlier findings 
mentioned above, the present investigation undertook to relate anagram 
solving to the math and verbal forms of the SAT. 


B. METHOD 
1. Tests 


a. Anagram test. A 60-item, five-letter, single solution anagram test was 
employed. The test and its method of administration have been described in 
an earlier report (1, pp. 227-228). 

b. Scholastic Aptitude Test. This test was utilized in the present study 
because it provided both a math and a verbal score for each student and be- 
cause these scores were available from college records. 


* Received in the Editorial Office on April 12, 1972, and published immediately at 
2. Subjects and Procedure 


The subjects were 61 male and 152 female undergraduate college students 
enrolled in psychology classes. Group procedure was employed in administer- 
ing the anagram test, utilizing intact classes in their respective classrooms. 
The SAT scores, already available for each subject, were later obtained from 
the college records. 


C. RESULTS AND DISCUSSION 


The Pearson correlations between the anagram test and each of the SAT 
forms (Math and Verbal) are presented in Table 1. 

Although the correlations were relatively low, it is nevertheless evident 
that for all comparisons (males alone, females alone, and the two groups 
combined), a stronger relationship was obtained between anagram solving 
and SAT (Math) than between the anagram test and the verbal form of 
the SAT. For the male group, however, the difference in degree of association 
of the two SAT forms with the anagram test was sharpest, since SAT 
(Math) was found to be positively and significantly related to anagram 
solving, whereas no significant correlation was obtained between anagram 
solving and SAT (Verbal). 

Unlike Maier and Casselman’s finding that males are superior to females 
in the solution of insight problems, the present study yielded no statistical 
difference between men and women in the ability to solve anagrams. In fact, 
the means and standard deviations for the two groups were roughly compar- 
able, yielding a t value of well below unity. Nevertheless, in spite of the fact 
that different problem tasks and aptitude tests were used, the present study, 
the earlier Gavurin study (1), and the Maier and Casselman study (2) are 
clearly in agreement with the rather curious general finding that the solution 
of verbal problem solving tasks thus far investigated is more strongly related 
to mathematical than to verbal ability. In view of these apparently consistent 
findings, it would appear desirable to continue research with other types of 
problems in order to determine the generality of the reported relationships. 


TABLE 1 
PEARSON CORRELATIONS BETWEEN ANAGRAM TEST AND SAT SCORES 

Comparison                       Males       Females       Males and females 
                                 (N = 61)    (N = 152)     (N = 213) 
Anagram test vs. SAT (Math)      .28*        .27*          .26* 
Anagram test vs. SAT (Verbal)    .13 (NS)    [illegible]   .18* 

* p < .01. 
D. SUMMARY 


The present study, utilizing undergraduate college students as subjects, 
investigated the relationship between anagram solving and performance on 
both the math and verbal forms of the SAT. The results revealed that for 
males, females, and the two groups combined, a stronger degree of association 
was present between anagram solving and SAT (Math), than between the 
former and SAT (Verbal). These results were found to be consistent with 
those of previous studies that have utilized either different verbal problems or 
different measures of numerical and verbal ability. Further research was 
recommended to test the generality of the findings thus far reported to the 
solution of other types of problem solving tasks. 
BEHAVIORAL CONTRAST IN MIXED AND MULTIPLE 
FIXED RATIO SCHEDULES* 


College of St. Teresa 


John O. Kino 


A. INTRODUCTION 


Behavioral contrast describes a condition in which rates of responding in 
the presence of two different stimuli change in opposite directions. The con- 
trast effect appears to be the result of a worsening in conditions of one com- 
ponent of a multiple schedule in relation to the other component (1). 

Mixed schedules are similar to multiple schedules except that there are no 
exteroceptive stimuli associated with the different schedule components. 
Several studies have shown that mixed schedules can control differential rates 
of responding in a two component schedule (2, 4). However, no detailed 
analyses of acquisition in mixed and multiple schedules have been undertaken. 
In the present study, acquisition of behavioral contrast was compared in mixed 
and multiple schedules. 


B. METHOD 
1. Subjects 


The subjects (Ss) were 12 Sprague-Dawley rats maintained at 85% of 
their free feeding weight. The Ss were between 120 and 160 days old at the 
beginning of discrimination training. 


2. Apparatus 


A Scientific Prototype rat chamber containing one lever was located inside 
a sound-attenuating chamber. The chamber was ventilated by a 110-volt AC 
fan which also served as a masking stimulus. A 5-watt light, a discriminative 
stimulus (S^D) for the FR 3 component in the multiple schedule, was located 
at floor level on the front panel of the chamber. Programming and data 
collection were in adjacent rooms. The FR programmer was reset following 
each two-minute cycle, so that reinforcements did not accumulate. 
3. Procedure 


After shaping, the Ss were given 100 reinforcements on continuous 
reinforcement. This was followed by training on either a mixed or multiple 
schedule. Both the mixed and multiple schedules contained two FR 
components presented successively to the S. However, in the multiple schedule, 
a light (S^D) was present when the higher density FR 3 reinforcement 
component was present, and absent when the lower density FR 33 component was 
present. Both mixed and multiple schedules consisted of eight two-minute 
FR 3 components and eight two-minute FR 33 components. Training on the 
mixed and multiple schedules continued for 10 sessions. During these sessions 
the rates of responding were recorded in both the FR 3 and FR 33 
components. 


C. RESULTS 


The mean rates of responding for Ss on mixed and multiple schedules are 
presented in Figure 1. Inspection of Figure 1 indicates that by the tenth 
session the Ss on the multiple schedule attained the same rate of responding 
in the FR 3 component as did the Ss on the mixed schedule (27.50 responses 
per minute or 9.18 reinforcements per minute for the mixed schedule Ss, as 
compared to 27.45 responses per minute or 9.15 reinforcements per minute 
for the multiple schedule Ss). The acquisition data were analyzed by a two 
way analysis of variance with repeated measures on the sessions variable. The 
analysis revealed that the type of training—mixed or multiple—had no effect 
on acquisition of responding in the FR 3 components (F = .15, df = 1/10, 
p > .01). A significant sessions main order effect (F = 33.47, df = 9/90, 
p < .01) revealed that learning occurred across sessions. A significant training 
by sessions interaction (F = 5.18, df = 9/90, p < .01) indicated that the 
slopes of the FR 3 acquisition curves were different for the mixed and 
multiple schedules. 
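
The degrees of freedom reported above can be checked against the design described in the Method section: 12 Ss, two training conditions, and 10 sessions, with repeated measures on sessions. The following is a minimal sketch of the standard partition for such a design; the function name and dictionary keys are hypothetical, and the sketch reproduces only the degrees of freedom, not the F ratios.

```python
def mixed_anova_df(n_subjects, n_groups, n_sessions):
    """Degrees of freedom (numerator, denominator) for a groups x sessions
    design with repeated measures on the sessions factor."""
    # Error term for the between-subjects factor: subjects within groups.
    between_error = n_subjects - n_groups
    # Error term for the within-subjects effects: sessions x subjects within groups.
    within_error = between_error * (n_sessions - 1)
    return {
        "training": (n_groups - 1, between_error),
        "sessions": (n_sessions - 1, within_error),
        "training_x_sessions": ((n_groups - 1) * (n_sessions - 1), within_error),
    }


# Values taken from the Method section: 12 rats, mixed vs. multiple, 10 sessions.
df = mixed_anova_df(n_subjects=12, n_groups=2, n_sessions=10)
for effect, (numerator, denominator) in df.items():
    print(f"{effect}: df = {numerator}/{denominator}")
```

Under these assumptions the partition gives 1/10 for the training effect and 9/90 for both the sessions effect and the interaction, which agrees with the values in the text.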

Analysis of the rates of responding in the FR 33 components indicated that 
by the tenth session the rates of responding were (a) in the mixed schedule 
5.11 responses per minute or .15 reinforcements per minute and (b) in the
multiple schedule 1.75 responses per minute or .05 reinforcements per minute. 
A two way analysis of variance with repeated measures on the sessions vari- 
able was conducted. The analysis indicated that the type of training had an 
effect on acquisition of responding for the FR 33 components (F = 17.98, 
df = 1/10, p < .01). In other words, the rates of responding in the FR 33 
component were significantly lower in acquisition of the multiple schedule 
than in acquisition of the mixed schedule. The analysis also indicated a 
significant sessions effect (F = 43.96, df = 9/90, p < .01) and a significant
sessions by training interaction (F = 21.90, df = 9/90, p < .01). The
significant sessions effect indicates that learning occurred, while the interaction 
indicates that the slopes of the FR 33 curves for mixed and multiple schedules 
were different across acquisition. 
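The reinforcement rates quoted in these results follow from the response rates by simple division: on a fixed-ratio (FR) schedule every nth response is reinforced, so reinforcements per minute equal responses per minute divided by the ratio. The Python sketch below reproduces that arithmetic for the tenth-session values. The function name is illustrative and does not come from the original report.

```python
def reinforcement_rate(responses_per_min: float, ratio: int) -> float:
    """Reinforcements per minute on a fixed-ratio (FR) schedule.

    On FR n, one reinforcement is delivered for every n responses,
    so the reinforcement rate is the response rate divided by n.
    """
    if ratio <= 0:
        raise ValueError("ratio must be a positive integer")
    return responses_per_min / ratio


# Tenth-session response rates as reported in the text.
fr3_multiple = reinforcement_rate(27.45, 3)   # FR 3, multiple schedule
fr33_mixed = reinforcement_rate(5.11, 33)     # FR 33, mixed schedule
fr33_multiple = reinforcement_rate(1.75, 33)  # FR 33, multiple schedule

print(round(fr3_multiple, 2), round(fr33_mixed, 2), round(fr33_multiple, 2))
```

Dividing the mixed-schedule FR 3 rate of 27.50 by 3 gives about 9.17 rather than the 9.18 reported, a small discrepancy that presumably reflects rounding of the underlying means.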


D. Discussion 


The data suggest that rats learn to discriminate the ratio components in a 
mixed schedule almost as readily as in a multiple schedule. In acquisition the 
differences between mixed and multiple schedules appear in the FR 33
components. This is not entirely unexpected, since an external stimulus marks
the various components in a multiple schedule. In a mixed schedule, inasmuch 
as the durations of the ratios are uncertain to the rat, a test must be made. 
Three responses in succession indicate if the smaller ratio is operating. Because
of this continual testing for the smaller ratio in the mixed schedule, the rat 
will make at least three responses in a two-minute period (1.5 responses per 
minute), as compared to a rat in the multiple schedule which will not have 
to respond at all in order to find which FR component is present. This
analysis finds some support from Sadowsky (3). Sadowsky suggests that 
mixed schedules that produce a great deal of differential responding will be 
learned quickly. However, if the mixed schedule produces a small amount of 
differential responding, the mixed schedule will be learned slowly. Since the 
present study employed a schedule that generated a great amount of dif- 
ferential responding, the discrimination was learned quickly. 
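The lower bound on responding described in this discussion is a one-line calculation: if a rat must emit a fixed number of probe responses in each component to learn which ratio is in force, its response rate cannot fall below probes divided by component length. A minimal sketch, with a hypothetical function name, is:

```python
def minimum_probe_rate(probe_responses: int, component_minutes: float) -> float:
    """Lowest possible response rate when every component must be probed."""
    if component_minutes <= 0:
        raise ValueError("component length must be positive")
    return probe_responses / component_minutes


# Three probe responses per two-minute component, as in the text.
floor_rate = minimum_probe_rate(3, 2.0)

# Tenth-session FR 33 rates reported in the results.
mixed_rate, multiple_rate = 5.11, 1.75
print(floor_rate, mixed_rate >= floor_rate)
```

The floor applies only to the mixed schedule. In the multiple schedule the light already signals the component, so the rate there is not constrained by any need to probe.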


E. Summary 


Behavioral contrast was investigated with rats in both mixed and multiple 
FR 3 and FR 33 schedules. Multiple schedules are similar to mixed schedules 
except that they contain an exteroceptive stimulus. The Ss were placed on 
either a mixed or a multiple schedule for 10 days, and rates of responding 
were collected and analyzed. The results indicated that rates of responding 
were similar in the FR 3 components for both mixed and multiple schedules. 
However, rates of responding were significantly lower in the FR 33 com- 
ponents of the multiple schedule than in the mixed schedule. Differences in 
rates of responding in the FR 33 components between mixed and multiple 
schedules suggest that in a mixed schedule the rat must continually test by 
lever pressing to determine which component of the schedule is in effect, 
whereas in the multiple schedule the rat is provided with this information 
from an exteroceptive stimulus. 
ANAGRAM SOLVING METHODS: THE EXPECTED
AND THE OBSERVED* 


Herbert H. Lehman College of the City University of New York


EDWARD I. GAVURIN


A. INTRODUCTION 


Observation of anagram solvers suggests that there is a considerable amount 
of variability among them in their approach to the solution of these problems. 
Some individuals, for example, prefer to use pencil and paper to rearrange 
the letters physically, while others prefer to do all of the rearranging men- 
tally. Similarly, there are some solvers who prefer to work continuously on 
each problem until it is solved, while others feel that it is beneficial to rest 
from an unsolved problem either by trying to solve another one in the interim, 
or by simply doing something else. Of course, in nonexperimental situations, 
most solvers do not use any of the above methods exclusively, but combine 
them with others to develop a highly idiosyncratic approach to the solution of 
each anagram.

Although little empirical evidence is available to evaluate the various 
problem solving methods, it is clear that solvers do have very specific pref- 
erences about their suitability. The present paper is therefore devoted to a 
report of both a survey, conducted to elicit opinion, and an experiment, 
designed to yield empirical data, regarding the effectiveness for anagram 
solving of (a) physical vs. mental letter manipulation and (b) massed vs. 
distributed solution effort. 


B. The Survey Study


A short four-item questionnaire was administered to 58 students in four 
different undergraduate psychology classes. Before the questionnaire was 
answered, the subjects were given a brief description and an example of an 
anagram problem in order to insure that the situations to which the questions 
referred were clear. The subjects were then asked to indicate under which 
of the two alternate solution methods presented in each question they would 
do better at solving anagrams. The questions and results are presented in 
Table 1. The chi-square analysis of the results, based upon the assumption
of an equal split between the expected frequencies, reflects a highly significant
preference for all methods that permit the physical manipulation of letters
(Questions 2, 3, and 4). No significant preference was observed, however, for
working with or without rest periods, indicating that the hypothesis of an
equal split between these two alternatives was clearly acceptable.


* Received in the Editorial Office on April 26, 1972, and published immediately at
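The survey analysis tests each two-alternative question against an equal split of the 58 respondents. A minimal Python sketch of that chi-square goodness-of-fit calculation is given below. The response counts in it are hypothetical, chosen only for illustration, because the frequencies of Table 1 are not reproduced in this text. The critical values are the standard ones for one degree of freedom.

```python
def chi_square_equal_split(count_a: int, count_b: int) -> float:
    """Chi-square statistic (df = 1) for two counts against a 50/50 split."""
    total = count_a + count_b
    if total <= 0:
        raise ValueError("at least one response is required")
    expected = total / 2
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected


CRITICAL_05 = 3.841  # chi-square critical value, df = 1, alpha = .05
CRITICAL_01 = 6.635  # chi-square critical value, df = 1, alpha = .01

# Hypothetical counts for 58 respondents (NOT the data of Table 1).
strong_preference = chi_square_equal_split(45, 13)
no_preference = chi_square_equal_split(31, 27)

print(strong_preference > CRITICAL_01, no_preference > CRITICAL_05)
```

With 58 respondents the expected frequency is 29 per alternative, so a split such as 45 to 13 is well beyond the .01 critical value, while 31 to 27 is nowhere near the .05 value.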


C. The Experimental Study
1. Subjects 


The subjects were 40 volunteer students from undergraduate psychology 
classes.


2. Materials 


The problem solving material consisted of 10 six-letter, single solution 
anagrams which were presented on individually lettered tiles approximately 
3/4 inch by 3/4 inch in size. 


3. Procedure 


The subjects were randomly assigned and individually tested under one of 
the following four experimental conditions. Ten subjects served under each
of the conditions. 

Under Condition I, or the Massed Practice Nonmanipulation Condition 
(MN), each of the anagrams was presented once for a maximum of 180 
seconds. The subject was required to work continuously on each problem
without manipulating the tiles, and to call out the solution as soon as it 
occurred. To prevent letter manipulation, the lettered tiles of each anagram 
were taped together, making it impossible to rearrange them physically. Each 
succeeding anagram in the list was presented as soon as the subject
had either solved the preceding one or worked on it for the maximum
180-second time limit. The time to solve each anagram and the number of
solutions achieved were recorded.

Condition II, or the Massed Practice Manipulation Condition (MM),
was identical to Condition I except for the fact that the letters of each
anagram were not fastened together, permitting free manipulation and
rearrangement of the tiles.

Condition III, or the Distributed Practice Nonmanipulation Condition
(DN), provided for 30-second anagram solving work periods, interspersed
with 15-second rest periods. As in Condition I, the subject was required to 
solve each anagram without manipulating the letters. To prevent rehearsal 
during the rest interval, the anagram was covered with a cardboard shield, 
and the subject was required to count aloud backwards by sevens from
different three-digit numbers provided by the experimenter at the beginning of
each interval. A maximum of six 30-second work periods was permitted for 
each anagram to insure that the total possible work time allotted was the 
same as that provided under the other conditions. Similarly, a new anagram 
was presented as soon as the previous problem had been solved or upon the 
completion of the sixth and final work period. The number of anagrams 
solved and the time to solve each anagram were recorded. 

Condition IV, or the Distributed Practice Manipulation Condition (DM), 
was identical in all respects to Condition III except for the fact that the 
subject was permitted to manipulate the letters as in Condition II. 
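The equivalence of work time across the massed and distributed conditions can be checked with a few lines of arithmetic. The sketch below uses the durations stated in the procedure. The assumption that rest periods fall only between work periods, and never after the last one, is ours and is not stated in the text.

```python
WORK_PERIOD_S = 30     # length of one distributed work period (from the text)
REST_PERIOD_S = 15     # length of one rest period (from the text)
MAX_WORK_PERIODS = 6   # maximum work periods per anagram (from the text)
MASSED_LIMIT_S = 180   # time limit per anagram in the massed conditions


def total_work_time(periods: int = MAX_WORK_PERIODS,
                    work_s: int = WORK_PERIOD_S) -> int:
    """Maximum seconds actually spent working on one anagram."""
    return periods * work_s


def max_elapsed_time(periods: int = MAX_WORK_PERIODS,
                     work_s: int = WORK_PERIOD_S,
                     rest_s: int = REST_PERIOD_S) -> int:
    """Maximum seconds from first exposure to the end of the last period.

    Assumption: a rest follows every work period except the final one.
    """
    return periods * work_s + (periods - 1) * rest_s


print(total_work_time() == MASSED_LIMIT_S, max_elapsed_time())
```

Under that assumption an unsolved anagram occupies at most 255 seconds of elapsed time in the distributed conditions, of which 180 seconds are work, the same working time as in the massed conditions.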


4. Results 


The two performance measures for each subject (median solution time and
number of anagrams solved) were subjected to a two-way analysis of variance.
Neither measure yielded a significant effect of distribution, of manipulation,
or of their interaction. For solution time, the values for the distribution
effect (MN and MM vs. DN and DM), the manipulation effect (MM
and DM vs. MN and DN), and the interaction effect were F (1, 36) =
.18, F (1, 36) = 1.02, and F (1, 36) = 2.29, respectively. The
corresponding F values obtained from the analysis employing the number of
solutions as the performance measure were F (1, 36) = .19, F (1, 36) =
1.73, and F (1, 36) = 2.13.
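The degrees of freedom reported here follow from the design: four cells of ten subjects give 40 - 4 = 36 error degrees of freedom, and each effect in a 2 x 2 layout has one. The Python sketch below shows one way a balanced 2 x 2 between-subjects analysis of variance can be computed. It is not the author's procedure, the function names are illustrative, and the small data set is invented solely to exercise the code.

```python
from statistics import mean
from typing import Dict, List, Tuple

Cell = Tuple[int, int]  # (level of factor A, level of factor B), each 0 or 1


def error_df(n_per_cell: int, n_cells: int = 4) -> int:
    """Error degrees of freedom for a balanced between-subjects design."""
    return n_cells * n_per_cell - n_cells


def two_way_anova(cells: Dict[Cell, List[float]]) -> Dict[str, float]:
    """F ratios for a balanced 2 x 2 between-subjects design."""
    if set(cells) != {(0, 0), (0, 1), (1, 0), (1, 1)}:
        raise ValueError("exactly four cells, keyed (0|1, 0|1), are required")
    n = len(cells[(0, 0)])
    if n < 2 or any(len(v) != n for v in cells.values()):
        raise ValueError("cells must be equal in size, with n >= 2")

    grand = mean([x for v in cells.values() for x in v])
    cell_mean = {k: mean(v) for k, v in cells.items()}
    a_mean = [mean(cells[(a, 0)] + cells[(a, 1)]) for a in (0, 1)]
    b_mean = [mean(cells[(0, b)] + cells[(1, b)]) for b in (0, 1)]

    ss_a = 2 * n * sum((m - grand) ** 2 for m in a_mean)
    ss_b = 2 * n * sum((m - grand) ** 2 for m in b_mean)
    ss_cells = n * sum((m - grand) ** 2 for m in cell_mean.values())
    ss_ab = ss_cells - ss_a - ss_b
    ss_err = sum((x - cell_mean[k]) ** 2 for k, v in cells.items() for x in v)

    df_err = error_df(n)
    ms_err = ss_err / df_err  # each effect has df = 1, so MS effect = SS effect
    return {"F_A": ss_a / ms_err, "F_B": ss_b / ms_err,
            "F_AB": ss_ab / ms_err, "df_error": df_err}


# Invented scores, two per cell, used only to exercise the function.
toy = {(0, 0): [1, 3], (0, 1): [2, 4], (1, 0): [3, 5], (1, 1): [4, 6]}
result = two_way_anova(toy)
print(result, error_df(10))
```

For comparison, the .05 critical value of F with 1 and 36 degrees of freedom is about 4.11, so none of the six reported F values approaches significance.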


D. Discussion 


The marked disparity between intuitive judgment and experimental fact 
is clearly evident in a comparison of the results of the present survey and 
experiment with respect to the effectiveness of physically manipulating ana- 
gram letters. Hence, while the strong statistically significant preference for 
overt letter manipulation, based upon personal judgment, suggests that the 
method is very compelling intuitively, the experimental data provide no
evidence for its superiority over a purely mental method of manipulating
anagram letters. The latter result is in accord with the results of an experiment
by Groezinger reported by Johnson (3), which failed to produce a difference 
in anagram solving when one group of solvers was permitted to use pencil 
and paper while the other was not. On the other hand, the absence of a clear- 
cut preference for massed or distributed practice in the survey data parallels 
the experimental result that neither method is superior to the other. This 
finding is in accord with previous studies both on anagrams (1) and on
problems involving deductive reasoning (4). It also serves to support a suggestion
made in an earlier study (2), that unlike the solution of mechanical puzzles 
where massed practice clearly facilitates solution, verbal problems may be 
unaffected by conditions of distribution. 

It seems appropriate to conclude from the results of the present report that 
intuitive judgment is an unreliable measure of the effectiveness of problem 
solving methods. More specifically, the illusory superiority of physical letter 
manipulation in anagram solving serves to underscore both (a) the
limitations of introspective reporting in such situations and (b) the attendant
difficulties of relying upon such information to discover the essential nature 
of the problem solving process. 


E. Summary 


The present paper reported the results of both a survey, conducted to 
elicit opinion, and an experiment, designed to provide empirical data, re- 
garding the effect upon anagram solving of (a) physical letter manipulation 
and (b) distribution of solution effort. While the survey data yielded a
highly significant preference for the physical manipulation of letters in solving 
anagrams, the experimental data failed to demonstrate the superiority of this 
method over a purely mental method of manipulating letters. On the other 
hand, the survey data, which yielded no significant preference for massed or 
distributed practice in anagram solving, were in accord with the experimental
findings. It was concluded that intuitive judgment is an unreliable measure 
of the effectiveness of problem solving methods. 
A CROSS-CULTURAL INVESTIGATION OF THE
RELATIONSHIP BETWEEN CONSERVATISM 
AND PERSONALITY* 


University of Cape Town, South Africa 


CHRISTOPHER ORPEN 


A. INTRODUCTION


The main finding of The Authoritarian Personality (1), according to the 
authors, was that the various social attitudes of an individual form a broad 
and coherent pattern which is itself an expression of deep-lying personality 
trends. In their monumental study they obtained correlations of the order of 
.80 between the social attitudes of ethnocentrism and antisemitism, as
measured by the E scale and the A-S scale, and the personality variable of
authoritarianism, as measured by the F scale. Working within this frame- 
work, McClosky (4) also found a close relationship between the social at- 
titude of conservatism and the personality variable of adjustment-maladjust- 
ment. In his study individuals who held conservative views obtained signifi- 
cantly higher scores than individuals who held liberal views on scales of 
“clinical” and “psychiatric” personality traits. This led him to claim that 
“compared to liberals, conservatives tend to be submissive and alienated, as 
well as poorly integrated psychologically” (p. 43). 

McClosky’s study was conducted in the relatively “liberal” climate of the 
northern United States, in which the holding of extremely conservative views 
receives cultural disapproval. Since there is a growing amount of evidence that 
the relationship between social attitudes and deeper personality trends is 
largely a function of whether the prevailing cultural norms explicitly sanction 
the particular social attitudes (5, 6), a different relationship between con- 
servatism and adjustment should be found in the relatively “conservative” 
climate of white South Africa, in which the holding of extremely “liberal” 
views receives cultural disapproval (3, 9), than was found in the northern 
United States. 

B. Method


To test this hypothesis 69 middle-class English-speaking schoolchildren in 
their final year of school (average age 17.5 years) were given measures of 
prejudice and conservatism and a measure of personal adjustment. Prejudice
was measured by a form of the Bogardus Social Distance scale, which tapped 
the subjects’ social distance attitudes toward the various minority groups in 
South Africa (8). Conservatism was measured by a 20-item Conservatism
scale, specially constructed by Orpen (7) for use in white South Africa, 
while personal adjustment was measured by the Alienation, Social Non- 
conformity, and Discomfort subscales of the Psychological Screening Inven- 
tory (PSI), developed by Lanyon (2). The latter two scales require further 
comment.

In the construction of the Conservatism scale, 50 items, taken mainly from 
previous scales of conservatism, were given to three judges who stated whether 
the issues covered by each of the items were “relevant” or “irrelevant” in 
South Africa today. Of these 50 items, 38 were regarded as covering “rele- 
vant” issues by all three judges. These 38 items were then given to two 
criterion groups, consisting of 40 supporters of the prosegregationist United 
Party and 40 supporters of the prointegrationist Progressive Party. In a pilot 
study the United Party group had obtained a significantly higher (p < .01), 
more conservative, mean score on a six-point self-rating scale of liberalism- 
conservatism. When these two criterion groups completed a questionnaire 
consisting of these 38 items presented in a six-point Likert-style format, 20 
were found to distinguish between the groups in the appropriate direction at 
the .001 level. These 20 items were then cross-validated on a fairly hetero- 
geneous sample of 62 English-speaking schoolchildren in their final year of 
school, who also completed the Wilson-Patterson Conservatism scale (10) 
and a six-point self-rating scale of liberalism-conservatism, and indicated their 
preference for the relatively “conservative” United Party or the relatively “liberal”
Progressive Party. In this cross-validation study, each of the items correlated 
significantly (p < .01) with this total 20-item scale, while the subjects
falling in the top quartile of the distribution of total scores obtained signifi- 
cantly higher (p < .01) mean scores on each of the items than those falling 
in the bottom quartile. In addition, the total scale correlated significantly 
with the Wilson-Patterson Conservatism scale (r = .41, p < .01) and with
the subjects’ self-ratings (r = .38, p < .01), while the mean total score of 
the United Party subgroup was significantly higher (p < .01) than that of
the Progressive Party subgroup. 

The subscales of the PSI, which have been shown to be highly related to 
the corresponding subscales of the Minnesota Multiphasic Inventory, were 
empirically developed to measure, respectively, the similarity of the respon- 
dent to hospitalized psychiatric patients (alienation), to incarcerated prison 
inmates (social nonconformity), and to classified neurotics (discomfort or
anxiety). In addition to scores on these three indicators of personal adjust- 
ment, a total PSI score was obtained for each subject by summing his 
responses to all the items in the three subscales. 


C. Results


The correlation between adjustment, as given by the total PSI scores, and 
conservatism was significant at the .01 level (r = .65), as was that between
adjustment, as given by the total PSI scores, and prejudice (r = 433). The 
first significant correlation indicates that, at least in the present sample, there 
is a relationship between the holding of conservative views in present-day 
white South Africa and personality adjustment, with the more conservative 
subjects being better adjusted. The second significant correlation indicates 
that there is also a relationship between the holding of prejudiced views in 
present-day white South Africa and personality adjustment, with the more 
prejudiced subjects being better adjusted. The close relationship between con- 
servatism and adjustment is further borne out by the fact that the negative 
correlations between alienation and conservatism (r = -.33), social
nonconformity and conservatism (r = -.66), and discomfort (anxiety) and
conservatism (r = -.50) were all significant at the .01 level. Although the
negative correlations between alienation and prejudice (r = -.15) and
between discomfort (anxiety) and prejudice (r = -.17) were not significant
at the .05 level, the correlation between social nonconformity and prejudice
(r = -.50) was significant at the .01 level.
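The significance levels attached to these correlations can be checked with the usual t transformation of a Pearson r, t = r * sqrt(n - 2) / sqrt(1 - r^2), on n - 2 degrees of freedom. The sketch below applies it to the sample of 69. The report does not say which test was used or whether it was two-tailed, so the two-tailed critical values quoted (roughly 2.00 at the .05 level and 2.65 at the .01 level for 67 degrees of freedom) are assumptions made for illustration.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """t statistic (df = n - 2) for testing a Pearson correlation against zero."""
    if n < 3:
        raise ValueError("at least three observations are required")
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


N = 69                 # sample size reported in the text
T_CRIT_05 = 2.00       # approximate two-tailed critical t, df = 67, alpha = .05
T_CRIT_01 = 2.65       # approximate two-tailed critical t, df = 67, alpha = .01

for label, r in [("alienation / conservatism", -0.33),
                 ("alienation / prejudice", -0.15),
                 ("discomfort / prejudice", -0.17)]:
    t = t_from_r(r, N)
    print(f"{label}: t = {t:.2f}, beyond .01 value: {abs(t) > T_CRIT_01}")
```

On these assumptions the coefficient of -.33 exceeds the .01 critical value, while -.15 and -.17 fall short of the .05 value, which agrees with the pattern reported.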

Three other subsidiary results are worthy of note. Conservatism and pre- 
judice, which both reflect socially approved views in white South Africa, were 
positively correlated (r = .62, p < .01). The mean scores of this fairly
typical high school sample on the three subscales of the PSI did not differ 
significantly (p > .05) from those of Lanyon’s normative United States
sample of the same age, suggesting that the two samples are roughly the same 
with regard to level of adjustment, as given by the PSI. The three subscales 
of the PSI correlated with each other at beyond the .01 level, suggesting that 
the scale is tapping a fairly general dimension (adjustment) rather than a 
number of independent factors. 

At a more general level, the results suggest that the relationship between 
conservatism and personality adjustment varies with the cultural background 
of the group concerned; i.e., that McClosky’s findings are culture-bound. If
it is assumed that the PSI is a fairly valid indicator of adjustment, the results 
suggest, in sharp contrast to McClosky’s findings, that in certain settings the 
holding of fairly conservative and prejudiced views may actually be an
indication of personality adjustment.


D. SUMMARY 


To investigate the relationship between personal adjustment and the social 
attitudes of conservatism and prejudice in a fairly “conservative” cultural 
setting (white South Africa), 69 English-speaking white schoolchildren were 
given the Psychological Screening Inventory to measure adjustment, a social 
distance scale to measure prejudice, and a specially constructed South African 
conservatism scale. The significantly positive correlations between adjustment 
and both conservatism and prejudice were taken as indicating that, in certain 
settings, conservatism and prejudice may be related to personal adjustment. 
This result differs markedly from those found in fairly “liberal” settings and 
emphasizes the importance of cross-cultural studies in this area. 
DIFFERENTIAL T-GROUP BEHAVIORS OF HIGH AND LOW
DOGMATIC PARTICIPANTS* 


Memphis State University; and Frye, Timmons, and Associates, Memphis, Tennessee 


ROLAND L. FRYE, ROBERT N. VIDULICH,¹ BARBARA MEIERHOEFER,
AND SYLVIA A. JOURE


A. INTRODUCTION 


Although sensitivity training is a “potentially powerful tool for changing 
behavior” (4, p. 20), researchers have had difficulty verifying predictions 
about behaviors within the group (3, 9) and hypotheses derived from theoret- 
ical points of view: McGregor’s Theory X-Theory Y (13, 15), Riesman’s 
inner vs. other directedness (7), and Ohio State’s leadership dimensions of 
consideration and initiating structure (8). 

Basic differences in life-style, personality, emotional adjustment, and adap- 
tion of defense mechanisms as theorized by Rokeach (10, 11) and others (1, 
2, 12, 14) suggest that high dogmatics (HDs) and low dogmatics (LDs) 
will behave differentially in a T-group and that the HDs will benefit less from 
a sensitivity training experience. Therefore, in comparison to the T-group 
behaviors of HDs, the LDs will be predicted to make fewer statements 
rejecting the experimental situation and more self, central, situational, positive, 
and supportive statements, and to exhibit fewer behavioral manifestations of 
anxiety, annoyance, or boredom. 


B. Method
1. Subjects


Two hundred sixty-two male Memphis State University students enrolled 
in introductory psychology were administered Rokeach’s Dogmatism Scale, 
Form E (10). Those males who scored one standard deviation above or one 
standard deviation below the mean of the group were asked to volunteer to 
participate in a T-group for which extra credit was given only if the S came 
for all sessions. The Ss were randomly assigned to two experimental groups 
as follows: HDs, N = 7; and LDs, N = 9. 
2. Procedure


a. Training of nonparticipant raters. Two psychology graduate students 
became familiar with the rating categories, to be described later, which were 
derived from Rokeach’s theory of dogmatism. The raters sat apart, outside, 
and behind the T-group members and did not participate in the groups’ ses- 
sions. 

b. The T-group. The HD group met in a classroom with hardback chairs 
arranged in a tight circle. The two trainers were nondirective because of the 
expected differential reactions of high and low dogmatics to authority figures. 
The group lasted from 3 p.m. to 6 p.m. (Session 1), broke for dinner, and met 
again from 7 p.m. until 10 p.m. (Session 2). The same format was adhered
to for the second day of sensitivity training (Sessions 3 and 4, respectively).
On the following two days, the LD group experienced a T-group using the 
same format. The raters sat outside the group and did not take part in any of 
the group interactions. 

c. Definitions and examples of coding categories. The following are the 
categories that the nonparticipant observers used to code the behaviors of the 
T-group members. The theoretically derived form was modified with ex- 
perimental use. The edited form is given because of its clearer definitions and 
greater ties with the “real” experimental situation. Only the second (Speaker)
through the sixth (Situational-Nonsituational) categories were coded on every 
occasion. The others were used only when they were felt to be appropriate.

(1). Time. Time was noted every 15 minutes to aid an E in matching
the coding sheets of the two nonparticipant observers. 

(2). Speaker. Referred to the individual who made a comment. The 
number on the participant’s chair was used as the code. 

(3). Receiver. The individual, group, or outside referent to whom the 
speaker's comment was directed. The choice of codes included the number 
assigned to an individual’s chair or chairs, G when the speaker was directing 
his comments to the group, or O for a referent outside the group. 

(4). Self-Other. Self, S, referred to a comment about the speaker or a
statement that revealed something about him, even if it was of very general 
nature—e.g., “I went skiing at McKellar’—to statements that revealed in- 
formation of a psychological nature, such as “I don’t think there’s a world 
after death.” Other, O, was reserved for information referring to someone 
other than the speaker; e.g., “Mr. Clarksdale plays a lot of games with us.”

(5). Central-Peripheral. A central statement, C, referred to information
about an individual’s self-concept, view of the world, or general assumptions
about man (no matter whether the speaker was referring to himself or
giving another person feedback). Some examples of central statements were:
“I don’t feel anyone has the right to take another person’s life” (this referred
to a group’s discussion on abortion) or “I don’t say much until I really get 
to know people. Then I really talk.” 

A peripheral statement, P, referred to a specific act, person (except 
when central feedback was being given about the referent individual which 
requires the C code), object, or idea; e.g., “I like baseball,” “My father was 
a tyrant, unkind to Mom and me,” or “Spiro’s right.” 

(6). Situational-Nonsituational. If a statement referred to a here-and- 
now group interaction—e.g., “Why were you so hostile to John just now?” 
—or to an interaction that occurred sometime in the life of the group—e.g., 
“This morning you acted the same way when I told you that you’re passive- 
aggressive,”—the comment was coded S. If a statement referred to something 
outside of the group, it was considered nonsituational, NS; e.g., war, hobbies, 
school, classes, and politics. 

With use of the coding system, it later became clear that many statements 
were neither situational, within the here-and-now context, nor really non- 
situational, but somewhere in between. These statements were group-oriented 
or group-generated with a referent outside the group, but were produced by 
group interplay and reflected a change in the process of an interaction from 
truly nonsituational material. This category was also used when the group 
interaction was bordering on situational discussions or feedback, but was not 
yet down to the “nitty gritty.” (Among the raters, both types of statements 
were sometimes colloquially known as “garbage.”) Both uses were coded G.
The intended result of the G subcategory was to purify the S and NS
categories. No hypothesis pertaining to G was given in this study. Examples
of G statements were “How does your wife feel about your timidity?” and
“What do you mean the Vietnam war is a necessity to the economy of the
U.S.? Are you serious?”

(7). Positive-Negative. A positive statement clearly endowed the re- 
ceiver, self or other, with socially desirable “good” qualities; e.g., “I think 
you’re nice,” or “I like people.” A negative comment endowed the receiver, 
self or other, with socially undesirable qualities; e.g., “I’m stupid,” and
“You’re too nervous.” The code for a positive statement was +, and for a
negative one, -.

(8). Supportive-Nonsupportive. A supportive comment was one that was 
made in defense of or support of someone else’s statement; e.g., “I guess so,” 
or “I think you’re right.” A nonsupportive statement disagreed with another 
group member’s comment or the speaker’s own previous comment; e.g., “You
don’t know what you’re talking about,” “Don’t you think your knowledge of
the situation is too sparse to draw that conclusion?” “You would change your
mind if you’d been there,” or “Well, you’re probably right” (disagreement 
with speaker’s own previous statement). 

Another type of S’s action that was felt to reflect nonsupport of a speaker
was avoidance. When a speaker directed a comment or question to another 
individual who ignored it or began speaking to another person without having 
replied to the original speaker, an A for avoidance was coded in this column 
for the receiver. The A category was a later adopted change, and,
consequently, no hypothesis in this study pertains to this subcategory.

(9). Statements of Rejection of the Experimental Situation. This
category included negative statements about the group itself; e.g., “This
discussion seems rather meaningless to me,” or “If you don’t like what we are
doing, tell us what we are supposed to do! You’re the leader!” or “I don’t
see how you can mean that at all!” The observer simply checked (✓) the
category when he felt that a statement belonged in this column, and could
also use the content category to add other information considered relevant.

(10). Behaviors of Anxiety, Annoyance, or Boredom. The category 
referred to a display of anxiety, such as perspiration, tapping of hands or 
feet, or other nervous mannerisms (e.g., playing with or manipulation of a 
neck chain, watch, keys, or lighter), frequent change of posture, and chain 
smoking. A rater used the content category to describe the behavior more 
fully.

(11). Content. A word, phrase, or brief statement describing the
general content of the interaction was the principal use of this category; e.g.,
“Dave's showing his hostility again,” “feedback from trainer,” “garbage,” or 
“bull.” It could also be used to record anything of interest to an observer. 
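The eleven categories above amount to a record with several mandatory fields and several optional ones. One way such a coding sheet might be represented in Python is sketched below. The class, field names, and the labels SUP and NON are hypothetical (the text gives no letter codes for supportive and nonsupportive statements), and the codes assigned to the two example statements beyond those the text itself states are illustrative guesses.

```python
from dataclasses import dataclass
from typing import List, Optional

SELF_OTHER = {"S", "O"}              # (4) Self or Other
CENTRAL_PERIPHERAL = {"C", "P"}      # (5) Central or Peripheral
SITUATIONAL = {"S", "NS", "G"}       # (6) Situational, Nonsituational, or G
POSITIVE_NEGATIVE = {"+", "-", None}  # (7) optional
SUPPORT = {"SUP", "NON", "A", None}  # (8) optional; SUP/NON are hypothetical labels


@dataclass(frozen=True)
class CodedStatement:
    speaker: str                      # (2) chair number of the speaker
    receiver: str                     # (3) chair number(s), "G", or "O"
    self_other: str
    central_peripheral: str
    situational: str
    positive_negative: Optional[str] = None
    support: Optional[str] = None
    rejection: bool = False           # (9) checked when applicable
    anxiety_behavior: bool = False    # (10) checked when applicable
    content: str = ""                 # (11) free-text note

    def __post_init__(self) -> None:
        checks = [(self.self_other, SELF_OTHER, "self_other"),
                  (self.central_peripheral, CENTRAL_PERIPHERAL, "central_peripheral"),
                  (self.situational, SITUATIONAL, "situational"),
                  (self.positive_negative, POSITIVE_NEGATIVE, "positive_negative"),
                  (self.support, SUPPORT, "support")]
        for value, allowed, name in checks:
            if value not in allowed:
                raise ValueError(f"invalid code {value!r} for {name}")


def percent_with(statements: List[CodedStatement], field: str, code: str) -> float:
    """Percentage of statements whose given field carries the given code."""
    if not statements:
        return 0.0
    hits = sum(1 for s in statements if getattr(s, field) == code)
    return 100.0 * hits / len(statements)


sheet = [
    CodedStatement("3", "G", "S", "P", "NS", content="I went skiing at McKellar"),
    CodedStatement("5", "2", "O", "P", "S", content="Why were you so hostile to John just now?"),
]
print(percent_with(sheet, "self_other", "S"))
```

A helper such as `percent_with` corresponds to the kind of per-session percentage that the results section compares between groups.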


C. Results


The results of the ingroup data were analyzed in terms of the coding
categories. The percentages given in Table 1 are an average of those given by
both raters. A chi square analysis was computed between the High and Low
Dogmatic percentages for each session of each category. Table 1 contains a
summary of the statistics and indicates which value was used as the expected
value for the analysis. The intercoder reliability was calculated for each
dimension by means of a correlation between the eight percentages earned by
the coders for each category. Reliability coefficients are those obtained with
the original version of the coding system.
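The text does not spell out how each chi square value was obtained, but the legible entries of Table 1 are reproduced by a single-term statistic, (observed - expected)^2 / expected, computed on the two group percentages with the designated group supplying the expected value. The Python sketch below shows that inferred calculation. It should be read as a reconstruction under that assumption, not as the authors' stated method.

```python
def chi_square_term(observed_pct: float, expected_pct: float) -> float:
    """Single-term chi square on two percentages: (O - E)^2 / E."""
    if expected_pct <= 0:
        raise ValueError("expected percentage must be positive")
    return (observed_pct - expected_pct) ** 2 / expected_pct


# Standard chi-square critical values for one degree of freedom.
CRITICAL = {".05": 3.84, ".01": 6.63, ".001": 10.83}

# Self statements, Session 1: HD = 35.0, LD = 58.0, LD taken as expected.
self_session_1 = chi_square_term(35.0, 58.0)

# Self statements, Session 2: HD = 38.5, LD = 55.0, LD taken as expected.
self_session_2 = chi_square_term(38.5, 55.0)

# Rejection statements, Session 1: HD = 9.5, LD = 2.0, HD taken as expected.
rejection_session_1 = chi_square_term(2.0, 9.5)

print(round(self_session_1, 2), round(self_session_2, 2),
      round(rejection_session_1, 2))
```

These three results round to 9.12, 4.95, and 5.92, the values that can be read from the corresponding rows of Table 1.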

The first category considered was Self-Other. During Session 1 and Session
2, the LDs made significantly more self statements than did the HDs. The
difference in Session 1 was significant at the .01 level, and in Session 2, at
the .05 level. There were no significant differences between the two groups
during the third or fourth session. The intercoder reliability was .82.


TABLE 1
Chi Square Analyses Between Mean Percentage of Statements and Behaviors
Given by High Dogmatic (HD) and Low Dogmatic (LD) Subjects

Category and    HD mean % of    LD mean % of
session         statements      statements      Expected    Chi square

Self
  1             35.0            58.0            LD           9.12***
  2             38.5            55.0            LD           4.95*
  3             39.0            44.0            LD            .57
  4             70.5            66.5            LD            .24
Central
  1             13.0            59.0            LD          35.86****
  2             62.0            94.0            LD
  3             40.5            78.5            LD
  4             76.5            94.0            HD
Situational
  1             45.5            83.0            LD          16.94****
  2             75.0            93.5            HD           4.56*
  3             57.0            95.5            LD          15.52****
  4             94.0            84.5            LD           1.06
Positive
  1             14.5            51.0            LD
  2             24.5            47.5            LD
  3             19.0            30.0            LD
  4             45.0            34.5            LD
Supportive
  1             37.0            43.0            LD            .84
  2             49.0            51.5            LD            .12
  3             22.0            27.0            LD           1.14
  4             74.0            41.0            LD          26.56****
Statements of Rejection of Experimental Situation
  1              9.5             2.0            HD           5.92**
  2              2.5             3.0            HD            .10
  3             10.5             0              HD          10.50***
  4              6.0             0              HD           6.00**
Behaviors of Anxiety, Annoyance, or Boredom

Along the Central-Peripheral dimension, during all four sessions, the LDs 
made significantly more central statements than did the HDs. During the 
first three sessions, this difference was significant at the .001 level; during 
Session 4, at the .05 level. The intercoder reliability was .91.

The next category analyzed was Situational-Nonsituational. During the
first three sessions, the LDs made significantly more situational statements
than the HDs. The differences were significant at the .001 level in the first
and third sessions, and at the .05 level in the second. During Session 4,
there were no significant differences between the groups. The intercoder
reliability was .81.

The next category considered was Positive-Negative. During Sessions 1, 
2, and 3, the LDs made significantly more positive statements than did the 
HDs. The differences in the first and second sessions were significant at the 
.001 level; in the third, at the .05 level. There was no significant difference
between the two groups during Session 4. The intercoder reliability was .80.

Along the Supportive-Nonsupportive dimension, the first three sessions 
showed no significant differences between the two groups. During Session 4, 
the HDs made significantly (at the .001 level) more supportive statements 
than the LDs, contrary to the hypothesis. The intercoder reliability was .81. 

During Sessions 1, 3, and 4, the HDs made significantly more Statements 
of Rejection of the Experimental Situation than did the LDs. During Sessions 
1 and 4, these differences were significant at the .02 level; during Session 3,
at the .01 level. During Session 2, there was no significant difference between
the two groups. The intercoder reliability was .95.

The last category included Behaviors of Anxiety, Annoyance, or Boredom. 
During all four sessions, the HDs evidenced more outward signs of stress 
than did the LDs. In Sessions 1, 2, and 4 these differences were significant 
at the .001 level; in Session 3, at the .01 level. The intercoder reliability 
was .89. 

The trainers intervened slightly more in the HD group (42 interventions)
than in the LD group (31 interventions). These data indicate nothing about the
length of each intervention. A chi square computed between the number of
interventions per session and the level of dogmatism of the group was not
statistically significant, suggesting that the role played by the trainers in
both groups was similar.
D. Discussion


The ingroup data indicate many things. First of all, on some of the coding 
dimensions—Self-Other, Positive-Negative, Situational-Nonsituational, and 
Supportive-Nonsupportive—the HDs reached or exceeded the behavioral level 
of the LDs, but it took longer. This is in keeping with the expected difficulty 
of establishing psychological safety in the HD group. However, it is possible 
that the same level of behavior in the HD group is a function of the HDs’ 
behaviorally conforming to the trainers’ expectations of what their behavior 
should be. The HDs may give the appearance of accepting “honesty” and 
“openness” without actually changing their values, which Rokeach described 
as “party line change” typical of the HD. The trainer in a T-group, even 
though nondirective, is an authority figure. While in his presence, the HD 
may behave the way he thinks the trainer wants him to, exhibiting his
deference to authority figures, while not actually integrating into his belief 
system the values he thinks the trainer wishes him to adopt. 

The HDs’ exhibition of stress behaviors, along with their rejection of the 
experimental situation, indicates that the unstructured T-group situation 
made them uncomfortable. These findings further suggest that HDs are 
more prone to anxiety, particularly in unstructured situations where they 
do not have a set of rules they can use to govern their behavior. Being un- 
comfortable, the HDs may also have felt insecure and threatened, and con- 
sequently made fewer central statements about themselves or others. The 
fewer central statements implied that the content of their conversation tended 
to steer away from extremely personal statements which revealed something 
about their self-image or beliefs. 

One dimension, Supportive-Nonsupportive, was significant in the direction 
opposite to that predicted. During the fourth session, the HDs made more 
supportive statements than did the LDs. Another study (6) suggested that 
HDs do not undergo deeper psychological change, but only become more 
cheerful in a T-group over time. The greater number of supportive state- 
ments may simply be a reflection of superficial cheerfulness. Or, the reversal 
may be related to Jacoby’s (5) finding that there is a negative correlation be- 
tween dogmatism and independence of judgment. With a stronger indepen- 
dence of judgment, the LDs may feel less of a need to support and be sup- 
ported. HDs may actually use support of others as a defense, feeling that 
this support will lead to reciprocal support for themselves by others. 

The data showed that the trainers did intervene more in the HD than 
the LD group, but this only points further to the differences between the
groups. The HDs kept directing questions to the trainers. On the other 
hand, the LDs as a group did not rely as much on the trainers. 

What the statistics fail to bring out is the different flavoring of the groups. 
The HDs spent, in total, over an hour sitting in strained, nervous silence. 
One of these silences started at the beginning of the second session and 
lasted for half an hour, at which time it was broken by one of the trainers. 
In contrast, the LDs spent less than five minutes total in silence. They did 
not have difficulty with the usually uncomfortable first few minutes of the 
T-group. The lack of structure did not appear to bother them much. They
did not ramble, discussing what they should discuss, but jumped into their 
views of psychology and their opinions on the war. The staff members ate 
with the Ss during their dinner break and observed that with the HDs there 
was little dinner conversation and what little did occur was considered 
superficial. After dinner the HDs typically wandered off alone and in silence 
to await the expiration of the break hour, often returning to the experimental 
room where they sat quietly, waiting for the trainers to return. The LDs 
conversely talked, laughed, and joked with one another at dinner, often 
sitting around the table until just before it was time to return to the group. 
Their conversation often dealt with material that pertained to the group and 
their feelings about it, sometimes functioning as an extension of the group 
itself. 

The content of both groups was similar, but the approach to the content and
the process of the groups were very different. The favorite topic of
conversation of both groups was the war. The HDs spoke about their experiences.
The LDs spoke of their feelings and reactions to their experiences and their
reactions to the feelings of the others in the group.

In general, the hypotheses were supported, with the exception of the 
Supportive-Nonsupportive dimension, which was the opposite of the prediction.
The behaviors exhibited in the two groups differed significantly and, in
almost all cases, in the direction that would be predicted from Rokeach’s 
theory or additional research on dogmatism. 


E. Summary 


Hypotheses concerning T-group behaviors of homogeneous groups of high 
and low dogmatic Ss were derived from theory and research on dogmatism. 
Each group received 12 hours of sensitivity training by two nondirective 
trainers. Coders placed group members’ statements and behaviors into the 
following theoretically derived categories: Self-Other, Central-Peripheral, 
Situational-Nonsituational, Positive-Negative, Supportive-Nonsupportive,
Rejection of Experimental Situation, and Behaviors of Anxiety, Annoyance, and
Boredom. With one exception (Supportive-Nonsupportive), the results in 
whole or part supported the hypotheses. 


COMMENT ON “THE PROJECTED BRAIN ILLUSION”*


Department of Psychology, Brandeis University 


BEVERLY RENEE STERN


Recently an article appeared in this journal entitled “The projected brain 
illusion” (PBI) by du Mas and du Mas (3). The authors reported producing 
a “bizarre visual phenomenon” (p. 115) by moving a candle elliptically in 
front of and below the eyes on a dark night. The phenomenon is described 
as an image quite similar to “pictures of the human brain” (p. 115), con-
sisting of distinct black curved lines with pointed ends. Du Mas and du Mas 
believed the experiment and results to be new to the scientific literature on 
visual perception. 

However, the method used to obtain the effect and its description bear a 
remarkable semblance to Helmholtz’s account of the entoptical viewing of 
retinal vessels. In his Physiological Optics, Volume I (5), Helmholtz attributes
the original research and methods for production of the phenomenon to 
Purkinje (1819), and reviews the methods. Method Two appears identical 
to the method employed by du Mas and du Mas to obtain PBI. 

The second method of observing the vascular system of the retina is 
as follows: The observer looks at a dark background and at the same 
time moves a brilliant light to and fro a little to one side of the eye.
Under these circumstances, the background presently appears suffused 
with a pale grey glow, against which may be seen the dark outline of 
the vascular figure. It remains visible only so long as the light is kept 
moving (5, p. 214). 

One discrepancy between the Helmholtz and the du Mas and du Mas 
accounts concerns the movement of the light source necessary for viewing 
the image. The latter claim that “When PBI is observed, all motion of the 
candle is stopped and PBI can then be observed at length” (3, p. 115). This 
is quite surprising in view of the fact that PBI, an entoptical image, is 
actually a stabilized image with respect to the retina. With a typical external 
visual stimulus, movement of the image on the retina is characteristic, as a 
result of involuntary movements of the eye. However, this movement of the 
image across the retina is cancelled out when the visual stimulus moves in 
exact correspondence with the eye, as retinal vessels do. It has been known for
some time that these stabilized entoptical images disappear soon after ob- 
servance. 

Helmholtz accounted for this phenomenon by explaining that in ordinary 
vision, with light entering mainly through the pupil, shadows of the retinal 
vessels fall on a certain location of the retina, whereas when light enters 
largely through the sclera, shadows of these retinal vessels are projected onto 
sensitive new areas of the retina. When the shadows of the vessels fall on 
these atypical areas, the image can be observed, but with constant stimulation 
at the same location, it will fade and soon disappear. 

I have replicated this experiment and have attempted to replicate the 
du Mas and du Mas experiment, using both a candle and a small electric 
light source (of varying intensities), but as yet have been unable to retain the 
entoptical image unless the light source or the eyes are kept moving. In other 
words, movement of some form is necessary to continue viewing the image. 
A possible explanation may be that the du Mas and du Mas technique, using 
a candle out of doors, might be equivalent to continually moving illumination, 
recognizing the flicker motion of a candle flame and the motion of this light
stimulus that wind would create. These factors, then, might actually produce
the moving condition Helmholtz postulated as necessary to maintain the
entoptical image.


Summary


Du Mas and du Mas recently reported producing a visual phenomenon 
which they believed to be new to the scientific literature on visual perception. 
However, this phenomenon is remarkably similar to an account of entoptical 
viewing of retinal vessels given by Helmholtz (5). Methods used and effects 
obtained by both Helmholtz and du Mas and du Mas are cited and compared. 
A METHOD FOR THE STUDY OF COMPETITION
IN A FREE-OPERANT SITUATION* 


Georgetown University and West Virginia University 


ROBERT S. RUSKIN AND CHARLES D. CORMAN


A. Introduction


Most studies in the area of “dominant-submissive” behavior have tradi- 
tionally used round-robin paired-comparison competition testing on groups 
of subjects. “Dominant” subjects in these situations were those animals that 
successfully kept control of some limited access goal object for longer periods
of time than did other more “submissive” group members. In most limited 
access competitive situations, food is present in a container whose opening is 
wide enough for only a single subject to eat at one time (2, 3). The problem 
with this technique is that (a) accurate measurement of the amount of food
eaten by each competitor during a given competition is impossible to obtain,
and (b) using a single criterion measure for dominance eliminates the
measurement of many other social behaviors that are occurring concurrently
in any competitive situation. The worth of certain social behaviors in the 
determination of reliable dominance hierarchies has been adequately shown 
(1, 9). 

A procedure and apparatus have been developed that allow a more com- 
plete analysis of competitive behavior and the resulting dominance-submission 
hierarchies.


B. Method


The utilization of a “free-operant” test situation in conjunction with video- 
taped replays permits a multivariate analysis of competitive behavior and also 
allows one individual to evaluate all behavior present in the test, thus greatly 
increasing reliability in scoring. 


1. Apparatus 


Competition occurring in a free-operant situation may be studied through 
the use of rodents as subjects in a nose-poke apparatus. The nose-poke
compartment is 18” × 18” with plywood walls and grid floor. The nose-poke
opening itself is 114” in diameter and positioned in the middle of one wall.
The bottom lip of the opening is 14” above the grid floor. Food, water, and 
shock may be delivered by standard delivery mechanisms. A food or water 
cup is located directly behind the nose-poke opening. This positioning, in 
conjunction with the size of the nose-poke opening, allows penetration by only 
one animal at a time. Reinforcement is available for the responding subject 
and only to that subject in a competitive situation. 

Penetrations are recorded, and reinforcement is presented automatically 
by means of a photoelectric cell unit mounted inside the opening. The opening 
of the nose-poke may be varied in diameter, thus allowing an experimenter the 
option of studying widely differing species and test subjects of varying ages. 

The grid floor on the nose-poke allows presentation of shock to test subjects,
lending itself also to the study of competitive avoidance or escape behavior.
Hamilton (6) utilized avoidance and escape behavior to show that dominance- 
submission hierarchies can be obtained with rhesus monkeys. Hierarchies have 
been obtained in the nose-poke when hooded rats were forced to compete in a 
free-operant setting in order to escape foot shock. These results indicate that 
shock-escape competition can be a useful and promising measure of dominance. 
Free-operant competition in the presence of both positive and negative rein- 
forcers may therefore be analyzed in the nose-poke situation. It is important 
for the apparatus to possess the capability of measuring multiple behaviors if 
it is desirable to place the same subjects under competition in more than one 
condition for comparative purposes (8). 

Stimulus presentations may be programmed to the nose-poke in the same 
way as to a standard Skinner box. This affords the researcher the option of 
presenting complex schedules of reinforcement to subjects in a competitive
situation.

Closed-circuit television equipment is used in conjunction with the nose-poke
apparatus to ensure proper and complete scoring of all relevant competitive
behaviors, and to eliminate the previously mentioned problem of interscorer
reliability. Videotaping of all sessions permits replay of the session as many
times as needed until all relevant behaviors have been scored, and eliminates
the need for several experimenters hurriedly scoring the same session at once.
Several closed-circuit television systems permitting videotaping are available
at a cost of less than $2000 (10).

Competition may be studied in groups of varying size. Becker and Flaherty 
(2) found group size an important variable in the stability of resultant 
dominance hierarchies. A majority of studies in this area, however, used 
paired-comparison testing in which each member of a given group is
individually paired with all other members of the group in a round-robin
manner. “Limited access” testing procedures used in conjunction with round- 
robin paired-comparisons have been utilized with a large degree of success 
in determining dominance hierarchies (4). The nose-poke presents an op- 
portunity to institute procedures in which competitive situations involving 
only two animals or much larger groups can be studied. The advantages of 
videotaping readily become apparent when larger groups are studied. From 
the use of both the nose-poke and videotaping apparatus we have been able 
to isolate several scorable behaviors for analysis. Behaviors scored consist of 
the following: number of nose-poke responses emitted by each animal; amount 
of time each animal spends positioned in front of the nose-poke opening; and 
the initiator of each interaction. Each of these measures may be recorded in 
session totals and/or broken down into temporal segments within sessions.

Six behaviors have been isolated with this method, allowing a multivariate 
approach to the study of competition (7). The behaviors analyzed were 
classified in either a “social encounter” (1) or a “limited access” category 
(3). Sparring or biting responses initiated and the number of 9” x 9” grid 
floor segments traversed (locomotion) were considered by the experimenters 
to reflect “social encounters” present in the competitive situation. The time 
each competitor spent positioned in front of the nose-poke opening, the num- 
ber of poking responses made by each competitor, and the number of cross- 
overs in responding from one competitor to the other were classified as 
“limited access” competitive behaviors. 

In order for any method studying social dominance to function adequately, 
it must first demonstrate its viability by allowing determination of reliable 
dominance hierarchies from the measures used. 


2. Experimental Design 


Seventy-two male hooded rats were randomly placed into one of 12 groups. 
Six of the 12 groups were randomly assigned to the “social encounter” con- 
dition, and six were assigned to the “limited access” competitive condition. 

From weaning, all subjects were housed separately from other members 
of their group to insure that no dominant-submissive relationships were 
formed prior to testing. Age and weight, variables also found to be important 
in the establishment of dominant-submissive relationships, were held as con- 
stant as possible with no difference exceeding five days or 10 grams. 

Subjects were placed on 24-hour deprivation schedules and were randomly 
paired with all other members of their group daily for five consecutive days.
Each individual test trial (one pairing) was five minutes in duration and was 
videotaped for later scoring. 


C. Results and Discussion


Previous data in this laboratory (5) have determined that subjects in the 
social situation initiating the highest rates of sparring and biting, as well as 
the lowest locomotion scores, are the most “dominant” subjects in the resul- 
tant dominance hierarchies. Likewise, subjects emitting the highest nose- 
poking rates, the highest time scores positioned in front of the nose-poke 
opening, and the lowest number of crossovers while competing in the “limited 
access” situation also tend to be the most dominant in the resultant hierarchies. 

On the basis of these previous findings, animals in each group were ranked 
as dominant or submissive for a given pairing for each of the three measures 
used. At the end of each test day, therefore, the most dominant subject could 
achieve a high of 15 (five pairings X three measures per pairing). The lowest 
score that any subject could achieve was zero. 
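
The daily score described above can be sketched as a round-robin tally. This is a minimal illustration, assuming six animals per group (72 rats in 12 groups) and one point awarded for each measure on which an animal is ranked dominant in a pairing; the point rule, the animal labels, the measure names, and the toy outcome rule are all hypothetical and are not taken from the report.

```python
from itertools import combinations

# Placeholder names for the three measures scored in a given condition.
MEASURES = ("measure_1", "measure_2", "measure_3")


def daily_scores(animals, winner_of):
    """Tally one point per measure won across all round-robin pairings.

    winner_of(a, b, measure) returns whichever of a or b is ranked dominant.
    """
    scores = {animal: 0 for animal in animals}
    for a, b in combinations(animals, 2):  # every animal meets every other once
        for measure in MEASURES:
            scores[winner_of(a, b, measure)] += 1
    return scores


animals = ["R1", "R2", "R3", "R4", "R5", "R6"]  # hypothetical labels


def toy_winner(a, b, measure):
    """Hypothetical rule: the lower-numbered animal is always dominant."""
    return min(a, b)


scores = daily_scores(animals, toy_winner)
print(scores)
```

With six animals there are 15 pairings per day, each animal takes part in five of them, and the scores range from 0 to 15, which matches the limits given in the text.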

Coefficients of concordance were computed over the five test days and are 
presented in Table 1. It is important to note that all groups in the “social 
encounter” situation and five of the six groups in the “limited access” situation 
were found to be reliable. 
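
For readers unfamiliar with the statistic, a coefficient of concordance (Kendall's W) over test days can be sketched as follows. This is an illustration under stated assumptions, not the authors' computation: the daily scores are invented, tied scores receive average ranks, and no tie correction is applied to W.

```python
def average_ranks(values):
    """Rank values (1 = smallest); tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kendalls_w(score_table):
    """score_table[d][a] is the score of animal a on day d (no tie correction)."""
    m = len(score_table)      # number of days (rankings)
    n = len(score_table[0])   # number of animals ranked
    rank_sums = [0.0] * n
    for day in score_table:
        for animal, rank in enumerate(average_ranks(day)):
            rank_sums[animal] += rank
    mean_sum = m * (n + 1) / 2
    s = sum((rs - mean_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical daily scores for one group of six animals over five days.
perfect = [[15, 12, 9, 6, 3, 0]] * 5
mixed = [
    [15, 12, 9, 6, 3, 0],
    [14, 13, 8, 7, 2, 1],
    [12, 15, 9, 5, 4, 0],
    [15, 11, 10, 6, 2, 1],
    [13, 12, 9, 7, 4, 0],
]

w_perfect = kendalls_w(perfect)
w_mixed = kendalls_w(mixed)
print(w_perfect, w_mixed)
```

W equals 1 when the animals are ranked identically on every day and falls toward 0 as the daily rankings disagree.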

On the basis of this evidence, the method described and the measures used
appear to be adequate in providing the experimenter with a tool for the
determination of highly reliable dominance hierarchies.


D. Summary


Competition and resultant dominance hierarchies were obtained in rodents
through the use of a free-operant testing procedure. It was found that the
nose-poke apparatus, videotaping, and free-operant testing could be
successfully implemented in the determination of reliable dominance hierarchies.
CREATIVE GROWTH AND COGNITIVELY
STRUCTURED STIMULATION IN EDUCATION* 


The University of Georgia and Indiana University 


BERT O. RICHMOND, VICTOR K. PHILLIPS, AND WILLIAM BLANTON


A. Introduction


Some critics of education insist that the major impact of educational experi- 
ences on the child is to increase conformity and to limit his growth in freedom 
and self-expression. This structuring process may limit the full development
and expression of an individual so that he feels rejected by and in turn rejects 
the educational process. Such an explanation is offered for many students who 
rebel and drop out of school. Janssen (6) reports from his study of a group of 
children in the United States that dropout students with a lower socioeconomic 
background are more creative than nondropout students with a lower socio- 
economic background. Earlier, Jackson and Marsden (5) reported similar 
results from their study which utilized dropout and nondropout students from 
a lower socioeconomic background in Great Britain. 

Prior to either of the above studies, Rogers (8, p. 292) urged greater atten- 
tion to providing creative experiences in schools:

But if we value creativity . . . then we may wish to give a trial to 
ways of facilitating learning which give more promise of freeing the 
mind. If we value independence, if we are disturbed by the growing
conformity of knowledge, of values, of attitudes, which our present system
induces, then we may wish to set up conditions of learning which make 
for uniqueness, for self-direction, and for self-initiated learning. 


Apparently the educational process has been unwilling or unable to ingest 
the research findings or to heed the exhortations of Rogers and those with 
similar points of view. As recently as 1970, James Cass (2, p. 61), Education 
Editor of the Saturday Review concluded: 


The student rebellion, on the other hand, made clear that in large 
part the schools were failing the advantaged as well as the deprived.
The pervasive emphasis on conformity rather than creativity, on
discipline rather than independence, on the defensive “put down” rather
than student support, on quiet orderliness rather than on the joy of
discovery, on the neatness of administrative convenience rather than the
often untidy environment of true learning . . . all highlighted the
authoritarian rigidities of the system. Gradually, an increasing number
of adults became aware that they were subjecting their children to a
custodial environment that denied the very nature of childhood and youth
. . . an environment to which they would not consciously submit
themselves any more willingly than would their rebelling children.


There seems to be enough concern and sufficient evidence to suggest a
meaningful relationship between the child’s creative experiences and his feelings
about education. There is ample evidence and nearly unanimous agreement 
among educators that the child’s feelings about school play an important role 
in his success in school. One of the questions that remain unanswered is the 
effect of different kinds of educational experiences on the child’s creative 
development. 

A number of recent preprimary educational programs have moved from play 
and social activity approaches to school readiness to more explicitly intellectual 
development ones (1, 3, 4, 7). However, these approaches have not indicated 
the effects of such programs on the child’s creative development. One of the 
few psychometric documentations of creative growth associated with an ex- 
perimental preprimary program is reported by Torrance, Fortson, and Orcutt 
(11). Children in this experimental program exhibited greater creative think- 
ing ability than those in a more traditionally oriented preschool program. 

With these basic issues in mind, the following study was devised to assess 
the effect of a typical cognitively structured educational program upon the 
development of creative ability in children. Torrance (10) reports that
children normally progress in creative ability for the first few years in elementary
school, but apparently lose some of their zest by the fourth grade. It is also
usually assumed that the more intelligent child will profit more from cognitive 
exercises. For these reasons, first grade pupils of both average and superior 
intellectual ability were utilized in the study. The specific objectives of this 
study were to measure the effects of cognitively structured stimulation in an 
educational program upon the creativity of first grade children and to determine
if there were differential effects of such a program upon first grade males and 
females of average and superior intelligence. 


B. Procedures


All subjects used in the present study were randomly selected from a parent 
population specifically selected to exemplify a national stratification by sex,
socioeconomic status, parental educational level, and race. The parent
population comprised the entirety of an elementary school research model in which
special content orientation in the form of cognitive structured stimulation was
undertaken. Children in the first grade (with the exception of the control
groups) received special advanced instruction in areas of physical education, 
art, music, social studies, and mathematics. The professional staff were basically 
concerned with the cognitive stimulation of the children. All forms of
classroom and intraclassroom activity were directed toward the objective of
convergent behavior with basic content as the subject matter.

All first grade subjects in the entire school were randomly assigned either to 
control or experimental conditions. All first grade teachers were randomly
assigned to work with experimental or control classes. This resulted in two
experimental and three control classes. Each control class was self-contained
and exposed to only one teacher for all content areas. The three control teachers
were given no instructions regarding teaching activities.

Each of the two experimental classes was exposed to five teachers throughout 
the day, each teacher representing a cognitive structured content area. Thus the
total experimental sample was exposed to five teachers, while the total control
sample was exposed to three teachers. It was not possible to control completely
for the teacher variable problem. This procedure was employed to minimize 
the effect of the teacher on the child’s learning. 

The control subjects were randomly selected from the same elementary 
school but had experienced no preprimary educational stimulation. They
were similar in background, intelligence, and sex to those included in the ex- 
perimental preprimary program. At the time of the present study the experi- 
mental group was involved in its second year of a cognitively structured 
educational program, whereas the control group was experiencing its first year 
of a traditional elementary school program. 

The research samples used in the present study were randomly selected from 
the total of both experimental and control groups. The sample consisted of 40 
children: 20 first grade experimentals and 20 first grade controls, equated on
IQ grouping, sex, and age. Each group contained five males and five females
in the 90-110 Stanford Binet (Third Revision) group and an equal number of 
each sex in the 119-133 Stanford Binet group. 

Both experimental and control subjects were individually administered the 
Figural Form B of the Torrance Tests of Creative Thinking. This test is
composed of four components that relate to the subject’s creative thinking
ability in terms of fluency, flexibility, originality, and elaboration.

The creativity components describe aspects of developmental behavior that 
indicate divergent abilities (9). Fluency represents the subject’s ability to
generate ideas and concepts unrestricted by content parameters. Flexibility
denotes a measure of the diversity that pervades the subject’s ideas and problem
solving behavior. It also indicates to a large extent the subject’s ability to view 
his environment as multifaceted and to shift his behavior when conventional 
methods are no longer appropriate. Originality measures indicate the strength 
of the mental leap and the rarity of departure from the commonplace. Such 
measures characterize the truly divergent behavior of creative individuals. The
elaboration score exemplifies the subject’s regard for his own ideas. In this 
measure it is not infrequent to find subjects who add details to basic ideas in 
order to convey the essence of their concepts and feelings, especially in the 
young where the need to communicate is very strong. 

These data were evaluated by means of four analyses of variance in which
each component score of the Torrance Test was used as a dependent variable. 
In each of the analyses (2 X 2 X 2) there are two levels of treatment, two 
levels of intelligence, and two levels of sex. A 10 point differential between the 
two levels of intelligence was arbitrarily assigned to ensure a comparison be- 
tween “average” and “superior” intellective groups. 
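
The layout of each 2 X 2 X 2 analysis can be checked with a short sketch. It assumes five children per cell, which follows from the sample description (five males and five females at each IQ level in each of the two treatment groups); the variable names are illustrative only.

```python
from itertools import combinations, product
from math import prod

# Factor levels as described in the text.
factors = {
    "treatment": ["experimental", "control"],
    "intelligence": ["average", "superior"],
    "sex": ["male", "female"],
}
n_per_cell = 5  # assumption derived from the sample description

cells = list(product(*factors.values()))
n_total = len(cells) * n_per_cell

# Degrees of freedom for each main effect and interaction:
# the product of (levels - 1) over the factors involved.
df_effects = {}
for size in range(1, len(factors) + 1):
    for names in combinations(factors, size):
        df_effects[" x ".join(names)] = prod(len(factors[n]) - 1 for n in names)

# Within-cell (error) degrees of freedom: total N minus number of cells.
df_error = n_total - len(cells)

print(n_total, df_error)
print(df_effects)
```

This gives 40 subjects in eight cells, one degree of freedom for each of the seven effects, and 32 degrees of freedom for error, in agreement with the "1 and 32 df" reported with the F values.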

The results of these computations are given in the following tables. 


C. Results 


Table 1 contains the analyses of variance in which individual scores on each 
of four measures of creativity were used as the dependent variable in compar- 
ing levels of treatment, intelligence, and sex. 

On figural fluency, at the end of the first grade year the experimental 
children did not differ significantly from the control subjects in the ability 
to generate a higher number of responses. This is reflected in the treatment F 
value of .27 (with 1 and 32 df). In terms of intellective abilities, it would 
seem logical to assume that if a relationship existed between IQ measures and 
creativity, a content oriented program would certainly induce differences 
between average and superior IQ groups on fluency scores. Such was not the 
case. The difference between average and superior IQ groups in relation to 
the number of responses elicited under control and experimental conditions 
was not significant as seen in Table 1 (F value of .33, with 1 and 32 df). This 
indicates a lack of relationship between creative fluency and IQ measures, and 
tends to support the concept that convergent type educational stimulation does 
little for the creative potential of the individual in a developmental sense. In 
addition, no significant differences in fluency measures on the basis of sex 
occurred. The F value of .006 (with 1 and 32 df) indicates that the sex of the 
subject does not contribute to differences in fluency. 

In Table 1 the interaction F values may be interpreted at face value. With 
1 and 32 df, only intelligence by sex approached, but failed to reach, 
significance at the .05 level. It is evident from these data that the main 
effects or any combination thereof reflect no significant differences between 
the first grade children compared on creative fluency. 


TABLE 1 
ANALYSES OF VARIANCE FOR TREATMENT, INTELLIGENCE, 
AND SEX ON FOUR MEASURES OF CREATIVITY 

Source               SS         df    MS         F 

Figural fluency 
  Treatment          9.03       1     9.03       .27 
  Intelligence       11.03      1     11.03      .33 
  Sex                .23        1     .23        .006 
  T × I              11.02      1     11.02      .33 
  T × S              24.02      1     24.02      .72 
  I × S              119.02     1     119.02     3.58 
  T × I × S          .23        1     .23        .006 
  Error              1063.20    32    33.22 

Figural flexibility 
  Treatment          27.23      1     27.23      1.78 
  Intelligence       .63        1     .63        .04 
  Sex                .23        1     .23        .01 
  T × I              1.22       1     1.22       .08 
  T × S              13.22      1     13.22      .86 
  I × S              38.02      1     38.02      2.49 
  T × I × S          .03        1     .03        .001 
  Error              487.20     32    15.22 

Figural originality 
  Treatment          402.23     1     402.23     3.22 
  Intelligence       319.23     1     319.23     2.55 
  Sex                11.03      1     11.03      .08 
  T × I              289.02     1     298.02     2.38 
  T × S              28.22      1     28.22      .22 
  I × S              455.62     1     455.62     3.65 
  T × I × S          1.03       1     1.03       .008 
  Error              3994.00    32    124.81 

Figural elaboration 
  Treatment          2433.60    1     2433.60    13.00* 
  Intelligence       688.90     1     688.90     3.68 
  Sex                48.40      1     48.40      .25 
  T × I              193.60     1     193.60     1.03 
  T × S              184.90     1     184.90     .98 
  I × S              3.60       1     3.60       .01 
  T × I × S          .10        1     .10        .0005 
  Error              5990.00    32    187.18 

Note. T = treatment; I = intelligence; S = sex. 
* Significant at .01 level. 

Table 1 also contains the comparisons between first grade groups on mea- 
sures of creative flexibility. Main and interaction effects (with 1 and 32 df) 
are similar to the reported data for creative fluency insofar as no significant 
differences were found. These findings suggest several possibilities. It may well 
be that the experimental children who received intensified training in amassing 
factual content were not equally trained in alternate ways of exploiting the 
information imparted to them. Also, it seems evident that intelligence measures 
fail to assess the adaptability potential in child behavior as indicated by the F 
value of the intelligence effects. 

In the analysis of first grade originality measures, it appears that a cogni- 
tively structured educational program emphasizing convergent learning does 
not increase divergent behavior. As seen in Table 1, none of the F values for 
originality reached significance. 

Table 1 also contains the analysis of first grade measures on creative elabora- 
tion. Where the experimental treatment had no marked effect upon the in- 
dividual’s creative fluency, flexibility, and originality, it had a pronounced 
influence upon the subject’s ability to elaborate. The difference in favor of the 
experimental subjects on creative elaboration was found to be significant beyond 
the .01 level (F value of 13.00, with 1 and 32 df). 
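
The F values reported in Table 1 can be checked from the tabled mean squares, since each F is the effect mean square divided by the error mean square. The following Python sketch recomputes the four treatment F ratios. The critical values for 1 and 32 df (about 4.15 at the .05 level and 7.50 at the .01 level) are taken from standard F tables and are not given in the article, and all names in the code are illustrative.

```python
# Mean squares copied from Table 1: (treatment MS, error MS) per measure.
TABLE_1 = {
    "fluency": (9.03, 33.22),
    "flexibility": (27.23, 15.22),
    "originality": (402.23, 124.81),
    "elaboration": (2433.60, 187.18),
}

# Approximate critical F values for 1 and 32 df from standard tables
# (an assumption of this sketch, not stated in the article).
F_CRIT_05 = 4.15
F_CRIT_01 = 7.50


def f_ratio(ms_effect, ms_error):
    """F ratio for an effect tested against the error mean square."""
    return ms_effect / ms_error


def treatment_f_ratios(table=TABLE_1):
    """Return {measure: F} for the treatment main effect."""
    return {name: f_ratio(ms, err) for name, (ms, err) in table.items()}


if __name__ == "__main__":
    for name, f in treatment_f_ratios().items():
        if f > F_CRIT_01:
            flag = "p < .01"
        elif f > F_CRIT_05:
            flag = "p < .05"
        else:
            flag = "n.s."
        print(f"{name:12s} F = {f:6.2f}  {flag}")
```

Under these assumptions only the elaboration ratio exceeds the .01 critical value, which agrees with the pattern of results described in the text.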

Table 2 contains the means and standard deviations for each experimental 
and control group and further explains the preceding results. 


D. Discussion 


It appears that first graders being taught to think and solve problems con- 
vergently give responses characterized by ornamentation rather than unique- 
ness, quality, synthesis, or improved levels of information integration. Subjects 
tending only to be elaborate, without displaying strengths in creative fluency, 
flexibility, and originality, indicate the inability to break away from the stimu- 
lus into more creative expression. 

In conclusion, subjects in this study, regardless of their intellectual level, did 
not achieve more creative expression as a result of cognitively structured 
stimulation. The study further implies, as many college students and educators 
advise, that an emphasis on cognitively structured experiences in education does 
not enable the individual to progress in creative self-expression. 

These findings lend support to those of other researchers who find the 
creative individual not stimulated sufficiently by his educational experiences. 
This research raises the possibility that pupils seeking creative growth and 
stimulation for greater freedom of expression do not find either in a cognitively 
structured education program. Also, the results suggest further that the 
emphasis in education on imparting information currently regarded as fact 
without a sufficient opportunity to expand meaningful modes of self-expression 
is of dubious value. Such an emphasis may fail to encourage the development of 
an inquiring mind that can grasp for the reality of a world as yet unknown. 
E. Summary 


Twenty first grade experimentals enrolled in the second year of a cognitively 
structured educational stimulation program were equated with 20 first grade 
controls on sex, age, grade, and IQ groupings and compared by analyses of 
variance on dependent measures of creativity. The control pupils were in a 
traditional classroom that did not stress cognitive experiences. The objectives 
of the present study were (a) to measure the effects of highly content oriented 
instruction upon first grade fluency, flexibility, originality, and elaboration as 
indicative of independent behavior; (b) to assess the effects of such a program 
upon the creative development of superior and average intelligence groups; and 
(c) to evaluate the impact of this approach upon divergent expression in male 
and female subjects. The effects of structuring the learning situation to 
maximize factual content without stimulation by divergent experience created 
a significant growth in elaborate behavior, but failed to bring about growth of 
new ideas, different ideas, and the positive aspect of development which is 
referred to as adaptability. 
SUPERPSYCHOMETRICS: ULTRARELIABILITY 
AND HYPERVALIDITY* 


Augusta, Georgia 


Frank M. du Mas 


A. Introduction 


Psychology began as an infant science about a century ago and immediately 
took the physical sciences as its developmental model. Psychology attempted 
to discover its basic laws and principles. If psychology were to develop laws as 
the more mature sciences had, it was necessary that it turn its attention first to 
methods for measuring those phenomena entering into lawful relationships. 

The chronology of psychological measurement began with psychophysical 
scaling (13, 14) and runs through a wide variety of other methods, such as psy- 
chological scaling (12, 13, 14), mental tests (1, 2, 12, 13, 14), factor analysis 
(13, 26, 27, 30, 31), configural scaling (4, 5, 6, 9, 15, 16, 18, 19, 28, 29), 
pattern analysis (3, 4, 6, 9, 21, 23, 28, 29, 34), and multidimensional scaling 
and prediction (17, 24, 26, 31, 33). Probably the best single presentation is 
Guilford’s Psychometric Methods (13). 

There now exists a plethora of psychometric methods. But the science of 
psychological measurement seems to have reached a region of vexing resistance 
somewhat like the sonic barrier in physics. For some years now psychometrics 
seems to be unable to go beyond this metric region of resistance characterized 
by reliability coefficients of .90-.95 and validity coefficients of .50-.70. What is 
needed is a scientific breakthrough in psychology equivalent to the breaking of 
the sound barrier in physics. Once this is accomplished there should be a 
rapidly accelerating improvement in applied psychometry, just as there was in 
airborne transportation once the sound barrier was breached. 

We have created three neologisms by means of Greek and Latin prefixes to 
describe the mensurational characteristics of psychological tests beyond the 
psychometric barrier. Ultrareliability denotes the reliability of tests when 
$r_{xx} > .99$. Hypervalidity denotes higher validity than previously obtained, 
say, when $r_{xc} > .90$. Superpsychometrics denotes the scientific procedures 
and instruments associated with the attainment of ultrareliability and 
hypervalidity (8). 


* Received in the Editorial Office on May 16, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
The remainder of this paper attempts to answer the question: Is it possible 
for psychological measurement to approximate the degree of precision extant 
in the biological and physical sciences? 


B. Possible Answers 


Behavioral scientists sometimes must use items, tests, or information whose 
predictive power is very low. These low predictor entities are often used by 
clinical and applied practitioners working with biographical data sheets or case 
histories utilizing such data as birthplace, books read, religion, etc. The sup- 
pressor variable is another case in point. 

Perhaps the notion of very low predictor items being included in an 
aggregate whose characteristics are stable and predictable will seem more 
palatable to behavioral scientists by the contemplation of Bohr’s (20) principle 
of complementarity in physics. Complementarity describes a seemingly paradoxical 
state of affairs where the behaviors of individual atoms in a volume of gas are 
completely unpredictable; simultaneously, the states of the system as a whole 
are quite predictable. If physics can conceive of perfect order emerging from, 
or coexisting with, total disorder, then psychology might find such conceptions 
useful in its conceptualizations as well. 

Another solution to the problem would be to locate a few high predictor 
entities and use these to predict complex phenomena. Something of this notion 
is contained in the concepts of single item tests, multiple correlation, and factor 
analysis. 

A different solution was proposed by du Mas (7) in his theoretical analysis 
of what he called intratests. He showed that all tests, with the exception of 
single item tests, were composed of intratests, and the number of such 
intratests becomes fantastically large for even a test of such moderate size as 
100 items. He defined two quantities: U, the proportion of available information 
in any test that is typically utilized, and W, the proportion of information 
wasted. He further deduced that U → 0 and W → 1 as tests increase in 
length. 

Still another solution to our general problem may be found in the areas of 
pattern and configural analyses (3, 5, 6, 15, 18, 19, 23, 28). Meehl’s (21) 
paradox showed that two items, each having zero correlation with the 
criterion, become perfect predictors when their responses are considered 
together. Du Mas (6) demonstrated that two items, if answered honestly by 
subjects, would yield a three strata scale based on response patterns whose 
validity coefficient was unity. 
C. Theoretical Rationale 

While it is obvious from the preceding discussion that there are several 
possible approaches to the attainment of superpsychometrics, we shall follow a 
course that is closer to conventional ideas as contained in classical test theory. 
It is not the intention of this author to bore the reader with a restatement of 
classical test theory, which can be found in any of several sources. But it is 
necessary that we review certain of these notions so that our suggested solution 
to the problem of attaining superpsychometrics will be as clear as possible. 
One way of approaching the problem is to make the best estimate of an in- 
dividual’s true score with extremely small error variance. Theoretically, there 
are several ways this can be done. 

1. One moderate length test taken many times by one particular individual. 
The mean of many test scores would improve the estimate of an individual’s 
true score. 

2. A single long test composed of a very large number of items taken by a 
single individual. 

3. Many short parallel tests taken by an individual. The average score of 
these many equivalent scores would yield a better estimate of the individual’s 
true score. 

Figure 1 presents the essential details for a clarifying example of what will 
be discussed. It depicts a normal distribution over the metric X with mean 
$M_x$, standard deviation $\sigma_x$, and the standard error of measurement 
$\sigma_e$ for the score S. 

Let us now state the assumptions under which the following developments 
are made. They will seem elaborate and restrictive, but actually several re- 
searchers have found that wide deviations from these may occur without 
seriously affecting empirical results. The assumptions are as follows: 

I. All operations are such as to yield random sampling of specified 
parameters. 

II. When scores are combined they are unweighted, simple summative 
scores. 

III. An obtained score on the variable X is composed of two components, a 
true score component and an error score component, as: $X = X_t + X_e$. 

IV. The correlation between true scores and error scores is zero: $r_{te} = 0$. 

V. Error scores on parallel tests are uncorrelated: $r_{ee} = 0$. 

VI. Parallel tests have the same means, variances, and correlations between 
all pairs of tests in the population. 
FIGURE 1 
GRAPHIC DEPICTION OF THE NORMAL DISTRIBUTION AND CERTAIN PARAMETERS 
DISCUSSED IN THE THEORETICAL RATIONALE 


From assumptions I and III we know that the obtained mean is equal to the 
mean of the true scores plus the mean of the error scores; since error scores are 
both positive and negative and normally distributed, they tend to cancel each 
other out. 

It follows that the mean of the obtained scores is an unbiased estimate of the 
mean of the true scores; that is, 


$M_x = M_t$. [1] 


It also follows that the variance of obtained scores is equal to the sum of 
the true score variance and the error variance as, 


$\sigma_x^2 = \sigma_t^2 + \sigma_e^2$. [2] 


From relations 1 and 2 we see that errors contribute differently to the two 
statistical parameters. That is, since plus and minus errors tend to cancel 
each other out, there is no influence on the mean; but since plus and minus 
values are squared for variances, error variance does make a contribution to 
obtained score variance. A widely held view is that reliability can be 
regarded as the proportion of true score variance existing in the obtained 
score variance as, 


$r_{xx} = \sigma_t^2 / \sigma_x^2$. [3] 


It follows that the proportion of unreliability, so to speak, of a test is 
indicated by, 


$\sigma_e^2 / \sigma_x^2 = 1 - r_{xx}$. [4] 


From relations 3 and 4 it follows that 


$\sigma_e = \sigma_x \sqrt{1 - r_{xx}}$. [5] 



The term on the left is the well-known standard error of measurement; it is 
also called the standard error of the obtained scores. 

Since the standard deviation of obtained scores indicates the way particular 
magnitudes of a psychological dimension are distributed in the population, the 
standard deviation of obtained scores for the population is constant and will not 
shrink or change. As a test instrument is made better or worse, the contribu- 
tions of true score variance and error score variance are changed; and from 
relations 3, 4, and 5 it follows that in the measurement of an individual score, 
the error indicated by the standard error of measurement is directly dependent 
upon the size of the coefficient of reliability. The theory indicates, then, that 
in order to reduce the standard error of measurement and to constrict fiducial 
limits for an individual's true score, it is necessary that test constructors increase 
the size of the reliability coefficient. So much for conventional test theory. 
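
The dependence of the standard error of measurement on reliability described above can be illustrated with a short Python sketch of relation 5. The function name is illustrative, and the sample values (a standard deviation of 20 and a reliability of .80) are borrowed from the hypothetical example given later in the paper.

```python
import math


def standard_error_of_measurement(sd_obtained, reliability):
    """Relation 5: sigma_e = sigma_x * sqrt(1 - r_xx)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd_obtained * math.sqrt(1.0 - reliability)


if __name__ == "__main__":
    # With sigma_x fixed, the error shrinks only as r_xx rises.
    for r_xx in (0.50, 0.80, 0.95, 0.99):
        sem = standard_error_of_measurement(20.0, r_xx)
        print(f"r_xx = {r_xx:.2f}  sigma_e = {sem:.2f}")
```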

There is an implicit logical error contained in applying the conventional 
interpretation of the standard error of measurement defined in relation 5. For 
example, it is assumed that this standard error of measurement derived for all 
individuals of the population is the same for each individual who achieves the 
same obtained score. That this is certainly not true is indicated by several 
studies of individual variability, consistency, or inconsistency (10, 11, 13, 22, 
25, 32). In fact, one attribute of personality or behavior is that some individuals 
exhibiting the same degree of a trait show characteristically a wider range of 
magnitudes at different times and on different occasions than other individuals. 
We speak of someone being more “emotional” and others as being more 
“stable” or less “erratic” in their test responses. 

Another implicit idea contained in the conventional interpretation of the 
standard error of measurement for an individual’s obtained score is that this 
measure, which is essentially an average error for all members of the popula- 
tion, is an adequate description of each individual within the population who 
has that same score. 


1. Solution 1. Standard Error of an Individual’s Mean 


Suppose that a single individual takes the same test many times. For the sake 
of this logical development let us assume that there is no systematic error 
because of experiences with the same test. Or let us assume that the individual 
takes a large number of parallel tests for the same dimension. Either way, we 
get many different obtained scores for a single individual. From these many 
scores from a single individual, we calculate a new mean, $M_i$, and a new 
variance, $\sigma_i^2$. Either of these two means would be a better estimate of the 
individual’s true score than any single obtained score. Similarly, the standard 
deviation of the individual’s many obtained scores is a better estimate of the 
standard error of measurement for that particular individual than the popula- 
tion standard error of measurement. 

But we can go farther than this. We have calculated the standard error of 
an individual’s scores obtained from repeated testing of the same individual as 
$\sigma_i$. 

It follows that the standard error of the mean, $\sigma_{M_i}$, for all $n$ scores 
obtained by a single individual is given by, 


$\sigma_{M_i} = \sigma_i / \sqrt{n}$. [6] 

As the number of obtained scores from parallel tests on the same individual 
increases, the standard error of the mean rapidly decreases. The fiducial limits 
within which an individual’s true score lies become increasingly narrow, and 
this is true regardless of the size of the coefficient of reliability as long as 
$r_{xx} > 0$. 


Let us clarify. Even if the reliability of a test is, say, .10 or .01, we can 
estimate an individual’s true score with great precision and with little error. 
As the number of test scores for a single individual over the same dimension 
becomes exceedingly large, the standard error of the mean of his scores becomes 
exceedingly small and approaches zero as the limit, for any nonzero reliability 
coefficient. This permits precise 
estimation of psychological magnitudes by using present tests with low or 
moderate reliability and getting many scores on each individual. Theoretically, 
the problem of superpsychometrics is solved. 
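
The shrinkage of the standard error of an individual's mean can be sketched numerically. The Python below assumes the usual form of relation 6, the individual's standard deviation divided by the square root of the number of scores; the function names and the sample standard deviation of 10 are hypothetical and chosen only for illustration.

```python
import math


def standard_error_of_mean(sd_individual, n_scores):
    """Standard error of the mean of n scores from one individual."""
    if n_scores < 1:
        raise ValueError("at least one score is required")
    return sd_individual / math.sqrt(n_scores)


def scores_needed(sd_individual, target_se):
    """Smallest number of scores whose mean has a standard error <= target_se."""
    if target_se <= 0:
        raise ValueError("target standard error must be positive")
    return math.ceil((sd_individual / target_se) ** 2)


if __name__ == "__main__":
    for n in (1, 4, 100, 10000):
        print(n, standard_error_of_mean(10.0, n))
    print("scores needed for SE of 0.5:", scores_needed(10.0, 0.5))
```

The quadratic growth in the number of scores required is the practical difficulty taken up in the next paragraph: halving the standard error takes four times as many scores.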

Empirical problems of getting many meaningful scores on the same individual 
involve questions of testing time, availability of items, and such individual 
factors as fatigue, learning, level of ability of the testee, etc. We suggest one 
solution to these practical problems in a later section of this paper. 


2. Solution 2. Correlational Approach 


We now turn our attention to the conventional consideration of the problem 
of improved test reliability. 

It is commonly held that the proportion of the obtained score variance ac- 
counted for by the true score variance is given by the square of a correlation 
coefficient, a sort of coefficient of determination. This index of reliability is 


$r_{xt} = \sqrt{r_{xx}}$. [7] 

Now if we combine parallel forms, this is equivalent to making a test longer 
so that the summation of an individual’s scores on parallel forms is equal to 
an individual’s score on a test n times as long as any one of the parallel tests. 


We will not repeat the proofs for the following relations, since they are avail- 
able in several sources (13, 14). 


$\sigma_{nx}^2 = n \sigma_x^2 [1 + (n - 1) r_{xx}]$, [8] 
$\sigma_{ne}^2 = n \sigma_e^2$, [9] 
$\sigma_{nt}^2 = n^2 \sigma_t^2$. [10] 


What relations 8, 9, and 10 state is that as we increase the length of a test n 
times, the contribution of error variance increases n times, but the contribution 
of true score variance increases n square times. That is, the lengthening of a 
test greatly increases the proportion of true score variance and decreases the 
proportion of error score variance in the obtained score variance. 

Much of the theoretical discussion up to this point can be partially sum- 
marized by the central limit theorem, which states that when scores are 
given as standard scores, the sum of n independent random variables tends 
to normality as n → ∞. 

The preceding logic results in the now famous Spearman-Brown formula 
for a test increased to any length as, 


$r_{(nx)(nx)} = \frac{n r_{xx}}{1 + (n - 1) r_{xx}}$. [11] 
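
The step from the variance relations to the Spearman-Brown formula can be written out explicitly. The derivation below is a sketch under the notation assumed here: subscript x for obtained scores, t for true scores, and nx, nt for the corresponding quantities on a test n times as long. It applies the definition of reliability (relation 3) to the lengthened test and substitutes relations 8 and 10.

```latex
\begin{aligned}
r_{(nx)(nx)} &= \frac{\sigma_{nt}^2}{\sigma_{nx}^2}
  = \frac{n^2 \sigma_t^2}{n \sigma_x^2 \left[ 1 + (n - 1) r_{xx} \right]} \\
 &= \frac{n \left( \sigma_t^2 / \sigma_x^2 \right)}{1 + (n - 1) r_{xx}}
  = \frac{n \, r_{xx}}{1 + (n - 1) r_{xx}}
\end{aligned}
```

As n grows without bound the ratio approaches 1 for any positive value of the original reliability, which is the property the argument relies on.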
Table 1 depicts a hypothetical example for a mental ability test with a mean 
of 100, a standard deviation of 20, and a reliability coefficient of .80. 

Let us also assume that an individual obtained a score, S = 120, and that 
the test was 25 items long. The standard error of measurement for the score, 
$\sigma_e$, is 8.94 (calculated from formula 5) as shown in the first data column of 
Table 1. 


TABLE 1 
HYPOTHETICAL EXAMPLE SHOWING CHANGES IN TEST PARAMETERS 
DUE TO LENGTHENING 


Measure        X         25X       250X 
$r_{xx}$       .80       .99       .999 
$M_x$          100.00    100.00    100.00 
$\sigma_x$     20.00     20.00     20.00 
S              120.00    120.00    120.00 
$\sigma_e$     8.94      2.00      .63 


We can see what happens to the reliability coefficient and the standard error 
of measurement when that same test is made 25 times as long, as shown in the 
second data column, and 250 times as long, as shown in the third data column of 
the same table. Note that the population mean, the population standard devia- 
tion, and the score remain constant. However, lengthening a test causes the 
coefficient of reliability to climb from .80 to .999, and the standard error of 
measurement to shrink from 8.94 to .63. It is also interesting to note that if 
the test is lengthened to 250,000X, the coefficient of reliability becomes 
.999999. This precision is superior to many measurements found in the biologi- 
cal and physical sciences. 
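
The entries of the hypothetical table can be reproduced with a few lines of Python. This is a sketch that assumes the standard Spearman-Brown form for relation 11 and relation 5 for the standard error of measurement, and, as in the table, holds the standard deviation of obtained scores at 20 for every test length. Function names are illustrative.

```python
import math


def spearman_brown(reliability, n):
    """Reliability of a test lengthened n times (relation 11)."""
    return n * reliability / (1.0 + (n - 1.0) * reliability)


def standard_error_of_measurement(sd_obtained, reliability):
    """Relation 5: sigma_e = sigma_x * sqrt(1 - r_xx)."""
    return sd_obtained * math.sqrt(1.0 - reliability)


def lengthening_table(reliability=0.80, sd=20.0, factors=(1, 25, 250)):
    """Rows of (length factor, lengthened reliability, standard error)."""
    rows = []
    for n in factors:
        r_n = spearman_brown(reliability, n)
        rows.append((n, r_n, standard_error_of_measurement(sd, r_n)))
    return rows


if __name__ == "__main__":
    for n, r_n, sem in lengthening_table():
        print(f"{n:>4}X  r = {r_n:.3f}  sigma_e = {sem:.2f}")
    print(f"250000X  r = {spearman_brown(0.80, 250000):.6f}")
```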

This part of the rationale, of course, presents little that is new. At the risk of 
boring our reader we have tried to make explicit what happens to certain 
parameters. We also wanted to point out clearly that the possibility of 
superpsychometrics has been with us for a long time and the foundation of the 
so-called psychometric barrier rests upon problems related to very long tests. 


D. Practical Empirical Solution 


The solution suggested here is not only capable of experimental verification, 
but is practical in its demands on research time and money to develop the test, 
in the administration of the test, and in demands on the individual taking the 
test. 

We wish to develop a research design that permits us to test the primary 
empirical hypothesis that it is possible, now, to develop a psychological test 
whose reliability is exceedingly high by present standards; that is, 


$H_1$: $r_{xx} > .99$. [12] 


A secondary hypothesis is that the new validity coefficient, $r_{x'c}$, will be 
higher than the old validity coefficient, $r_{xc}$; that is, 


$H_2$: $r_{x'c} > r_{xc}$. [13] 


The suggested solution has three stages: I, II, III. 

Assume a pool of 1600 items and that the probability of passing each item is 
known. This probability indicates the “difficulty level” of the items, so that .99 
means that 99% of the general population passed the item and 1% failed it; 
similarly, .01 means that 1% passed the item and 99% failed it. Assume further 
that there are 16 items at each level of difficulty from .01 to .99, and eight items 
beyond each end of these difficulty values. 


1. Stage I 


Make a 25 item test A by selecting an item at the highest and lowest diffi- 
culty level, and select remaining items equally dispersed over the intervening 
difficulty range. Repeat for three more tests B, C, and D. Stage I is composed 
of four parallel tests (A, B, C, D) of, say, 10 minutes each. See Figure 2. 

An individual takes test A, then B, then C, then D. Each test is scored 
immediately by one of several quick scoring techniques. Individual i receives 
four separate probability scores. Calculate the mean of these scores and find 
the average probability associated with this mean score. Let us suppose that the 
stage I composite score has a probability value of .78. 


2. Stage II 


Go back to the item pool and select 100 items, rectangularly distributed 
from .10 below .78 to .10 above .78; that is, .78 ± .10 probability. 
The probability range in stage II is only 1/5 the probability or difficulty 
range of stage I. The number of items attempted is the same for the two stages, 
100 items each. Let us assume a one hour time limit for the stage II test; that 
is, test E, and that i gets a score of .816 on test E. 


3. Stage III 


Let .816 be equivalent to its nearest item probability .82. Build a stage III 
test of 10 items at each difficulty level over a range .05 above and .05 below 
the .82 probability level obtained in stage II. The stage III test, F, is composed 
of 100 items whose difficulty range is .77 to .87 and has, say, a one hour time 
limit. 


[Figure 2 panels: 1600 item pool; stage I (parallel tests), each test 25 items, 
total 100 items, probability range .99; stage II, test of 100 items, total 500 
items, range .20; stage III, test of 100 items, total 1000 items, range .10; 
three stage test, 300 items.] 

FIGURE 2 
DEPICTION OF A MULTISTAGE SEQUENTIAL TEST CONSTRUCTION AND ADMINISTRATION 
RESEARCH DESIGN FOR THE EMPIRICAL TESTING OF THE 
SUPERPSYCHOMETRIC HYPOTHESES $H_1$ AND $H_2$ 

Let us assume an individual, i, gets a score associated with the calculated 
probability level of .8089. 

This practical multistage testing sequence is summarized in Figure 2. The 
test taker has spent about three hours in attempting 300 items. But this is 
equivalent to the same individual having taken all 1600 items or 64 parallel 
tests of 25 items each. He simply has not wasted time on items that are far 
too easy or far too difficult for him. 
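
The bookkeeping behind the three stage sequence can be sketched in Python. The sketch treats item difficulties as whole percentages from 1 to 99 and takes the narrowing factors of 1, 5, and 10 directly from the ranges quoted in the text and in Figure 2 (.99, .20, and .10); the function names, the STAGES constant, and the rounding rule in nearest_level are illustrative assumptions rather than part of the proposed design.

```python
def nearest_level(probability):
    """Nearest whole-percent difficulty level, e.g. .816 -> 82."""
    return int(round(probability * 100))


def window_levels(center, half_width, lowest=1, highest=99):
    """Difficulty levels (percent passing) within half_width of center."""
    start = max(lowest, center - half_width)
    stop = min(highest, center + half_width)
    return list(range(start, stop + 1))


def equivalent_length(items_attempted, narrowing_factor):
    """Items attempted, scaled by how much narrower the range is than stage I."""
    return items_attempted * narrowing_factor


# (stage, items attempted, narrowing factor relative to the full range)
STAGES = [("I", 100, 1), ("II", 100, 5), ("III", 100, 10)]


def total_equivalent_items(stages=STAGES):
    """Sum of equivalent lengths over all stages."""
    return sum(equivalent_length(items, k) for _, items, k in stages)


if __name__ == "__main__":
    stage2 = window_levels(nearest_level(0.78), 10)
    stage3 = window_levels(nearest_level(0.816), 5)
    print("stage II range:", stage2[0], "to", stage2[-1])
    print("stage III range:", stage3[0], "to", stage3[-1])
    total = total_equivalent_items()
    print("equivalent items:", total, "=", total // 25, "parallel tests of 25 items")
```

On these assumptions the 300 items attempted correspond to 100 + 500 + 1000 = 1600 equivalent items, which matches the totals shown in Figure 2 and the figure of 64 parallel tests of 25 items given in the text.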


E. Discussion 


It should be obvious to the reader that practical problems, such as calculating 
the original item difficulties to the third, fourth, or fifth decimal place, the 
number of items of any one subset at any one stage, the total items in the pool, 
the range of item probabilities selected for each stage, and the total number of 
stages are all arbitrary. We have specified them in some detail so that an 
empirical test could actually be made of the two superpsychometric hypotheses, 
$H_1$ and $H_2$. Future empirical research may suggest that in restricting the 
range of items for successive stages one of several measures of variability, such 
as the standard error, probable error, average deviation, etc. of the mean of a 
previous stage, may be a more precise and a more useful guide. 


1. Time Limits 


The question of time limits is also arbitrary. It would be possible to develop 
a combination speed and power test as suggested here. It is obviously relatively 
easy to write a large number of items over the entire intensity or difficulty 
continuum for such fields as verbal and numerical abilities, mathematics, his- 
tory, personality, interests, and all of the sciences. Many large item pools, such 
as those at the Educational Testing Service at Princeton, already exist where 
large numbers of items have been written and their associated probabilities cal- 
culated. And for those areas where they do not exist, they could be written 
and probabilities calculated with a moderate research budget. 


2. Test Format 


What would the test look like? One solution is the typical test booklet. If 20 
items were written per page, our proposed 1600 item testing sequence booklet 
would contain 80 pages, or a test booklet of 40 sheets. All individuals 
would take the same stage I tests, but after they had found their ability or 
intensity levels, subsequent stages would refine the evaluation of that region 
of the trait continuum associated with each particular individual. Something of 
this general notion is contained in testing procedures as far back as the old 
Stanford-Binet intelligence test, in which testers attempt to estimate that region 
of the scale in which an individual will most probably fall and then concentrate 
their efforts on evaluating his responses to test items at that particular region of 
difficulty. 

The new computer technologies and automated electronic psychological 
equipment offer the best solution to practical problems of banking thousands 
of items with their associated probabilities, rapidity of scoring, selection of 
subsequent difficulty regions in the testing sequence, and calculating scores. 
Most practical problems are now obsolete. The only requirement is an adequate 
research budget. 


3. Hypervalidity 
It is interesting to contemplate the old notion of validity as the correlation 
of a test with some criterion C denoted $r_{xc}$. The full implications of the 
Spearman-Brown formula also suggest that the validity of a test rises with the 
increase in test length according to the formula, 


$r_{(nx)c} = \frac{r_{xc}}{\sqrt{\frac{1 - r_{xx}}{n} + r_{xx}}}$. [14] 


For a test of infinite length, relation 14 reduces to, 


$r_{(\infty x)c} = \frac{r_{xc}}{\sqrt{r_{xx}}}$. [15] 

But this kind of validity rises much less rapidly than reliability and may reach 
a finite upper bound well below 1.00. 
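
The slower rise of validity and its finite ceiling can be seen in a small numerical sketch. The Python below assumes the standard form of the lengthened-validity formula, with the original validity divided by the square root of (1 - r_xx)/n + r_xx, and its limit for a test of infinite length. The starting validity of .50 is a hypothetical value chosen for illustration; the reliability of .80 is the one used in the paper's earlier example.

```python
import math


def lengthened_validity(validity, reliability, n):
    """Validity of a test lengthened n times (relation 14)."""
    return validity / math.sqrt((1.0 - reliability) / n + reliability)


def limiting_validity(validity, reliability):
    """Validity of a test of infinite length (relation 15)."""
    return validity / math.sqrt(reliability)


if __name__ == "__main__":
    r_xc, r_xx = 0.50, 0.80
    for n in (1, 25, 250):
        print(f"{n:>4}X  validity = {lengthened_validity(r_xc, r_xx, n):.4f}")
    print(f"limit  validity = {limiting_validity(r_xc, r_xx):.4f}")
```

With these values the validity moves from .50 toward a ceiling of about .56, while the reliability of the same lengthened test approaches 1.00.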

One reason for this is that in the attainment of ultrareliability one is con- 
cerned only with reducing the error variance in one dimension. But in the 
attainment of hypervalidity one is concerned with reducing the error variance 
in two different dimensions and evaluating the strength of the relationship 
existing in nature between the two specified variables; i.e., the test scores and 
the criterion. This older notion of validity is a sort of scientific triptych of 
sources of variance and, therefore, has been a vexingly complex problem. 

Even so, the attainment of ultrareliability in the predictor psychological test 
would reduce one source of error and consequently, in itself, raise validity. 

But there are other and more recent ways of looking at validity that are not 
concerned with some new outside dimension, such as an external criterion. 
Such notions are contained in modern scale theories. When a set of items mak- 
ing up a test are highly homogeneous or when the coefficient of reproducibility 
is very high for a Guttman-type response parallelogram or a Lazarsfeld latent 
dichotomous structure, the old notions of validity as relating to an external 
criterion do not apply. And there is some truth to the statement that, as the 
reliability of the ordered scale structure increases, the validity of the instru- 
ment increases at about the same rate. It may be more scientifically fecund to 
concentrate on ultrareliability with the expectation that hypervalidity will 
follow. This view is tenable if we regard the relationship of test scores with 
some outside criterion as less relevant to the psychological instrument itself and 
more relevant as a way of stating relationships, associations, or laws of nature. 


F. Summary 


The measurement of human characteristics called psychometrics includes 
such diverse things as psychophysical scaling, psychological scaling, psychologi- 
cal tests, multidimensional scaling, and configural analysis. 

1. Psychological measurement seems to have come up against a psychometric 
barrier in the region where correlation coefficients reach but do not exceed .90 
to .95 for reliability and .50 to .70 for validity. 


2. Several theoretical analyses suggest ways of breaching this psychometric 
barrier in order to attain a superpsychometrics whose characteristics are ultrare- 
liability and hypervalidity, as follows: (a) the use of many low predictor 
entities, (b) the use of a few high predictor entities, (c) the further analysis 
and utilization of intratests, (d) pattern and profile analyses, (e) notions 
derived from classical test theory. The notions mentioned in point (e) are 
further subdivided as 1) repeated testing of the same individual to produce a 
mean and standard error of that mean that would yield a highly precise esti- 
mate of an individual’s true score with very narrow fiducial limits of error, and 
2) greatly increasing the length of a test so that the coefficient of reliability 
becomes exceedingly high. 

3. A practical empirical solution for the attainment—now—of superpsy- 
chometrics is given which has three essential characteristics: (a) An assessment 
procedure which is characterized by multistage sequential testing. (b) Concen- 
tration of the evaluative effort on that difficulty or intensity region of the scale 
within which an individual’s ability or attributes most probably lie. (c) A test 
that looks different from present tests; i.e., because of the number of items 
in the assessment pool it would be a relatively thick booklet, or a test-taking 
console connected to an automated electronic and computerized test item 
selector, presenter, scorer, and sequencer. 

4. Hypervalidity presents a special problem and is considered from two 
approaches: (a) Classical validity conceived as the ability of test scores to 
predict an external criterion is better viewed as a relationship, association, or 
law of nature. (b) Reliability and validity considered as internal problems 
of the testing instrument itself, with the probable consequences that the attain- 
ment of ultrareliability would necessarily entail hypervalidity. 
A STUDY OF THE USE OF THE SEMANTIC DIFFERENTIAL 
BY FIFTH GRADE CHILDREN FROM DIFFERENT 
SOCIOECONOMIC GROUPS* 


Eastern Michigan University 


Warren S. Williams 


A. Introduction 


The semantic differential (SD) rating technique offers some attractive 
advantages for the researcher interested in assessing the meaning of concepts 
to subjects. Among its numerous uses, the SD permits the comparison of the 
meaning of concepts to members of different groups: be they political groups 
(8), groups representing different nationalities (3, 5, 6, 7, 9, 10), or members 
of different socioeconomic status (SES) classes (1, 14, 15). 

The utilization of the SD rating technique to make between-group com- 
parisons is based on the assumption that the different groups use the rating 
instrument in a similar fashion. That is, that the differences between the 
responses of two or more groups of Ss can be attributed to variance in the 
meaning of the concepts rated and not to differences in the use of the rating 
system. 

In fact, however, there are several possible sources of between-group 
difference in responses to the SD (2). Aside from variance attributable to 
random error, other possible sources of between-group variance include (a) 
differences in the meaning of the concepts being rated (i.e., true variance); 
(b) differences in the responses to the rating technique itself (e.g., different 
response sets); (c) differences in the meaning of the terms that define the 
rating scales. 

The factor analytic procedure used as a component of most SD analyses 
represents, in part, an attempt to deal with the last of these sources of variance. 
Typically, one factor analysis is conducted for each group of Ss participating 
in the study. The data for these analyses consist of a correlation matrix 
generated by summing over both Ss and concepts to determine which bipolar 
adjectival rating scales cluster together. The results of the factor analyses are 
then used to define the dimensions of meaning for the concepts rated by the 
Ss participating in the study. If the SD responses of two or more groups are 
to be compared, similar factor structures for the two groups are interpreted as 
suggesting similar use of the rating scales by the different groups of Ss. 

Unfortunately, this conclusion is not entirely justified by the data analysis 
technique employed. While similar results from the factor analyses suggest 
that the same sets of rating scales cluster together for the groups being com- 
pared, these analyses are not always sensitive to consistent differences in the 
use of these scales within each factor. For example, this approach may not be 
sensitive to (a) response sets within one or more of the factors identified or 
(b) consistent differences in the meaning of the terms that define the rating 
scales for an entire factor (i.e., consistent between-group differences in the 
meanings of all terms that define one dimension of meaning). Just because 
the scales which most often define a dimension of meaning have similar 
factor weights for each of two groups of Ss, one cannot assume that these 
scales are being used in the same manner and have the same meaning to the 
Ss. It is possible that the terms that define any dimension of meaning connote 
different things to the two groups of Ss. While the words good, fair, clean, 
and beautiful might “go together” for both groups, these clusters may have 
different connotative meanings to each of the groups doing the ratings. The 
investigator still must assume that the Ss responded similarly to (a) the 
terms that define the rating scales and (b) the SD technique itself. 

The present study was designed to examine these assumptions. Its purpose 
was to determine if these potential sources of variance would result in 
between-group differences in the responses of children from two different 
SES groups to an SD instrument. No attempt was made, however, to 
differentiate between the relative effect of violations of either of the two 
assumptions. 


B. Method 
1. Subjects 


Two groups of Ss, a low and a middle SES sample, participated in the 
study. The low SES sample was chosen from the students in five classes 
drawn randomly from the fifth grade classes in an inner city school. The 
middle SES sample was selected from the students in five classes in a school 
located in a middle SES neighborhood in the same large city. 

Warner’s (12) Index of Status Characteristics (ISC) was computed for 
the children in the selected classes. Information was collected about the 
father’s and mother’s occupation for each S, while house type and dwelling 
area were determined through direct observation by the researcher. If an S’s 
SES was in question, he was dropped from the sample. 

The low SES sample consisted of 83 students; the middle SES sample 
comprised 98 students. 


2. Measuring Instrument 


An SD rating instrument was constructed that was sensitive to (a) 
between-group differences in the use of the SD rating technique and (b) 
differences in the meaning of the terms that define the most frequently used 
rating scales. This instrument consisted of a set of 24 concepts to be rated 
on 12 bipolar adjectival rating scales. The terms that defined one pole of 
every rating scale also served as one of the concepts to be rated. For example, 
one of the scales was good-bad. The terms “good” and “bad” also appeared 
as two of the concepts to be rated. Between-group differences in the ratings 
of such concepts as “good” and “bad” would suggest differences in the 
meaning of the terms that define the rating scales, differences in the use 
of the SD rating instrument by the two groups of Ss, or both. 

The 12 scales (which defined not only the rating scales but also the 24 
concepts to be rated) were those most heavily weighted on the three dimen- 
sions of meaning in Osgood’s (8) early work with the SD. These were 
good-bad, beautiful-ugly, fair-unfair, clean-dirty, strong-weak, sharp-dull, 
large-small, hard-soft, heavy-light, fast-slow, hot-cold, and moving-still. The 
scales were arranged so that items from different factors were interchanged, 
and every second scale was reversed so that the negative side of the reversed 
scale appeared on the left. As suggested for use with children, five position 
rating scales were constructed. 
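
The construction rules just described can be sketched as follows. The 12 scale pairs are those named above; their grouping into three factors, the round-robin ordering, and the function names are assumptions made for illustration, since the article does not give the exact layout of the rating sheet.

```python
# Scale pairs as (positive pole, negative pole). The assignment of pairs to
# factors is an assumption based on the usual evaluative/potency/activity grouping.
FACTORS = {
    "evaluative": [("good", "bad"), ("beautiful", "ugly"),
                   ("fair", "unfair"), ("clean", "dirty")],
    "potency": [("strong", "weak"), ("large", "small"),
                ("hard", "soft"), ("heavy", "light")],
    "activity": [("fast", "slow"), ("sharp", "dull"),
                 ("hot", "cold"), ("moving", "still")],
}


def build_scales(factors):
    """Alternate scales across factors and reverse every second scale."""
    interleaved = [pair for group in zip(*factors.values()) for pair in group]
    layout = []
    for i, (positive, negative) in enumerate(interleaved):
        # Odd positions are reversed so the negative pole appears on the left.
        layout.append((negative, positive) if i % 2 == 1 else (positive, negative))
    return layout


def build_concepts(factors):
    """Every pole of every scale also serves as a concept to be rated."""
    return [term for pairs in factors.values() for pair in pairs for term in pair]


scales = build_scales(FACTORS)
concepts = build_concepts(FACTORS)
print(len(scales), len(concepts), len(scales) * len(concepts))  # 12 24 288
```

With 24 concepts each rated on 12 five-position scales, each child supplies 288 ratings.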

To reduce the possible confounding effects of differences in reading ability, 
the instrument was read to the Ss. 

3. Analysis of Data 

The S’s scores on each concept were determined by assigning a numerical 
value to each alternative in the SD scale. The numerical scores were then 
used in the following analyses: 

1. Two factor analyses were conducted, one for each of the two SES 
groups. The function of these factor analyses was to determine the dimen- 
sions of semantic space for the two groups of Ss. The analyses provided in- 
formation on (a) the dimensions that define the semantic space of each 
socioeconomic group and (b) the generality of the semantic space of the two 
groups’ ratings of these concepts. 

The data for each factor analysis consisted of the matrix of intercorrela- 
tions between scales. By summing over both Ss and concepts, two 12 x 12 
matrices of intercorrelations of each scale with every other scale were gen- 
erated (one matrix for each SES group). Each matrix then served as the 
data for a principal axis factor analysis (13). Each factor matrix was 
rotated with use of the varimax (4) criterion. 

2. The unweighted means and standard deviations of the distribution of 
scores on each factor were obtained for each concept and for each group. 
Each S’s score on each factor was computed for every concept. This was 
accomplished by averaging his scores on the scales found to be most heavily 
weighted on each factor. 
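
The unweighted factor score in step 2 amounts to a simple average, sketched below. The numerical coding (positions 1 to 5 counted from the left, rescored so that 5 lies nearest the positive pole), the choice of which scales are reversed, and the sample ratings are all assumptions for illustration; the article says only that a numerical value was assigned to each alternative.

```python
from statistics import mean


def pole_score(position, negative_on_left, n_positions=5):
    """Rescore a marked position so that higher values lie nearer the positive pole."""
    return position if negative_on_left else n_positions + 1 - position


def factor_score(ratings, factor_scales, reversed_scales):
    """Unweighted mean of one S's rescored ratings on the scales defining a factor."""
    return mean(pole_score(ratings[s], s in reversed_scales) for s in factor_scales)


EVALUATIVE = ["good-bad", "beautiful-ugly", "clean-dirty", "fair-unfair"]
REVERSED = {"beautiful-ugly", "fair-unfair"}  # hypothetical reversed scales

# One hypothetical S rating one concept; values are positions counted from the left.
ratings = {"good-bad": 1, "beautiful-ugly": 5, "clean-dirty": 2, "fair-unfair": 4}
print(factor_score(ratings, EVALUATIVE, REVERSED))  # 4.5
```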

3. Two multiple discriminant analyses were conducted, one for each of 
the factors identified in the factor analyses. The first multiple discriminant 
analysis used the unweighted average of Ss’ scores on the scales most heavily 
weighted on factor one. Average scores for Ss in each of the SES groups on 
each of the concepts served as the data for these analyses. An F ratio as- 
sociated with a p value equal to or less than .05 was established as the level 
of rejection for each hypothesis. Similar analyses were performed for the 
second factor identified in the factor analyses. 


C. Results and Discussion 
1. Factor Analyses 


The results of the two factor analyses are summarized in Table 1. 

For both groups, the scales most heavily weighted on factor I could be 
termed “evaluative.” For purposes of this study, each S’s score on the evalua- 
tive dimension was defined as his average score on the good-bad, beautiful- 
ugly, clean-dirty, and fair-unfair scales. 

Factor II was also clearly defined for the two groups of Ss. The scales 
most heavily weighted on this factor, small-large, light-heavy, soft-hard, and 
weak-strong, suggested a potency dimension. 

A third factor was not as readily identified. The factor analysis for the 
low SES group yielded a heavy loading of only one scale on this factor, 
which was considered a residual factor of no consequence to this study. The 
factor analysis of the middle SES group’s responses yielded a third 
factor defined by scales that connote activity. These are the fast-slow, 

TABLE 2 
TUCKER’S COEFFICIENT OF CONGRUENCE 

Factor       I       II      III 
I          .963    .153   -.254 
II         .077    .961    .138 
III        .570   -.342   -.724 


tion; factors I and II were similar for the two groups, factor III was not. 
It was decided to compare the ratings of the two groups in a two-dimensional 
semantic space. 
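
For readers unfamiliar with the index in Table 2, the coefficient of congruence between two columns of factor loadings is the sum of their cross-products divided by the square root of the product of their sums of squares. The sketch below uses hypothetical loadings; the values are not taken from this study, and the function name is illustrative.

```python
import math


def congruence(a, b):
    """Coefficient of congruence between two loading vectors of equal length."""
    cross = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return cross / norm


# Hypothetical loadings of four scales on one factor in each of two groups.
group_a = [0.80, 0.75, 0.70, 0.10]
group_b = [0.78, 0.70, 0.72, 0.20]
print(round(congruence(group_a, group_b), 3))
```

Values near 1.00 (or -1.00, if a factor is reflected) indicate that the same scales define the factor in both groups; values near zero indicate unrelated factors.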


2. Multiple Discriminant Analyses 


Two multiple discriminant analyses were conducted, one analysis for each 
of the congruent factors. The data for the first analysis were the unweighted 
average of the Ss’ scores on the four scales heavily weighted on the evalua- 
tive factor (fair-unfair, beautiful-ugly, clean-dirty, and good-bad). The 
second analysis was based on the data derived from the scales most heavily 
weighted on the potency factor (small-large, light-heavy, soft-hard, and 
weak-strong). 

The Ss’ mean score on each factor (sometimes referred to as a “factor 
score”) for each concept served as the data for the multiple discriminant 
analyses. The null hypotheses were the following: (a) There is no difference 
between the unweighted mean scores of Ss from the two SES groups on the 
evaluative factor. (b) There is no difference between the unweighted mean 
scores of Ss from the two SES groups on the potency factor. 

The overall F of between-group differences on the evaluative factor was 
1.68, associated with a probability of .032 for 24 and 156 degrees of freedom. 
The overall F for the potency dimension was 1.32, associated with a p value 
of .158. 
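
The reported degrees of freedom can be checked with a little arithmetic. The sketch assumes the overall F is the usual two-group multivariate test (the F transform of Hotelling's T-squared), which has p and N - p - 1 degrees of freedom for p variables and N subjects; the article does not state its formula, so this is an inference. The function names are illustrative.

```python
def two_group_df(n1, n2, n_variables):
    """Degrees of freedom of the overall F for a two-group multivariate comparison."""
    return n_variables, n1 + n2 - n_variables - 1


def reject_null(p_value, alpha=0.05):
    """Decision rule used in the study: reject when p is at or below alpha."""
    return p_value <= alpha


# 83 low SES and 98 middle SES children, 24 rated concepts per factor.
print(two_group_df(83, 98, 24))                # (24, 156)
print(reject_null(0.032), reject_null(0.158))  # True False
```

Under that assumption the sample sizes and the number of concepts reproduce the 24 and 156 degrees of freedom given in the text.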

These findings suggest differences in the ratings given these concepts by 
the two groups. In the case of the most salient factor, the evaluative dimen- 
sion, these differences are sufficient to support rejection of the null hypothesis. 
In the case of the potency dimension these differences do not reach the .05 
level which was established as the level of rejection for each hypothesis. 

The finding of differences in the rating of terms that define a dimension of 
semantic space should be of interest to researchers using the SD technique 
for between-group comparisons. Intuitively, one would expect that the terms 
that define congruent factors should have similar meanings to different groups 
of Ss. While this may be so (the findings may be attributable to between- 
group differences in use of the SD rating technique), the validity of this 
assumption has been questioned. 

While it is possible that the differences in the use of the SD instrument 
which were identified in the current samples will not be found in different 
cultural, religious, or political groups, the assumption that members of dif- 
ferent groups respond similarly to the SD instrument and to the terms that 
define its rating scales deserves continued study. 


D. Summary and Conclusions 


The SD rating technique is widely used to compare the connotative 
meaning of concepts to members of different groups. It is generally assumed 
in these studies that differences in the responses of Ss from different groups 
reflect variance attributable to differences in the meaning of the concepts 
rated, not differences in the meaning of the terms that define the rating 
scales or differences in responses to the SD technique itself. 

The present study was designed to investigate these assumptions. 

One hundred eighty-one lower and middle SES fifth grade children were 
asked to rate 24 concepts on 12 bipolar adjectival scales. The concepts to be 
rated consisted of each of the 24 adjectives that defined the 12 bipolar scales. 

The data from each group were factor analyzed, and two factors extracted 
for each group: the evaluative and potency factors. Ss’ ratings of concepts on 
these factors were compared by means of the multiple discriminant analysis 
technique. Between-group differences were identified in the ratings on the 
most salient factor, the evaluative dimension (p = .03). Differences on the 
potency dimension were not sufficient to permit rejection of the null hypoth- 
esis (p = .16). 

The finding of between-group differences in the rating of the terms that 
define some of the most widely used rating scales suggests that the validity 
of the assumptions underlying the use of the SD for between-group compari- 
sons merits continued study. 


REFERENCES 


1. Borowicz, N. A semantic differential study between lower-class and middle- 
class pupils in the seventh and twelfth grades. Unpublished Doctoral disserta- 
tion, Pennsylvania State University, University Park, 1969. 

2. Heise, D. Some methodological issues in semantic differential research. Psychol. 
Bull., 1969, 72, 406-422. 

3. Jakobovits, L. Comparative psycholinguistics in the study of cultures. Internat. 
J. Psychol., 1966, 1, 15-37. 

4. Kaiser, H. The varimax criterion for analytic rotation in factor analyses. 
Psychometrika, 1958, 23, 187-200. 

5. Miron, M. A cross-linguistic investigation of phonetic symbolism. J. Abn. & 
Soc. Psychol., 1961, 62, 623-630. 

6. Osgood, C. Studies on the generality of affective meaning systems. Amer. Psy- 
chol., 1962, 17, 10-28. 

7. Osgood, C. Semantic differential technique in the comparative study of cultures. 
Amer. Anthrop., 1964, 66, 171-200. 

8. Osgood, C., Suci, G., & Tannenbaum, P. The Measurement of Meaning. 
Urbana: Univ. Illinois Press, 1957. 

9. Suci, G. A comparison of semantic structures in American Southwest culture 
groups. J. Abn. & Soc. Psychol., 1960, 61, 25-30. 

10. Tanaka, Y., Oyama, T., & Osgood, C. A cross-culture and cross-concept study 
of the generality of semantic spaces. J. Verb. Learn. & Verb. Behav., 1963, 2, 
392-405. 

11. Tucker, L. A method for synthesis of factor analysis studies. Personnel Re- 
search Section Report No. 984, Department of the Army, Washington, D.C., 
1951. 

12. Warner, W., Meeker, M., & Eells, K. Social Class in America: A Manual of 
Procedure for the Measurement of Social Status. Chicago: Sci. Res. Assoc., 
1949. 

13. Wilkinson, J. The Algebraic Eigenvalue Problem. London: Clarendon, 1965. 

14. Williams, W. Measuring the meaning of personality test items to fifth grade 
children from different socioeconomic groups. Unpublished Doctoral disserta- 
tion, University of Rochester, Rochester, New York, 1969. 

15. Williams, W. A semantic differential study of the meaning of personality test items 
to children from different socioeconomic groups. J. of Psychol., 1971, 79, 179- 
188. 


Department of Educational Psychology 
Eastern Michigan University 
Ypsilanti, Michigan 48197 


Published as a separate and in The Journal of Psychology, 1972, 81, 351-360. 


INDICATIONS OF PSYCHOPATHOLOGY IN MALE 
NARCOTIC ABUSERS, THEIR EFFECTS 
AND RELATION TO TREATMENT 
EFFECTIVENESS* * 


Research Division, Central Islip State Hospital, Central Islip, New York 


Charles Sheppard, John Fracchia, Elizabeth Ricca, 
and Sidney Merlis 


A. INTRODUCTION 


As the incidence and prevalence of illicit drug use increase, more people 
from diverse professional persuasions become involved in mediating solutions 
to the problem. In the main, drug abuse is recognized and accepted as a psy- 
chosocial condition having physiological components. But treatment of this 
condition remains costly and largely ineffective (4). This is perplexing and 
generally dissatisfying to everyone concerned. One reason for this apparent 
ineffectiveness could possibly be traced to an inadequate delineation of the 
natural history of drug abuse as a psychosocial condition. Logically, this 
understanding would be a necessary first step toward developing a rational 
approach to treatment. 

There are at least four psychosocial hypotheses, beyond drug availability, 
explaining why a person becomes exposed to and involved with drugs. Some 
people become addicted accidentally. In general, these are usually not re- 
fractory cases nor a major source of treatment difficulties. Peer pressure may 
account for some of the variance explaining how a person gets started with 
drugs. Sensation seeking has been offered as another possibility. The concept 
of the addiction-prone personality has been suggested as a reason for drug 
abuse. Some may abuse drugs because of concurrent psychopathology. If the 
latter is a relevant determinant of drug use, it has to be taken into account 
because it should have meaningful treatment implications. 

The purpose of this communication is to focus on the incidence, type, and 
degree of psychopathology in a large sample of male narcotic addicts. An 
early 1900s view considered addicts as degenerates, liars, thieves, and habitual 
criminals (1). Pescor (14) reported 880 of 1000 treated addicts to be 
“sociopathic” which tends to coincide with the view represented above. More 
objective data, represented in terms of Minnesota Multiphasic Personality 
Inventory (MMPI) report, were presented by Hill, Haertzen, and Davis 
(8) and Hill, Haertzen, and Glaser (9). Their oblique factor solution 
identified five possible clinically relevant subgroups. The first group was 
designated as undifferentiated “sociopath,” accounting for 31 percent of the 
sample differences. The second factor identified two other subgroups different 
from the undifferentiated sociopath and different from each other. These 
“primary and secondary sociopathic” patient types accounted for 24 percent 
of the differences among the sample tested. The third factor extracted is less 
clear, but retains the influence of characterizing the addict as a “sociopath.” 
In spite of the elevations on what are considered to be the psychotic indicators 
on the MMPI, the two subgroups isolated by factor three were characterized 
as “schizoid sociopath” and “neurotic sociopath-hypomanic type.” This factor 
accounted for 15 percent of the total explained variance. The following points 
develop from the information presented. If addicts are sociopathic personality 
types as clinically and psychometrically indicated, it is possible to subclassify 
sociopathic personalities into at least three distinct clinical types. Lastly, we 
would hypothesize on the basis of factor three that not all addicts are 
sociopaths, and factor three is not labeled appropriately. But, it can be 
estimated that about 54 percent of the Hill et al. (8, 9) addict sample were 
“sociopathic.” This is lower than the 88 percent reported by Pescor (14). 
These data may represent either sample, trend, or sample and trend dif- 
ferences in terms of the type of person becoming addicted to drugs. Olson 
(12) reported approximately 50 percent of his sample of addicts as sociopathic 
from MMPI report. Sheppard, Lemmer, O'Neill, and Merlis (15), also 
using MMPI diagnostic criteria, reported 29 percent of their sample as 
“sociopathic.” Of the three studies using the MMPI report, the Hill et al. 
(8, 9) data were collected at the National Institute of Mental Health drug 
treatment hospitals, Olson’s (12) data were drawn from a California state 
hospital, and those of Sheppard et al. (15) from a New York state hospital 
with narcotic treatment units. It is possible that addicts seen at these installa- 
tions were different subsamples. But it is clear that the sociopathic personality 
is not an all inclusive diagnostic category for addict patients. 

The incidence of neurotic disorders in addict samples ranges between 18 
percent as reported by Olson (12) to 21 percent as reported by Sheppard 
et al. (15). This demonstrates a close correspondence between the California 
and New York samples reported relatively closely in time. But, there was a 
difference in incidence of psychotic-like MMPI profiles between the two 
samples. Olson (12) reported 16 percent, whereas Sheppard et al. (15) re- 
ported 35 percent. More recently Greaves (6) indicated a high percentage 
of incipient psychosis in a sample of drug addicts which was considered to be 
present prior to drug use. 

Although there has been apparent reluctance to view some addicts as pos- 
sible schizotypics, incipient psychotics, or schizophrenics, this approach may 
have some merit. Glover (5), for example, has hypothesized that drug addic- 
tion may in some patients serve to control sadistic feelings, thereby protecting 
the addict from a paranoid type of reaction. 

Several competing hypotheses become apparent from the data above. Ad- 
dicts reflect measurable psychopathology. No one personality or psychopatho- 
logical type explains all addicts. Are the differences in personality types 
reported due to sample differences, or do they represent changes in the trend 
of the type of person becoming addicted to drugs? If each of the above is 
supported, this type of information should have clinical relevance regarding 
treatment methods. 

There has yet to be one article that attempts to provide the type of MMPI 
data that offers detailed indication of the broad variations in pathology seen 
in a large sample of addicts. Most of the current literature has provided mean 
MMPI profiles of addict samples. Mean profiles of heterogeneous groups 
tend to obscure individual differences. It is our belief that when more detailed 
data accrue in sufficient quantities, studies can be designed to evaluate the 
effectiveness of different treatment programs within each addict subgroup. 
With this information available, addicts may be assigned to programs that 
will maximize treatment effects. The first step toward approximating this 
goal requires defining patient groups accurately. 


B. METHOD 


Data were gathered on 336 male narcotic users who were committed to 
a special treatment unit at Central Islip State Hospital by a probate court 
procedure for a 90-day treatment period. All were free from drug influence 
when tested. In an effort to ensure validity of response to a short social 
history form and the MMPI, participation was left to the patients’ discre- 
tion. Of the patients admitted during the course of this study, 95 percent 
participated. 

There were no deviations from the standard instructions for administering 
the MMPI. No time limit was imposed. All patients were tested in group 
sessions which had adequate proctoring assistance. Questions reflecting mis- 
understanding of instructions were met with a restatement of the standard 
instructions. Questions directed at seeking support or clues to preferred an- 


approach was not always viewed as satisfactory by the patient, it was certainly 
successful in practically eliminating further questions. The preponderance 
of patients tested accepted the testing procedure and were interested in re- 


deviation of 2.2 years. Forty-nine percent of the patients had been employed 
in unskilled, 26 percent in semiskilled, 13 percent in skilled, 10 percent in 
clerical or sales, and two percent in professional occupations. Marital status 
included 51 percent bachelors, 36 percent married, 10 percent separated, two 
percent divorced, and one percent widowed. 

All MMPI data were machine scored, and the resulting profiles typed 
according to the Dahlstrom and Welsh (2) coding system. MMPI self- 


comparison of batches of data collected at varying institutions at different 
times. Certain differences in reporting MMPI code types were exercised 


For example, if a patient’s profile code was 420’, diagnostic consideration was 
given to the 42’ elevation rather than 420’. With these qualifications in 
mind, attention is directed at the data. 
confirmed by review of Table 1. Data reported in one, two, and three ele- 
vated MMPI scale codes demonstrate that six percent of this sample were 
without measurable pathology. A methadone maintenance program with some 
vocational counseling and social service support may be sufficient for these 
patients to reorganize and redirect their lives according to some socialized 
goals. 

One hundred and twelve protocols, 33 percent, indicated sociopathic per- 
sonality types. But, there were definite subgroups within this sample of 112 
patients. There were 21 patients who had spiked four (4’) codes which Hill 
et al. (8, 9) show are similar to the MMPI profiles of convicted criminals. 
There were 34 patients whose protocols indicated primary sociopathic per- 
sonality characteristics. Twenty-four additional protocols suggested some 
variant of the 49’ primary sociopath, but reflected some defensive operations 
to contain feelings of anxiety as in binding anxiety in somatic concerns, 
compulsive or phobic symptoms, or in depression and denial mechanisms. 
Lastly, within the sociopathic sample was a group of secondary sociopathic 
personality types. The 42’ profile types and the variants of this, seen in Table 
1, are often reported in alcoholic samples. There were 32 of these profile 
types. There is some support for the generalizability of three different socio- 
pathic personality types seen in the Hill et al. (8, 9) studies. Treatment 
procedures and goals for these patients must focus on different aspects of 
the patients’ functioning and have different initial expectations than other 
patient subgroups. The antisocial behavior patterns in these patients are not 
simply a function of a life style adopted with drug use. It is hypothesized that 
drug use is an expression of the sociopathic life style. Therefore, we might 
expect different employment records, criminal activities, continued illicit drug 
use, and resistance to therapeutic intervention in this subgroup. Sensation 
seeking behavior may be high in this group of patients. Treatment programs 
will have to be adapted in accordance with the considerations suggested above. 
In spite of creative efforts on the part of gifted therapists, crisis intervention 
with legal support added to chemotherapy may be the most realistic initial 
treatment approach with sociopathic heroin addicts. 

One hundred and twenty-three protocols, 36 percent, had elevations on 
the psychotic portion of the MMPI. One hundred and eight of these pro- 
tocols, 32 percent, suggested consideration of underlying schizotypic, incipient 
schizophrenic, or overt schizophrenic behavior. The remaining 15 protocols, 
four percent, indicated possible affective psychotic trends. Glover’s (5) hy- 
pothesis that drug addiction serves to control sadism and protects the 
addict from a possible paranoid schizophrenic break obtains some indirect 
support from these data. MMPI profile codes representative of paranoia, 
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TABLE 1 ‘ 
MMPI Protocot Cope Types INDICATING HETEROGENEITY IN TYPE AND INCIDENCE OF 
MEASURABLE PsyCHOPATHOLOGY, T SCORE OF 70 OR ABOVE, FOR THIS SAMPLE 
oF 336 MALE Narcotic AppIcTs 


Protocol code points N % Frequency of cases 


No measurable pathology 1 6 
Sociopathy 112 33 
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SS 
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Schizotypic, incipient schizo- 
phrenic, or schizophrenia 108 32 
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Paranoia or paranoid state 5 2 
46 


465 
467 
469 


pete 
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TABLE 1 (continued) 
Protocol code points N %o Frequency of cases 


Personality disorders 17 5 
43 
45 
47 
426 


NH Aw 


Affective psychoses 15 4 
9 
831 

Organic involvement 7 2 


29 
290 


Heep 


Neurosis 53 16 
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QQ 
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paranoid state, or paranoid schizophrenia equaled 79 cases or 24 percent of 
this group. The data above support the hypothesis that some portion of a 
drug addict sample have severe psychopathological disturbances, some of 
psychotic proportions. Therapy with these patients should be supportive and 
directed toward reversing their tendency to withdraw. This type of informa- 
tion when applied to appropriate cases may enable the therapist and patient 
to gain some insight into one possible source of their addictive behavior. A 
drug maintenance program will not be sufficient for these patients. Possibly 
an attempt to understand the patient at this level may have a significant 
influence in enabling him to attempt to put order and meaning into his life. 

Fifty-three protocols, 16 percent, indicated neurotic disturbances. Within
the neurotic subgroup, 32 patients had 274’ profile types. Again, this profile
type is often encountered in alcoholic samples. Symptoms of anxiety, tension,
and depression brought about by feelings of inferiority and inadequacy may
account for some of the reasons for drug use. Traditional therapeutic
procedures, in addition to some form of maintenance chemotherapy, may be as
effective as therapy with nonaddicted neurotic patients.

Seven protocols, two percent, were indicative of organic damage as the 
overriding diagnostic consideration. Were this verified neurologically, these 
patients would require supportive therapy with extensive rehabilitative coun- 
seling and training, as well as maintenance chemotherapy. The remaining 
data in Table 1 suggested personality or other character disturbance. Clearly, 
heroin addicts represent a diverse group in terms of type and degree of 
psychopathology. It is unlikely that one treatment method will be equally 
effective in all patient groups. Also, treatment goals and criteria of
effectiveness must be different for each subgroup.
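
The subgroup percentages reported in this section can be rechecked from the category counts. The following is a minimal sketch, not part of the original analysis: it takes the counts as read from Table 1 and assumes that the categories are mutually exclusive and exhaustive, so that the number of protocols without measurable pathology is simply the remainder of the 336 cases. The variable and function names are illustrative only.

```python
# Category counts as read from Table 1 (336 male narcotic addicts).
TOTAL = 336

counts = {
    "Sociopathy": 112,
    "Schizotypic, incipient schizophrenic, or schizophrenia": 108,
    "Neurosis": 53,
    "Personality disorders": 17,
    "Affective psychoses": 15,
    "Organic involvement": 7,
    "Paranoia or paranoid state": 5,
}

# Assumption: the categories are mutually exclusive and exhaustive, so the
# protocols without measurable pathology are whatever remains of the total.
counts["No measurable pathology"] = TOTAL - sum(counts.values())


def share(n, total=TOTAL):
    """Return n as a percentage of total, rounded to one decimal place."""
    return round(100 * n / total, 1)


shares = {label: share(n) for label, n in counts.items()}

# Protocols with elevations on the psychotic portion of the MMPI:
# the schizophrenic-spectrum group plus the affective psychoses.
psychotic = (counts["Schizotypic, incipient schizophrenic, or schizophrenia"]
             + counts["Affective psychoses"])

# Protocols with any measurable pathology.
with_pathology = TOTAL - counts["No measurable pathology"]

# Print the categories from largest to smallest.
for label, n in sorted(counts.items(), key=lambda kv: -kv[1]):
    print(f"{label:<56}{n:>4}{shares[label]:>7.1f}")
print(f"Psychotic portion: {psychotic} ({share(psychotic)}%)")
print(f"Measurable pathology: {with_pathology} ({share(with_pathology)}%)")
```

Because the sketch reports shares to one decimal place, a figure may differ by a point from the whole-number percentages given in the text and table, which appear to have been rounded by hand.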


D. Discussion 


The Joint Information Service of the American Psychiatric Association 
and the National Association for Mental Health recently published an evalu- 
ation of nine drug abuse treatment programs located in various geographic 
areas in the United States. The nine programs differed in philosophy and
treatment methods. None of these programs was considered
outstandingly successful (4). Part of this ineffectiveness can be
attributed to a lack of understanding of the natural history of drug abuse as 
a psychosocial condition. Because of this lack of meaningful information, we 
are unable to assign addict patients to a treatment that will be maximally 
effective. 

The data of this report support the notion that this end is achievable. 
Addicts are a heterogeneous psychopathological patient group. Some portion 
of these patients are sociopathic. In general, traditional therapeutic methods 
are ineffective with sociopathic personalities (7). At best we can only expect 
that these traditional treatment methods will be equally ineffective when 
applied to sociopathic heroin addicts. This should mean that treatment
methods, initial treatment plans and expectations, and total therapeutic goals
for sociopathic patients must be different from those for other patient groups
in order to be realistic. For example, we might expect greater continued
criminal activity and/or illicit drug use in sociopathic patients with “spiked
four” MMPI protocols initially defined by Hill et al. (8, 9). Lastly,
sociopathic heroin addicts can be subdivided into three groups which may differ
in counseling readiness, attrition rates, and prognosis. Evaluation of treatment
effectiveness must be made with regard to specific end points tailored to the
characteristics of each of these clinical groups.


A second large portion of this sample, which represents a generally
refractory treatment group, is the schizotypic, incipient schizophrenic, overt
schizophrenic, or psychotic heroin addicts. Obviously, focusing on the
drug addiction in this patient group will not be sufficient, especially if the
choice of heroin abuse was developed as a means of coping with a basic
withdrawal, defensive operation, or protection from sadistic urges or feelings [as
suggested by Glover (5) and which may appear plausible for some of the
patients in this study]. Olson (12), Sheppard et al. (15), O'Neill et al.
(13), and most recently Greaves (6) reported a high percentage of incipient
psychosis in a sample of drug addicts. Greaves (6) considered the psychotic
process to be active prior to drug use. Programs servicing large numbers of
this type of patient and built primarily on methadone maintenance will
probably have less success than chemotherapeutic programs for psychotic patients
who are not heroin addicts; May (11) has argued that chemotherapy should be
the treatment of choice for schizophrenic patients.

Low recidivism rates, high employment, and a low incidence of criminal
activity and illicit drug abuse may best be expected among the neurotic
addicts and the addicts without measurable pathology. Olson (12) reported
18 percent of his sample, and Sheppard et al. (15) 21 percent, as neurotic 
drug abusers. This group may show greater readiness for counseling and be 
reached more effectively by traditional therapeutic approaches. It is generally 
agreed that “talking therapy” is most effective in neurotic and minimally 
disturbed patients. 

The question of the addiction-prone personality gains little direct support.
Two MMPI code types, the 42’ and 274’ codes, which appear frequently in
alcoholic samples, were present in this heroin addict sample. While there was
some overlap between alcoholic and drug addict samples, the greater weight
of evidence fails to support an addiction-prone personality type. Were there
such a type, greater similarity in MMPI profiles would have been found in this
sample. This was not apparent at the level of analysis employed with these
data.

E. Summary 

The Minnesota Multiphasic Personality Inventory Report was admin- 
istered to 336 male narcotic addicts admitted to a special treatment unit at 
Central Islip State Hospital. Ninety-four percent of these patients had 
measurable pathology as reflected in T scores of 70 or above. The data 
supported the hypothesis that addicts form a heterogeneous psychopathological 
sample. Specific diagnostic subgroups can be identified which should require 
different treatment modalities. Each of these diagnostic groups has base-rate
expectations of treatment effectiveness when addiction is not an attendant
difficulty. Comparisons of treatment effectiveness of addict subgroups should
be made with regard to these base-rate data, rather than to some more global
criteria for a total addict sample. It may in fact be unrealistic, for example,
to expect a treatment program for schizophrenic addicts to be more efficacious
than a treatment program for schizophrenic nonaddicted patients, which is in
fact one error we continue to commit. Hypotheses of this type are readily
tested.
R. Testin, anp A, Ronents 
Longitudinal survey of chjld Rorschach responses: Older subjects aged 10 to 16 years—L. B. Ams 
. ‘Twenty years of shock therapy in America, 1937-1956: An annotated bibliography—N. H. Pronxo, R. Sirrenty, 


~ 


we 


K, Banc 
The quantitative analysis of parent behavior toward psychotic children and their siblings—E, M. Donnetty 


VOLUME 63—January-June, 1961 
1, Make a sentence test: An approach to objective scoring of sentence completions—E. F. Boncatta 
Some aspects of the relationship between perception and motility in children—E. P. Rornman 
Some psychophysical aspects of mental illness in children—P, H, Berxowrrz 
Conceptual thinking in schizophrenic children—G. Frio Man 
2. Some maternal influences on children's personality and character—J. C. Finwey 
A study of concept formation in normals, mental defectives, and brain-damaged adults—R. W. Zastow 
An investigation of trait relationships among six-year-old children—S. A. AVAKIAN 


* * VOLUME 64—July-December, 1961 
1. Coping patterns of preschool children in response to intelligence test demands—A. Montarty 


peepee PhO RC ar interactional problem-solving behavior of rats: II. Solitary and social controle | 


Studies of role performance—J. H. Mann 

A psychological assessment of professional actors and related professions—R. Tarr 
A cross-cultural study of menstrual taboos—W. N. Srepuens 

The biological roots of creativity—H. Gutman 


VOLUME 65—January-June, 1962 


1, A Rorschach study of the development of personality structure in white and Negro children in a Southeastern 
community—A. C. Price 


Measuring personality patterns of women—J, Loxvincen 
Home interests in early adolescence—M. AMATORA 


Social and emotional adjustment di adoli in 
ge titee be Blass luring lescence as related to the development of psychosomatic illness 


2, A systematic study of interaction process scores, peer and self-assessments, personality and other variables—E. F, 
Boncarra Ay 


Children’s conceptions of the content and functions of the human body—E. Gruuzar 


VOLUME 66—July-December, 1962 
anual laterality and perceptual rotation of literal symbols—I. Furscuen 
Parental child-rearing attitudes and their correlates in adolescent hostility—S. B. Cuonost 
Some explorations into the nature of anxieties relating to illness—P. R. Rossins 
Developmental-interactive aspects of hypnosis: Some illustrative cases—J. R. Hucano ano E. R. Huncano 
2. Teaching a two-year-old to read: An experiment in early childhood learning—W. Fowxer 
Achievement gains or losses during the academic year and over the summer vacation period: A study of trends in 
achievement by sex and grade level among students of average intelligence—K. M. Panstey, Jn., axp M. POWsLt 


VOLUME 67—January-June, 1963 
1, Verbatim dialogue of a mother and child in therapy—C. E. Mousraxas 
A study of self-concept: Delinquent boys’ accuracy in selected self-evaluations—W, E. Amos 
An integrating study of the factor structure of adult attitude-interests—R. B. Carrett anv J. Horn 
Social factors in mental disorders: A contribution to the etiology of mental illness—W. G. Conuiner 
2. Conceptual preferences associated with choice of quantitative and nonquantitative college majors—J. Grsert 
A study of the concept of the feminine role of 51 middle-class American families—A. Srammann 


VOLUME 68—July-December, 1963 
1. Brain dynamism as reflected in illusions—G. K. Yaconzynsxt 
The nature and measurement of components of motivation—R. B. Carreit, J. A. Ravcuirrs, ann A. B. SwENET 
2, Sharing in preschool children as a function of amount and type of reinforcement—W. F. FiscHEn 
The Gesell Incomplete Man Test as a measure of developmental status—L. B. Ames ann F. L. Inc 
Psychological and sociological factors in prediction of career patterns of women—M. C. Muver 
Functionalism and its historical significance—R. Hanaison 


@ 


PUBLICATION IN THE JOURNAL OF PSYCHOLOGY 


The Journal of Psychology provides a special service to authors and readers. 
Articles accepted for publication appear in print with the least possible delay. 
Although editorial standards are the same as those for the other journals, all 
correspondence and handling are expedited, and the referees may, at their dis- 
cretion, permit some latitude in length of treatment with respect to tabular 
material, discussion, and review of the literature. 


To save time, authors are requested to exercise unusual care in preparing 
their papers. Special attention should be paid to the General Instructions on 
the back cover of each issue and to the announcement on the first preliminary 
page. 


ERRATUM 
DarbeN, E. Masculinity-femininity body rankings by males and females. J. Psychol, 
1972, 80, 205-212. P. 209. In Table 2, the first three numbers in or. Oran 


should be changed from .75 to .70, .05 to .95, and .05 t 
1 and the six ¢ values listed under it should be deleted, ao 


poe per annum BIMONTHLY 

$18.00 per volume

Three volumes per year                    September, 1972
Single Numbers $9.00    Immediate Publication    Volume 82, First Half


(Founded in 1935 by Carl Murchison)


The Journal of 
PSYCHOLOGY 


The General Field of Psychology 


If this space should be unstamped, this is the regular library edition. But
if this space is stamped with a designating title, this is a special edition, sold
under the restrictions of a bilateral contract, and may not be resold for a period
of five years from the date of publication.


SEPTEMBER, 1972 
(Manuscripts are printed in the order of final acceptance) 


CFQ: The first statement of a statistical procedure to determine similarities and
differences for behavioral science research
By LEONARD M. COHEN AND JOHN H. FORTHMAN


The effects of two kinds of reinforcement on the persistence scores of children
of different ability pattern
By KEITH BARTON AND JAMES W. BARNARD


Expected need deficiency and preferences for three types of organizations
By WILLIAM K. GRAHAM AND PATRICIA A. RENWICK


Mother’s education and working: Effect on the school child
By FRANCES CORNWALL HUTNER


A note on sex differences in adolescents’ divergent thinking
By HELEN OLIVE

(over) 


Copyright © 1972 by the Journal Press 
Provincetown, Massachusetts, U. S. A.
All rights reserved 
Second-class postage paid at Provincetown, Mass. 
and additional mailing offices 
Send all mail to Managing Editor, 2 Commercial Street, Provincetown, Massachusetts 02657 


An experimental investigation of hitchhiking
By PETER CRASSWELLER, MARY ALICE GORDON, AND W. H. TEDFORD, JR.


Relationship between creative activity and the health of elderly persons
By ALICE M. DAWSON AND WARREN R. BALLER


A study of visual perception among aged people
By Louis Lerga AND ULRICH K. RATHEY


International cross-language factor stability of personality: An analysis of the
Shure-Meeker Personality/Attitude Schedule
By L. L. CUMMINGS, D. L. HARNETT, AND S. M. SCHMIDT


A normal curve method for combining probability values from independent
experiments
By EUGENE S. EDGINGTON


Landmarks in the history of clinical psychology from its early beginnings
through 1971
By M. MIKE NAWAS


The completion hypothesis and the compensatory dynamic in intersexual attrac-
tion and love
By RICHARD CENTERS


Personality and nonverbal rigidity: Some preliminary findings
By WILLIAM E. SIMON, LOUIS H. PRIMAVERA, BERNARD Kew, AND
ROBERT M. Crista


Magnitude estimation of visual velocity
By R. S. KENNEDY, M. D. YESSENOW, AND G. R. WENDT


A factor-analytic synthesis of personality theories: Basic dimensions recon-
sidered
By JAMES D. EVANS AND H. WALLACE Sunn


Social stereotyping of male and female body types with differing facial ex-
pressions by elementary school age boys and girls
By DONALD W. FELKER


Some psychological determinants of alpha attenuation after patterned flashes
By Minami Kanon AND SHojr Kirayima


An investigation of the athletic personality type hypothesis
By ROLAND LOWE AND RON SANr


Dogmatism and preference in 1970 Tennessee Senate campaign
By C. V. ROSEN AND CHARLES T. KENNY


Fathers and sons: A longitudinal study of 506 Yale fathers and their Yale sons
By Pavr § Borie


Published as a separate and in The Journal of Psychology, 1972, 82, 3-11. 


CFQ: THE FIRST STATEMENT OF A STATISTICAL 
PROCEDURE TO DETERMINE SIMILARITIES AND 
DIFFERENCES FOR BEHAVIORAL SCIENCE 
RESEARCH* 


Texas A & I University


LEONARD M. COHEN AND JOHN H. FORTHMAN


A. INTRODUCTION


The illogic of assuming similarities when research differences have not been 
demonstrated to the 1%, or at least the 5%, level has plagued behavioral 
science research for a long time. The authors object to the conclusion, reached 
by many, that if two groups are not different, they must be the same. Perhaps 
such an illogical premise became so widely accepted because a simple statistical 
test for the determination of congruities—such as existed for the determination 
of differences—has been lacking. The purpose of this paper is to propose a 
statistical procedure and research design to permit testing factors of con- 
gruence with the same confidence as can be applied to the testing of differences. 

Most importantly, the proposed Congruency Factor Quotient (CFQ) 
test of congruence can be calculated at the desk without resort to electronic 
equipment. Only one square root is required. It is a simple procedure useful
for small, as well as large, sample studies. To avoid false readings, which 
merely may be artifacts of small sample selection, it is recommended that 100 
cases be required for each of the groups under investigation. Since, however, 
the CFQ technique is designed to test the influence of social factors on the 
congruencies of behavior, attitudes, and feelings, the recommendation of a 
200-case minimum for the study of the congruencies in two separate groups— 
be the variables social or biological—should not be too limiting in practical 
terms. For the very large research, a prepackaged program (5) is available. 


B. DEFINITION OF CFQ


CFQ is a simple power quotient where congruence is tested as a function 
of sample size in relation to the cumulative distances in ONE direction of 
measured differences that exist between two sample populations. CFQ is a 
Congruency Factor Quotient. CFQ is a statistic based on the concept that
differences between measures will, themselves, be parametric—even though
such differences result from such nonparametric arrays as “WW” curves or 
straight line distributions. 

The theoretical foundation for the CFQ as a measure of congruity rests 
on the fact that the arrays of all measurable differences from a population, in 
its theoretic entirety, will fall into a normal probability curve. This was 
demonstrated by Massey (9) and Smirnov (12) as long as 25 years ago. 
Thus, the foundation for the CFQ levels of confidence proposed by the 
authors, as shown in Table 1, below, derives from the application of Gaussian 
probabilities to arrays of differences, even though the two arrays being com- 
pared for congruence arise from nonparametric measures. 

Since the CFQ is an extension and modification of the Kolmogorov-Smirnov 
two-sample statistic, those interested in more extensive background of mathe- 
matical developments of the techniques proposed here are referred to the works 
of Kolmogorov (6, 7), Smirnov (10, 11, 12), Massey (8, 9), Dixon (3), 
and Birnbaum (1, 2). The CFQ levels of confidence, presented in Table 1, 
are a simplification of the tables presented by Smirnov (12, p. 280-281). As
explained later in this paper, the Smirnov “z” becomes the Cohen-Forthman
AI value, which by the use of Table 1 is immediately translatable into con-
fidence levels of congruency or similarities, indeterminancy, or a significance
of differences—thus becoming immediately usable by the typical behavioral
scientist. By use of the procedures proposed in this paper, the researcher does
not have to calculate a Smirnov “z” value, nor a probability percentage for
each measure.

A deep understanding of the mathematical foundations of the CFQ, how- 
ever, is not necessary for using it in a realistic and meaningful way. Techniques 
for the calculation of the CFQ, and examples of its use and interpretation, 
are presented in this paper. The behavioral scientist not sophisticated in mathe- 
matical theory may begin using this statistic as a measure of significance of 
similarity in much the same way as he has been using chi square or Fisher’s “t”
as tests of significance of the differences he has been finding. Before going on,
the authors must point out that the CFQ statistic will also measure the signifi-
cance of differences, as well as indicate areas of indeterminancy. Two birds may
now be killed with one stone—both similarities and differences, and their
relative significances, may now be calculated by the use of one statistic derived
from a single research design.


C. RESEARCH DESIGN


The suggested research design requires that one of the two groups under
investigation be designated—arbitrarily—as the normative constant. The
distribution of the vector under investigation is compared, for the second
group, against the distribution of this designated normative constant. The 
CFQ hypothesis must, therefore, become a two-step statement of the research 
problem: (a) What similarities, or differences, can be demonstrated, statis- 
tically, in the two arrays under consideration? (b) Can the similarities,
statistically found to be congruent, be demonstrated to derive—logically— 
from the same normative factors? If the factor under investigation meets 
the pre-established confidence levels of congruence, as determined by the CFQ, 
the determined similarities may then be attributable to the normative factors 
under investigation. 

In the examples illustrated below, the hypotheses tested are all concerned 
with the question of whether attitudes expressed by females and males derive 
from the same culturally normative factors. Whether or not there is con- 
gruency may now be tested by the CFQ statistic. The calculations necessary 
are demonstrated in one of the examples, showing how simple computation of 
the CFQ actually is. 


D. CALCULATION OF THE CFQ


CFQ levels are determined from alpha integer (AI) values and are pre- 
sented in Table 1. Thus, to determine confidence levels for both congruencies 
and differences, all one need do is compute the AI value for the two arrays 
being compared. The Cohen-Forthman AI value is identical to the Kolmogorov-
Smirnov “z” score redesignated AI to avoid confusion with the standard Z
score, now so widely used in psychometrics. 

In the development of Table 1, the authors have simplified the Smirnov 
(12) data in an additional manner to the modifications already described by 
(a) establishing an arbitrary cutoff point to determine a confidence level 
boundary for the concept of statistical congruence; (b) establishing a gray
area of indeterminancy; (c) establishing areas of noncongruence which are 
interpretable as a significance of difference. 

AI values of .649 or less represent an 80% or greater chance that such 
values did not arise accidentally—but rather resulted because there is a con-
gruency. When the logic of the situation permits such an assumption, one may
conclude that the two arrays are part of the same distribution—when this is 
demonstrated statistically. The statistical demonstration to an AI value of 
.649 (or less), the authors define—perhaps arbitrarily—as a statistically 
significant similarity or as a congruence. 

The AI range between .650 and 1.359 represents a 79.9% to a 5.1% 
chance that similarities may exist between the two arrays, but that congruence 
has not been demonstrated. Although normative factors may have established
some degree of similarity, the authors define this as the grey area of indeter-
minancy. No statistical confidence of congruity can be placed in similarities
generating AI values of .650 to 1.359. 

AI values of 1.360 and higher demonstrate statistically significant differ- 
ences or incongruities. An AI value of 1.360 may be interpreted as showing 
statistical differences significant at the 5% level of confidence, and may be 
interpreted as is Fisher’s or Student’s “t” test. Thus, a measure of statistical 
difference significance may be derived from computation of a statistic to 
measure congruences. 
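
The three ranges just described can be sketched as a small lookup. This is only an illustrative sketch: the function name `cfq_level` is hypothetical, the cutoffs are the ones quoted in the text and in Table 1, and the treatment of values falling between .649 and .650 (counted here as indeterminate) is an assumption.

```python
def cfq_level(ai: float) -> str:
    """Name the Table 1 level for an AI value (cutoffs as quoted in the text)."""
    if ai < 0:
        raise ValueError("AI cannot be negative")
    if ai <= 0.649:
        return "congruence"
    if ai < 1.360:
        return "indeterminancy"
    # Significance cutoffs from Table 1, strictest first: (minimum AI, percent).
    cutoffs = [(2.220, 0.01), (1.930, 0.10), (1.630, 1.00), (1.360, 5.00)]
    for minimum_ai, percent in cutoffs:
        if ai >= minimum_ai:
            return f"significant difference at the {percent:g}% level"
    raise AssertionError("unreachable: ai >= 1.360 always matches a cutoff")


# AI values reported for the three worked examples in this paper.
for ai in (1.934, 0.4694, 4.532):
    print(ai, "->", cfq_level(ai))
```

A single function of this kind reflects the authors' point that one statistic serves for both congruence and difference.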

CFQ levels of congruence, indeterminancy, and the significance of differ- 
ences, for various AI values are summarized in Table 1. The authors recognize 


TABLE 1
ALPHA INDEX (AI), CONGRUENCY FACTOR QUOTIENT (CFQ), AND RELATED LEVELS
OF CONGRUENCE, INDETERMINANCE, AND SIGNIFICANT DIFFERENCE


Name of level                AI value          CFQ level (percent)

Congruence                   Zero to .649      100.00 to 80.70ᵃ

Indeterminance               .650 to 1.359     79.20 to 5.23

Statistically significant
differences                  1.360             5.00
                             1.630             1.00
                             1.930              .10
                             2.220              .01

mete or more of the population would conform for the variable under investi- 
gation. 


that the suggested confidence levels of congruence are tentative. Only many
studies by large numbers of investigators will determine if the standards sug-
gested are too loose or too tight.


E. CALCULATION OF AI VALUES


Several examples from the authors’ research on social norms (4) will
illustrate the calculations required to derive AI values. These examples will
also show how to interpret the derived CFQ confidence levels as an application
of the concepts presented in this paper. For studying the notions of SONSHIP
held by male and female Anglo-American Ss on the continuum aggressive/
defensive, the calculations required to derive the AI value, and its inter-
pretation, are shown in Table 2.

The AI value is calculated from the following formula:

$$AI = \sqrt{\frac{mn}{m + n}} \; D_{mn}$$

where m = the number of Ss in the first array; n = the number of Ss in the
second group; and D_mn is the cumulative decimal fraction difference of part
of the two arrays, as explained below.

The eight males who responded in the “zero” interval represent 8/133rds
of the “m” population. Thus, .0602 is entered in the appropriate cell of
Table 2. The five female Ss responding to the “zero” interval represent
5/137ths of the “n” array. This is the decimal fraction .0365, and is entered
as shown in Table 2. The difference between “m” and “n,” on the “zero”
interval is +.0237. This is entered on the last line of Table 2.

When all the decimal fractions, for each of the intervals, have been com-
puted, D_mn is calculated as follows: The decimal differences for the intervals
showing the greatest number of positive OR negative signs are summed, with
the direction of the signs ignored. In this example, the decimal differences of
the five negative intervals are summed, with the negative sign dropped. The
negative intervals are used as the minus signs outnumber the plus signs five to
three. D_mn thus becomes .0089 + .0311 + .0580 + .0430 + .0920 which
equals .2339.
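
The steps above can be sketched in a few lines. This is a minimal sketch, not the authors' program: the function name `one_direction_difference` and the example counts are hypothetical, and the handling of a tie between the number of plus and minus signs (taken on the negative side) is an assumption, since the text does not cover that case.

```python
def one_direction_difference(counts_m, counts_n):
    """Return (signed differences, D_mn) for two arrays of interval counts."""
    if len(counts_m) != len(counts_n):
        raise ValueError("both arrays must cover the same intervals")
    m, n = sum(counts_m), sum(counts_n)
    # Decimal fraction of each group in each interval, first array minus second.
    diffs = [a / m - b / n for a, b in zip(counts_m, counts_n)]
    positives = [d for d in diffs if d > 0]
    negatives = [-d for d in diffs if d < 0]
    # Sum the direction that has the greater number of signs, sign dropped.
    # Assumption: a tie falls to the negative side.
    majority = positives if len(positives) > len(negatives) else negatives
    return diffs, sum(majority)


# Hypothetical counts for three intervals, not the Table 2 data.
diffs, d_mn = one_direction_difference([2, 3, 5], [4, 4, 2])
print([round(d, 4) for d in diffs], round(d_mn, 4))
```

When every interval of both arrays is included, each array's fractions sum to one, so the positive and negative differences sum to the same amount apart from rounding; the tie rule therefore has little practical effect.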

The fact that the greatest difference in any step interval occurred in interval
“1” as shown in Table 2, and occurred in a “+” rather than in a “−”
direction, does not affect the computation of D_mn. Ignoring a large magnitude
of difference which runs counter to the direction of the largest number of
intervals having signs in the same direction serves to prevent distortion of AI
and CFQ by the (few) extreme score(s).

Substitution of the “m,” “n,” and “D_mn” values in the AI formula yields
the AI value. In the present example, as shown in Table 2, the AI value 
equals 1.934. Referral to Table 1 shows the CFQ level of confidence which 
may be ascribed to the derived AI value. 
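
The substitution step can be sketched as follows. The sketch assumes the usual Kolmogorov-Smirnov two-sample form, the square root of mn/(m + n) multiplied by the difference, which is consistent with the statements that AI is identical to the Kolmogorov-Smirnov “z” and that only one square root is required. The function name `alpha_index` is hypothetical.

```python
import math


def alpha_index(m: int, n: int, d_mn: float) -> float:
    """AI under the assumed form sqrt(m * n / (m + n)) * D_mn."""
    if m <= 0 or n <= 0:
        raise ValueError("both groups need at least one S")
    return math.sqrt(m * n / (m + n)) * d_mn


# Group sizes and D_mn as reported for the aggressive/defensive example.
print(round(alpha_index(133, 137, 0.2339), 3))
```

Under this assumed form the reported inputs give roughly 1.92, close to but not identical with the 1.934 cited above; either figure falls in the range of significant differences in Table 1.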

Reference to Table 1 shows that an AI value of 1.934 indicates incon- 
gruity of view between the males and females on the continuum aggressive/ 
defensive, as it applies to concepts of SONSHIP. As the AI value of 1.934 
is at the .10% level of significance (Table 1), it may be concluded that the 
views held by males and females do arise from different factors. There is no 
necessity to calculate, separately, another statistical test for the significance of 
differences. 

Data generated in our studies of nuclear family roles for the concept of 
SONSHIP on the vector kindness/cruelty for the Anglo-American male and
female respondents generated an AI value of .4694. The arrays for this vector
are shown on Table 3. Inspection of Table 1 shows that an AI value of


TABLE 3 
DISTRIBUTIONS FOR AND RESULTS OF THE CALCULATION OF THE ALPHA INDEX (AI)
AND CONGRUENCY FACTOR QUOTIENT (CFQ) FOR THE CONCEPT SON
ON THE Kind-Cruel CONTINUUM


Continuum interval 


. Kind Cruel 

Ss responding Zero 1 2 3 4 5 6 7 
Male (N = 133) 6 45 36 23 14 n 2 1 
Female (N = 137) 4 46 4 10 29 1 3 0 


Note: CFQ = 96.6%; AI = .496. 


.4694 falls between zero and .649—a CFQ level indicating that congruence
exists. For this example, the hypothesis that a son’s kindness or cruelty con- 
cepts are determined by the same culturally normative factors, whether 
viewed through male or female eyes, may be concluded to have been demon- 
strated. 

Anglo-American males and females, responding about whether their spouses 
should be rugged or delicate, yielded the array of data shown on Table 4. The 


TABLE 4 
DISTRIBUTIONS FOR AND RESULTS OF THE CALCULATION OF THE ALPHA INDEX (AI)
AND CONGRUENCY FACTOR QUOTIENT (CFQ) FOR THE CONCEPT SPOUSE
ON THE Rugged-Delicate CONTINUUM


Continuum interval 


Delicate Rugged 

Ss responding Zero 1 2 3 4 5 6 7 
Male (N = 133) 12 20 18 33 20 5 6 7 
Female (N = 137) 4 7 6 10 20 28 31 31 


Note: CFQ = less than .01%; AI = 4.532. 


AI value for these data is 4.532. Inspection of Table 1 shows this value to 
be far in excess of the 1.360 required to reach the 5% level of confidence in 
the significance of difference. Thus, there is statistical verification of the cul- 
turally observed pattern that men and women hold different views as to the 
desired delicacy or ruggedness of their spouses. 

The research examples cited merely indicated the type of investigations 
suited to the CFQ techniques testing the levels of congruence. The first 
example demonstrated that there was incongruence in the attitudes expressed 
by men and women in their concepts of SONSHIP on the aggressive/defensive
vector. Reasons for this dissimilarity of view were not investigated. Additional
studies would be necessary to investigate the causative factors of the differ- 
ences noted here. 

In the example of SONSHIP on the kindness/cruelty vector, CFQ levels 
show that congruency of culturally normative expectations has been demon- 
strated. A son is expected to be kind by both male and female respondents. In
the example of SPOUSEHOOD on the ruggedness/delicacy vector, CFQ 
levels demonstrated a significant difference in the views held by men and 
women, as would be expected logically. The desire for ruggedness or delicacy 
in a spouse results, no doubt, from a combination of biologically and culturally 
active factors, the exact nature and combination of which will have to be 
determined by further studies. 


F. Summary 


In this paper, the authors have advanced a research method and statistical 
technique for testing the levels of confidence that may be ascribed to the 
findings of congruencies, as well as in the findings of similarities and indeter- 
minancies and of statistical differences or incongruencies. They have cited 
examples from existing data to demonstrate the application of these techniques 
and to demonstrate the computations necessary to derive the suggested statistic. 
The statistical procedure suggested in this paper may be calculated at the 
desk. As its use does not require electronic data processing equipment, nor 
elaborate programming, it is ideally suited to the cost conscious researcher 
who must confine himself to low budgets and unsubsidized projects. The 
derived AI values and CFQ levels will show both statistically significant con-
gruencies and differences which may result from the research utilizing these
techniques.


THE EFFECTS OF TWO KINDS OF REINFORCEMENT
ON THE PERSISTENCE SCORES OF CHILDREN
OF DIFFERENT ABILITY PATTERN*


Laboratory of Personality and Group Analysis, University of Illinois;
and Institute III: Exceptional Children and Adults,
University of South Florida


KEITH BARTON AND JAMES W. BARNARD


A. INTRODUCTION


The multiple-factor approach to the concept of intelligence has led psychol- 
ogists to investigate the correlates of patterns of abilities, rather than con- 
centrating solely on the concomitants of the variations in overall IQ. One
such line of research has involved the investigation of the personality correlates 
of individuals who show ability patterns characterized by differential strengths 
and weaknesses in the two areas of verbal and spatial skills. Several studies 
involving high verbal ability-low spatial ability (HiV-LoS) Ss and high 
spatial ability-low verbal ability (HiS-LoV) Ss have investigated the rela- 
tionship between these ability patterns and various aspects of social behavior. 
For instance, Roe (2) found that geneticists and biochemists tended to score 
higher on spatial tests than they did on verbal tests and that these same 
people were characterized as showing a relative lack of interest in other 
people and were labeled as being nonsocial. Anthropologists, on the other 
hand, scored much higher on verbal tests than they did on spatial tests, and 
their behavior indicated a strong tendency to seek out social interactions with 
others. 

A study by Maccoby and Rau (1) attempted to identify the psychological 
correlates of six different ability patterns including HiV-LoS and HiS-LoV 
patterns. Their results indicated that HiV-LoS Ss tended to be dependent 
when observed in interaction with their mothers and that they expected to 
receive help from their parents when they asked for it even when they did 
not really need it. HiS-LoV Ss, on the other hand, were reported to ask for 
help from parents only when they were sick, alone, or afraid, and they ap- 
parently anticipated that parents would respond negatively if asked for help 
unnecessarily. 


* Received in the Editorial Office on May 22, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press.

One implication that may be drawn from the above studies is that ap-
parently HiV-LoS Ss tend to be more sensitive than HiS-LoV Ss to social
stimuli; that is, they are more sensitive to the reinforcing stimuli that ac- 
company performance on a task in a social context, and less sensitive to the 
reinforcing qualities of the task itself. The HiS-LoV Ss, on the other hand, 
seem to reverse this differential sensitivity; that is, they may be relatively 
more sensitive to the intrinsic nature of the task and less sensitive to the social 
context in which it is performed. 

The present study represented an attempt to investigate the relative effec- 
tiveness of two types of reinforcement with HiV-LoS and HiS-LoV Ss in 
manipulating persistence scores. The first type of reinforcement used was social 
reinforcement in the form of the presence or absence of E, who, when present,
administered positive verbal statements to the S. This was considered an extra-
task variable. The second type of reinforcement was the type of task used, 
and involved two tasks differing in interest value and the degree to which 
they involved manipulation of objects in space. This was considered an intra- 
task variable and essentially was based upon the intrinsic interest of the task. 

Two hypotheses were generated on the basis of the above rationale. The
first hypothesis states that social reinforcement is more effective in main- 
taining persistence at a task with HiV-LoS than with HiS-LoV Ss. The 
second hypothesis states that HiS-LoV Ss would persist longer than HiV- 
LoS Ss under conditions of intrinsic reinforcement in the form of an in- 
teresting task. 


B. Metuop 
1. Subjects 


All Ss involved in the experiment were chosen from the entire (N = 446) 
white fourth grade school population of Murfreesboro, a large rural town 
in middle Tennessee. Forty boys and forty girls with a CA range of nine to 
ll years were selected on the basis of discrepancies between verbal JQ and 
spatial JQ scores obtained on Thurstone’s (4) Primary Mental Abilities 
Test (PMA). Two groups of Ss were formed. The HiV-LoS group was 
comprised of 20 males and 20 females who obtained a verbal subtest score 
that was 10 or more points higher than their spatial subtest score. The HiS- 
LoV group was comprised of 20 males and 20 females who obtained a 
spatial score that was 10 or more points higher than their verbal score. An 
attempt was made to match the HiV-LoS and HiS-LoV groups with respect 
to overall PMA IQ; however, because of the limited number of HiS-LoV 
Ss available, it was found impossible to match the two groups adequately. 
Nevertheless, subgroups were adequately matched with respect to IQ within 
the classifications of type of reinforcement and type of task. The group means 
for the verbal and spatial IQ scores are given in Table 1. 
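
The selection rule described above can be illustrated with a minimal sketch. Only the 10-point discrepancy criterion comes from the text; the function name, the labels returned, and the example scores are hypothetical.

```python
def ability_pattern(verbal, spatial, min_gap=10):
    """Classify a child by the gap between PMA verbal and spatial scores.

    Returns "HiV-LoS", "HiS-LoV", or None when the gap is under min_gap.
    The 10-point threshold follows the text; the function is illustrative.
    """
    if verbal - spatial >= min_gap:
        return "HiV-LoS"
    if spatial - verbal >= min_gap:
        return "HiS-LoV"
    return None


# Hypothetical (verbal, spatial) score pairs, not data from the study.
examples = [(116, 88), (84, 112), (100, 95)]
patterns = [ability_pattern(v, s) for v, s in examples]
```

Children whose scores differ by fewer than 10 points fall into neither group, which is in keeping with the authors' report that relatively few Ss with large discrepancies were available.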


TABLE 1 
GROUP MEANS FOR VERBAL IQ, SPATIAL IQ, AND PERSISTENCE SCORES 

Scores                      MD(a):R(b)   MD:NR(c)   PB(d):R    PB:NR 
HiVerbal-LoSpace Ss 
  Verbal IQ                    116          116        117       114 
  Spatial IQ                    88           92         92        88 
  Persistence scores in 
    minutes:seconds           5:07         2:18      10:57      5:58 
HiSpace-LoVerbal Ss 
  Verbal IQ                     84           73         87        86 
  Spatial IQ                   112          107        105       108 
  Persistence scores in 
    minutes:seconds           4:01         5:12       4:18      7:04 

(a) MD = Marble dropping task. 
(b) R = Social reinforcement condition. 
(c) NR = No-reinforcement condition. 
(d) PB = Pinball task. 


2. Design 


A 2 X 2 X 2 factorial design was employed involving three main effects 
including (a) two ability patterns, HiV-LoS and HiS-LoV; (b) two tasks, 
a marble dropping task and a pinball machine task; and (c) two reinforcement 
conditions, a social reinforcement condition and a no-reinforcement 
condition. Since no within-S variables were manipulated, each S was tested 
on only one task and under only one condition of reinforcement. The variable 
of sex was not included in the final design after an initial analysis 
utilizing a 2 X 2 X 2 X 2 analysis of variance design with sex included 
failed to yield any significant main effects or interactions involving sex. 


3. Apparatus 


The experiment was conducted in a converted school bus, the inside of 
which was partitioned so that the children could be left to work on their 
own when it was so required. 

a. Marble dropping task. The marble dropping task used was essentially 
the same as the standard model described by Stevenson (3). The child was 
asked to select marbles from a seemingly inexhaustible supply of marbles in 
a large container on his left and insert them, one at a time, into a small 
hole in a board placed directly in front of him. The board was actually the 
top of a table and covered a large receptacle beneath. Performance on this 
task obviously requires very little in the way of skill and, as a consequence, 
has been shown to be very sensitive to the motivational state of the performer. 

b. Pinball machine. This machine was approximately three feet long, two 
feet wide, and six inches high, and was fitted with a transparent Plexiglas 
top. About three inches below this top, a formboard was fitted with eight 
round holes through which marbles could fall, so that any marbles which 
had not fallen through a hole might roll off the edge of the board into posi- 
tion for refiring. This formboard terminated four inches from the front of 
the machine. The whole machine was constructed with legs that could be 
adjusted to modify the tilt of the surface. This was done to ensure that the 
balls had a more or less equal chance of entering any hole, and did not pass 
over the board too quickly. The machine was so designed that once a ball 
had been fired by release of a spring-loaded gun, it would eventually return 
to its original position and so could be fired again and again ad infinitum. 
Enough (six) steel balls 5/8 inch in diameter were used so that even when 
the machine was played at a very fast rate, there was always a ball in the 
position to be fired. 


4. Procedure 


The children involved in the experiment were taken individually from 
the classroom. On entering the laboratory, Ss were given instructions for the 
appropriate task. The social reinforcement condition was established in the 
following way. The experimenter sat by the side of the S during the task. 
At intervals of 30 seconds, the experimenter reinforced the Ss verbally. The 
verbal reinforcement statements were as follows: “I can see that you can do 
this well”; “good”; “you are doing well”; “excellent”; “you can play this 
well”; “very good”. These phrases were used in a random order of presenta- 
tion. 

The no-reinforcement condition was established by leaving the child in the 
room on his own with no visual or verbal contact with the examiner after 
the instructions had been given. Under this condition of no reinforcement 
the child was simply told that the experimenter would be working in the 
next room and the child could knock on the door when he was finished. 

The reinforcement condition thus presumably consisted of praise administered 
verbally, as well as the actual physical presence of the experimenter. 

The dependent measure was the amount of time in seconds that the child 
spent playing the game. This time was referred to as the child’s persistence 
score. 
C. Results 


A distribution of the Ss’ persistence time scores revealed a large positive 
skew, and, therefore, a log transformation was made on the raw data. An 
F max test (5) was applied to these transformed data, and this analysis 
confirmed that homogeneity of variance existed for all groups of Ss. The log 
time scores were then analyzed with use of a simple 2 X 2 X 2 factorial 
design (see Table 1 for group mean persistence scores in minutes:seconds). 
This analysis of variance indicated that there was a significant difference 
between tasks (F = 14.18, 1/64 df, p < .01). Subjects spent significantly 
more time on the pinball machine than on the marble dropping task. No 
other main effects were found. 
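
The preprocessing step described above, a log transformation followed by a check on homogeneity of variance, might be sketched as follows. This assumes the test cited is Hartley's F-max (the ratio of the largest to the smallest group variance) and that base-10 logarithms were used; the text specifies neither. The persistence times are hypothetical, not data from the study.

```python
import math
from statistics import variance


def log_transform(seconds):
    """Return base-10 logs of persistence times given in seconds."""
    return [math.log10(t) for t in seconds]


def f_max(groups):
    """Hartley's F-max statistic: largest sample variance over the smallest."""
    variances = [variance(g) for g in groups]
    return max(variances) / min(variances)


# Hypothetical persistence times in seconds for two cells (not study data).
cell_a = [60, 120, 300, 900]
cell_b = [100, 150, 240, 400]

raw_ratio = f_max([cell_a, cell_b])
log_ratio = f_max([log_transform(cell_a), log_transform(cell_b)])
```

With positively skewed times such as these, the variance ratio on the log scale is smaller than on the raw scale, which is the usual motivation for transforming before an analysis of variance. The observed ratio would then be compared with a tabled critical value, which this sketch does not include.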

The first hypothesis which stated that social reinforcement would be more 
effective with HiV-LoS than HiS-LoV Ss was supported by the finding of 
a significant interaction between reinforcement conditions and types of ability 
patterns (F = 5.50, 1/64 df, p < .05). Further exploration of this interaction 
was made by applying t tests to the individual group means. Results 
from these analyses showed that HiV-LoS Ss spent significantly more time in 
the social reinforcement condition than they did in the no-reinforcement 
condition (p < .05). They also spent significantly more time in the social 
reinforcement condition than did the HiS-LoV Ss (p < .05). In addition, it was 
found that the HiS-LoV Ss spent more time in the no-reinforcement condition 
than they did in the social reinforcement condition (p < .05). No other 
significant differences were found. 

Because it was not possible to match the HiV-LoS and HiS-LoV groups 
with respect to IQ, it was necessary to carry out an analysis of variance on 
IQ scores by means of the same simple 2 X 2 X 2 factorial design employed 
with the persistence scores. The results of the analysis showed that the 
HiV-LoS Ss possessed a higher total IQ than did HiS-LoV Ss (p < .01). However, 
no significant interactions were found. 

A correlational analysis was performed on the verbal and spatial scores 
of the total population from which the experimental Ss were selected. Results 
indicated that verbal ability was positively related to spatial ability (Pearson’s 
product-moment correlation r = .45, p < .01). 


D. Discussion 


The finding that HiV-LoS Ss spent more time on tasks under social rein- 
forcement conditions than did the HiS-LoV Ss supported the first hypothesis 
which stated that social reinforcement would be more effective with HiV- 
LoS than with HiS-LoV Ss. This result was consistent with Roe’s finding 
that HiV-LoS Ss tended to seek out interactions with others while HiS-LoV 
Ss did not, and with the findings of Maccoby and Rau that HiV-LoS Ss 
tended to be dependent on social cues while HiS-LoV Ss responded in a 
socially independent fashion. 

The finding that HiS-LoV Ss spent less time in the reinforcement condi- 
tion than in the no-reinforcement condition was interesting and somewhat 
puzzling. One possible interpretation of this finding is that the HiS-LoV Ss 
actually found the social reinforcement situation aversive rather than simply 
nonreinforcing. This particular interpretation is consistent with Roe’s con- 
clusion referred to above. 

The hypothesis that the HiS-LoV Ss in the no-reinforcement condition 
would spend longer than the HiV-LoS Ss in the task with the greatest 
amount of intrinsic interest value, was not substantiated. Two suggestions can 
be offered to explain the failure to support this expectation. First, there is the 
possibility that the large size of the task main effect may have masked the 
interaction between type of task and ability pattern through some kind of 
ceiling phenomenon. Second, it is likely that tasks vary in interest value on 
several different dimensions. In this particular study it was intended that the 
two tasks should differ with respect to the extent that they allowed Ss to 
manipulate objects in space. The fact that a main task effect was found, but 
no interaction between type of task and ability pattern, suggested that the 
tasks may not have varied on a dimension involving differences in the extent 
to which spatial manipulation was possible, but instead the tasks may have 
varied on some other dimension which was independent of any differences in 
operations requiring spatial manipulations. Indeed, in retrospect it might be 
argued that the pinball machine did not allow as much manipulation of 
objects in space as was originally intended, because once the ball was fired, 
control over its destiny was largely relinquished. 

The positive correlation (r = .45) obtained between verbal and spatial 
ability scores for the total population of fourth grade school children from 
which the Ss were selected, tends to explain why difficulty was experienced 
in obtaining enough Ss who showed discrepancies in their scores on these two 
abilities. Since the HiV-LoS and HiS-LoV groups could not be adequately 
matched with respect to IQ, it could be argued that any differences in the 
effectiveness of social reinforcement for these two groups might just as likely 
have been due to the differences in overall intelligence as to the differences in 
ability pattern. However, the analysis of variance for IQ scores indicated that 
there was no significant interaction between reinforcement conditions and 
ability patterns as had been found in the analysis of persistence scores. This 
finding strongly suggests that differences in IQ between HiV-LoS and HiS-LoV 
Ss did not differentially affect their performance under either condition 
of reinforcement. And, in fact, this finding is consistent with the inference 
that the differential sensitivity to social reinforcement found between 
different ability pattern groups may be present throughout much of the IQ 
range. 


E. Summary 


The effectiveness of social reinforcement with high verbal-low spatial 
(HiV-LoS) and high spatial-low verbal (HiS-LoV) Ss was assessed by 
recording the amounts of time spent at two types of tasks by fourth grade 
children under conditions of social reinforcement and no reinforcement. As 
predicted, HiV-LoS Ss persisted longer at both tasks than did the HiS-LoV 
Ss under the condition of social reinforcement. It was also found that the 
HiS-LoV Ss spent less time in the social reinforcement condition than they 
did in the no-reinforcement condition. 
EXPECTED NEED DEFICIENCY AND PREFERENCES 
FOR THREE TYPES OF ORGANIZATIONS* 


University of California, Berkeley 


WILLIAM K. GRAHAM AND PATRICIA A. RENWICK 


A. Introduction 


Although vocational choice has been a frequent topic for research, there 
have been comparatively few investigations of the psychological processes in- 
volved in organizational choice. Preferences for organizations probably de- 
velop during the period in which the individual obtains the education and 
credentials needed to pursue his chosen career. Vroom (12, p. 17) has pro- 
posed that such preferences are determined by the valences and instrumen- 
talities attached to the perceived alternatives. According to Vroom: 

The valence of an outcome to a person is a monotonically increasing 
function of the algebraic sum of the products of the valences of all other 
outcomes and his conceptions of its instrumentality for the obtainment of 
these outcomes. 


In a study to investigate the utility of this proposition for predicting 
preferences for organizations, Vroom (13) asked students to rate the im- 
portance of 15 work goals and to estimate the probability of actually attaining 
these goals in the organization. Vroom found that “instrumentality-goal 
indices” computed from these data were related to the degree of preference 
that individuals expressed for organizations they were considering as possible 
places of employment. 

The utility of the instrumentality-goal approach was also investigated in 
a study by Pieters, Hundert, and Beer (5). In this study the sum of the 
products of valence of job goals and the attractiveness of companies with 
respect to those goals were used to predict acceptors and rejectors of a com- 
pany’s job offer. This instrumentality-goal index was found to be higher 
for the accepted than for the rejected organization in 86% of the cases. 

Sheard (11) has pointed out that the above studies do not constitute ade- 
quate tests of the instrumentality-goal approach, since they fail to examine 
the comparative validity of other approaches. In a study of college students’ 
preferences for different types of organizations, Sheard compared validity 
coefficients derived from the instrumentality-goal approach with those 
obtained by using a simple sum of goal-attainability ratings. The average 
intra-subject correlations using the instrumentality-goal approach ranged from .78 
to .82, whereas those for the simple sum of goal-attainability ratings ranged 
from .76 to .81. No significant differences were found between the validity 
coefficients obtained from the two procedures. As Sheard notes, these results 
are consistent with the findings of Ewen (1) and of Mikes and Hulin (4). 
Both of these studies indicate that in measuring job satisfaction the weighting 
of job factors by importance contributes little if anything beyond that ob- 
tained from a simple sum of satisfaction ratings. 
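
The two scoring approaches compared above can be illustrated with a short sketch. It assumes the instrumentality-goal index is the sum of products of each goal's rated importance and its perceived attainability, as the studies cited describe, and that the alternative is the unweighted sum of attainability ratings. The function names and all ratings are hypothetical.

```python
def instrumentality_goal_index(importance, attainability):
    """Sum of products of goal importance and perceived attainability."""
    return sum(i * a for i, a in zip(importance, attainability))


def simple_sum(attainability):
    """Unweighted sum of goal-attainability ratings."""
    return sum(attainability)


# Hypothetical ratings for three work goals in two organizations.
importance = [5, 3, 1]
org_a = [4, 2, 5]
org_b = [2, 5, 5]

weighted = {
    "A": instrumentality_goal_index(importance, org_a),
    "B": instrumentality_goal_index(importance, org_b),
}
unweighted = {"A": simple_sum(org_a), "B": simple_sum(org_b)}
```

In this contrived case the two indices rank the organizations differently, but the findings summarized above suggest that with real ratings the importance weights add little predictive validity.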

The present study was designed to investigate the extent to which potential 
job applicants—both male and female—perceived the possibility of fulfilling 
five Maslow-type needs (Security, Social, Esteem, Autonomy, and Self- 
Actualization) in three types of organizations: a military organization, a 
public utility, and a large business organization. An additional aim of the 
study was to examine the relationship between need-deficiency scores and 
organizational preference as indicated by the level of job satisfaction one 
would expect were he actually employed in the organization. 


B. Method 


Subjects were 94 male and female undergraduates from the School of 
Business Administration at the University of California, Berkeley. A slightly 
modified version of Porter’s (6) need satisfaction questionnaire was used to 
obtain measures of base need levels and degree of expected need fulfillment. 
Base need levels were established by asking subjects to indicate “How much 
of the characteristic do you think should be connected with your employment 
in an organization?” Subjects were then given the names of three organiza- 
tions: the United States Army, Pacific Gas and Electric, and IBM. They 
then completed the need scales for each organization with instructions to indicate 
“the extent to which you feel the characteristics are actually present.” Need-
deficiency scores for the five need categories were obtained by subtracting base 
need scores from the scores representing the extent to which the characteristics 
or qualities were perceived as present in each of the three organizations. 
Finally, subjects were asked to indicate how they would feel as employees 
of each of the three organizations by checking a face on the 
General Motors Faces Scale (2). 
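
The need-deficiency scoring described above can be sketched as follows. The sketch follows the literal wording of the text (perceived presence minus base level), so a negative score indicates a perceived shortfall; this sign convention, the 1-7 rating scale, and all of the ratings shown are assumptions made for illustration.

```python
NEEDS = ["Security", "Social", "Esteem", "Autonomy", "Self-Actualization"]


def need_deficiency(base, perceived):
    """Per-need difference: perceived presence minus base ("should be") level.

    Follows the literal wording of the text; the sign convention is assumed.
    """
    return {need: perceived[need] - base[need] for need in NEEDS}


# Hypothetical ratings on a 1-7 scale for one subject and one organization.
base = {"Security": 5, "Social": 5, "Esteem": 6,
        "Autonomy": 6, "Self-Actualization": 7}
perceived = {"Security": 6, "Social": 5, "Esteem": 4,
             "Autonomy": 3, "Self-Actualization": 3}

scores = need_deficiency(base, perceived)
```

Repeating this calculation for each of the three organizations yields the 15 scores per subject (five needs by three organizations) that enter the analysis of variance.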


C. Results 


To investigate patterns of need deficiency across organizations, a 2 X 3 X 5 
analysis of variance was performed with repeated measures on two factors. 
Sex, organization, and need category were the factors; amount of need 
deficiency was the dependent variable. The main effect for sex and all 
interaction terms involving the sex factor failed to reach significance. There was, 
however, a substantial main effect for the organization factor. Thus, percep- 
tions of overall need deficiency differed depending upon the particular organi- 
zation involved (F = 18.51, p < .01). There was also a sizable main effect 
for the need factor (F = 16.33, p < .01). Finally, there was a significant 
interaction between the organization and need factors (F = 17.26, p < .01). 

The mean need-deficiency scores for the five needs across the three organiza- 
tions are plotted in Figure 1. For the two highest-order needs, the need for 
Autonomy and the need for Self-Actualization, Scheffé (10) post hoc tests 
indicated that all pairwise contrasts across organizations were statistically 
significant beyond the .01 level. In the case of the need for Esteem, the 
business organization differed significantly from both the military and the 
public utility organizations (p < .01) ; however, contrasts between the latter 
two organizations were not statistically significant. No significant differences 
were found for the Social need. Finally, with respect to the Security need, 
the contrast between the business and military organizations was significant 
beyond the .01 level. The mean Faces Scale score was 4.07 for the military 
organization, 6.52 for the public utility, and 8.13 for the business organization. All 
pairwise t tests between these means were significant beyond the .01 level. 
These results, of course, indicate that perceived need deficiency is negatively 
related to anticipated job satisfaction. To examine this relationship more fully 
and to investigate the relative contributions of need-deficiency scores to 
anticipated job satisfaction, stepwise multiple-regression analyses were 
performed for each organization. For the business organization the consecutive 
multiple correlations with anticipated job satisfaction were as follows: Social 
= .41; + Self-Actualization = -.45; + Autonomy = -.48. Addition of 
the remaining two needs increased the multiple correlation just slightly to its 
maximum of -.49, p < .01. The stepwise correlations for the military 
organization were as follows: Self-Actualization = -.43; + Social = -.49; 
+ Esteem = -.51. Inclusion of the Security and Autonomy needs raised 
the multiple correlation to its maximum of -.52, p < .01. For the public 
utility the stepwise results were the following: Esteem = -.45; + Self-
Actualization = -.48; + Social = -.51. Addition of the needs for 
Autonomy and Security increased the multiple correlation to its maximum of 
-.53, p < .01. 


D. Discussion 


The results of this study clearly indicate that Business School students are 
able to differentiate among organizations in terms of the possibilities for ful- 
filling human needs of the sort described by Maslow (3). The overall amount 
of perceived need deficiency varied markedly when comparisons were made 
between a military organization, a public utility, and a large business organiza- 
tion. Moreover, the magnitude of deficiency scores for specific needs varied 
depending on the organization in question. Whereas the higher-order needs 
for Autonomy and Self-Actualization tended to be viewed as least attainable 
in all three organizations, students perceived a greater possibility for fulfilling 
these needs in the business organization than in the public utility or the 
military organization. The pattern of these results is in close agreement 
with previous studies of perceived opportunities for need fulfillment among 
managers (6, 7) and with comparative studies of need fulfillment involving 
members of business, governmental, and military organizations (8, 9). As a 
final point in discussing need patterns, it is of interest to note that there were 
no substantial sex differences regarding the perceived possibilities for need 
fulfillment across organizations. The main effects and all interaction terms 
involving the sex factor failed to reach statistical significance. 


An additional conclusion of this study is that perceived opportunities for 
need fulfillment are related to organizational preference as indicated by the 
level of job satisfaction a person would expect were he actually employed in 
the organization. The least-preferred organization in this study, the military 
organization, was also perceived as having the least potential for fulfilling the 
three higher-order needs: Esteem, Autonomy, and Self-Actualization. The 
most-preferred organization, the business organization, was also perceived as 
most able to fulfill the higher-order needs. The public utility fell between 
these two organizations both in terms of perceived opportunities for need 
fulfillment and in terms of preference. Finally, the best linear combination 
of need-deficiency scores for predicting the preference criteria depended upon 
the organization in question. When all five needs were entered into these 
analyses, the multiple correlations ranged from -.49 to -.55. These maximum 
values, however, were closely approximated when the stepwise analyses 
were terminated after entering only three need categories. 


E. Summary 


To summarize, Business School students, whether male or female, perceived 
very different opportunities for need fulfillment in the three types of organiza- 
tions included in this study. Moreover, the patterns of need deficiency they 
perceived in those organizations were related to the level of job satisfaction 
they would expect as organization members. More information regarding job 
applicants’ images and impressions concerning opportunities for need fulfill- 
ment in different kinds of organizations would be of potential benefit to 
counselors, recruiters, and the applicants themselves. 
MOTHER’S EDUCATION AND WORKING: EFFECT 
ON THE SCHOOL CHILD** 


Rider College2 


FRANCES CORNWALL HUTNER 


A. Introduction 


Scholarly concern with the subject of maternal influences on the child’s 
intelligence, scholastic aptitude, and academic achievement and attainment is 
perennial. The entrance into the labor force of increasing numbers of mothers 
of school age children since World War II strengthened both scholarly and 
lay interest in the subject. And the current trend for mothers of preschool 
children to enter the labor force has further intensified that interest. 

A positive relationship between maternal educational achievement and the 
child’s mental ability has been found by a number of investigators with some 
—for example, Honzik (5) and Jensen (6)—attributing this relationship 
to genetic factors, while others do not explore causality (13). The con- 
siderable amount of research devoted to investigating the effect of maternal 
labor force participation on the child’s ability and achievement variables has 
resulted in conflicting conclusions. Past findings have run the gamut from 
concluding that maternal employment hurts children’s academic ability and 
achievement through finding no significant relationship between these vari- 
ables to discovering that working mothers have children who perform more 
ably than do children of nonworking mothers. 

As for the relationships between the education and employment of the 
mother and expectations and aspirations for the child’s education, these have 
usually been discussed in terms of the child’s rather than the parents’ or 


* Received in the Editorial Office on May 23, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
1 The research for this study was supported by a grant from the Research Council 
of Rutgers University. The Princeton Borough and Township School Boards permitted 
the author to use data on the Princeton public school community compiled by the 
School Boards’ Study Group. 
2 This study was undertaken when the author was at Douglass College, Rutgers 
University. 

3 Hoffman (4), Myrdal and Klein (7), Roy (12), and Stolz (16) found a negative 
relationship between mother’s working and child’s achievement under certain 
conditions. But no significant relationship was found by the studies of Nye (10), Hitchcock 
(3), Siegel and Hass (15), Frankel (2), Banducci (1), and Nelson (8). Conditions 
where maternal employment had a favorable effect on the child were found by Myrdal 
and Klein (7), Nolan and Tuttle (9), Roy (12), and Stolz (16). 
teachers’ expectations (1, 13, 14). The effect of maternal labor force participation 
on either the child’s aspirations or the parents’ expectations for the child 
has not been discussed. 

The Princeton community public school data analyzed in the present study 
offered an unusual opportunity to investigate maternal education and employ- 
ment in relation to an impressive variety of tests of student IQ, aptitude, and 
achievement, and to parent and teacher expectations and parent aspirations 
for the child. This could be done over a seven level classification of maternal 
educational attainment and for three maternal labor force participation 
categories. Two main questions were examined: (a) Are the child’s IQ, 
scholastic aptitude, academic achievement, and expected educational attainment 
related directly, positively, and significantly to the mother’s level of 
educational attainment? (b) Are the child’s variables affected by whether the 
mother works full-time, part-time, or not at all? Particular emphasis was 
given to the second question, that is, to the effect of maternal employment 
on the child, because this is an area of especial current concern and one 
where past research has yielded conflicting results. Ideally, such a study of 
maternal employment effects should employ controls on father’s education 
and employment; mother’s working career history; and number, ages, and sex 
of children in the family. The nature of the available data made this impossible. 


B. Data and Method 


In 1964, the Princeton, New Jersey, Borough and Township School Boards 
conducted an exhaustive investigation of the characteristics of the students and 
parents in the two school systems in preparation for a possible future 
consolidation and reorganization of the two systems. The School Boards’ research group 
organized the data in forms suitable for its research purposes and stored it 
on computer tape at the Princeton University Computer Center. This tape 
was made available for the present inquiry into maternal influences on child 
performance. 

Sources for the taped data used in this research were (a) school records 
which supplied student test scores and (b) parent questionnaires which 
supplied the information on mothers’ educational attainment and mothers’ 
labor force participation. Information on maternal and child variables was 
n of a total school population of 4800 children. The 
ed by the School Boards to be randomly distributed. 
of the students. Hence the analysis in the present research has been stated in 
terms of students whose mothers have a particular educational or employment 
characteristic. Limiting features were the lack of family identification on the 
tape and the failure to identify students by sex. The School Boards’ research 
group determined the choice of grade levels for which test scores were 
recorded, and decided how the test scores for different IQ tests in the two 
systems were to be combined. 

The data on the children’s ability and achievement consisted of scores at 
selected grade levels on IQ, achievement, and aptitude tests. Two sets of IQ 
test scores were available: (a) Henman-Nelson and Kuhlman-Anderson 
(HN-KA) 9th grade scores for grades 9-12 and (b) Henman-Nelson and 
Otis (HN-OT) grade 6 and grade 11 scores. 

Four sets of achievement test scores were compiled: (a) Iowa-Composite 
test scores for grades 3-8, (b) Iowa-Mathematics test scores for grades 3-8, 
(c) College Entrance Board Mathematics test scores for grades 11 and 12, 
and (d) College Entrance Board English test scores for grades 11 and 12. 

The tape provided results for three sets of aptitude tests: (a) the College 
Entrance Board Scholastic Aptitude Test-Verbal (SAT-V), (b) the College 
Entrance Board Scholastic Aptitude Test-Mathematics (CEB-M), and (c) 
the Differential Aptitude Test (DAT-8). For all of the tests, mean scores 
were compared for students grouped by seven maternal educational levels, 
ranging from eighth grade or less to postcollege education, and by three 
maternal labor force participation categories: full-time, part-time, and 
nonworking. In addition to the test data the School Boards’ study group compiled 
information on the expectations and aspirations of parents as to the level of 
education the student would ultimately attain and on the teachers’ 
expectations for the student’s future educational attainments. 

Although the Princeton public school community is not a representative 
sample of the total United States population, it did have students distributed 
over all the maternal educational and employment categories. Of particular 
interest were the large number of students in maternal educational levels 6 
and 7, who provided an excellent opportunity to study the relationship between 
the labor force participation of highly educated mothers and the ability and 
achievement of their children. Table 1 shows the distribution of students 
within the maternal educational attainment and maternal labor force participa- 
tion categories. Almost half of the children in this study had mothers who 
were college graduates, and about two-fifths of these mothers had graduate 
training. Another fifth of the students had mothers who had attended college 
but had left before graduation. 
C. Results 


Analysis of the student test scores for all of the tests studied showed a direct, 
positive, and statistically significant relationship between the educational 
attainment of the mother and the child’s IQ, scholastic aptitude, and academic 
achievement. The breakdowns in Table 2 of the Henman-Nelson and 
Kuhlman-Anderson IQ test scores and of the Iowa-Composite achievement test scores 
provide examples of the type of result found. In all of the tests studied the 
row F is significant at the .01 level. In all of the tests, the mean scores by 
maternal educational levels fall into four groups within which the mean scores 
do not differ widely, but between which there are distinct jumps in scores. 
These groups are first, levels 1 and 2; second, levels 3 and 4; third, level 5; 
and fourth, levels 6 and 7. 

Maternal educational attainment was directly and significantly related to 
the parents’ expectations and aspirations and to the teachers’ expectations for 
the children’s attainment. Parents’ expectations were a level higher than the 
teachers’ in every instance, and the parents’ aspirations were uniformly higher 
than their expectations. 

The relationship between maternal labor force participation and student 
test scores was not comparably clear and direct. Table 3 summarizes the 
results of the comparisons of mean student test scores by maternal labor 
force participation category at each maternal educational level. Four-fifths of 
the comparisons showed no significant differences in mean test scores, according 
to whether, at any given maternal educational level, the student’s mother 
worked or not. In the remaining fifth of the comparisons, where significant
differences did occur, these differences overwhelmingly favored the students 
whose mothers did not work over students whose mothers worked full- or part- 
time. In about one-fifth of the significant comparisons, the students whose 
mothers worked part-time outperformed the students whose mothers worked 
full-time. In only one instance, level 1 of the Henman-Nelson and
Kuhlman-Anderson IQ test, did students whose mothers worked full-time score
significantly better than the nonworking group.

The cases where the students with nonworking mothers outscored those 
with working mothers were heavily concentrated in maternal educational level 
3, where the mothers were high school graduates. This level had half the
total of such significant comparisons. At this level the students with non- 
working mothers scored significantly better than students whose mothers 
worked full-time in seven of the nine comparisons made, and did equally well
in outscoring the students whose mothers worked part-time. The level with
the second largest concentration of significant comparisons between the
maternal labor force participation groups was the seventh level of students 
whose mothers had done graduate work or had had professional training after 
college. Here, the students with mothers working part-time significantly
outscored the students with full-time working mothers in six of the nine
comparisons. Students with nonworking mothers, at this level, outscored the
full-time group only three times and did not outperform the part-time category 
at all. 

Only two of the student tests showed significant overall differences between 
the mean student scores by labor force participation categories. These were 
two elementary school achievement tests, the Iowa-Composite and the
Iowa-Mathematics. They showed a significant negative correlation between maternal
working and student achievement, except at the two highest maternal educa- 
tional levels where there was no correlation. Interestingly enough, for high 
school students high scores in all the tests analyzed were associated with 
having a mother with a college education or more, who worked part-time. 

Unlike most of the student IQ, aptitude, and achievement tests, the
expectations of parents and of teachers and the parents’ aspirations for the
child’s future education all showed significant differences between the three 
maternal employment categories. Maternal employment was typically associated 
with a lowered level of expectation and aspiration. Teacher expectations for 
the students were substantially lower at every level than either parent expecta- 
tions or parent aspirations. Parent aspirations exceeded parent expectations 
by a lesser margin. 


D. Discussion 


The conclusion that, in the Princeton public school community, mother’s 
educational attainment and the child’s IQ, aptitude, and achievement were
directly and significantly related was not a surprising one in view of the 
results of previous research. Nevertheless, the high level of confidence with 
which this conclusion could be stated for all the tests analyzed was impressive. 
Further research is needed to establish how independently potent a factor 
mother’s education was and to what extent it served as a proxy for such other 
factors as father’s education, family income, family socioeconomic status, and 
heredity. It would be useful to investigate further the tendency for the 
students’ scores to fall into four groups of maternal educational levels and to 
ask why these groups divided as they did and why parent and teacher expecta- 
tions did not fall into similar groups. 


Nor was it surprising to find that maternal education was directly related to
parent and teacher expectations for the child. As Banducci (1) and Sewell and
Shah (13) have pointed out, the child’s expectations and aspirations are 
higher for higher socioeconomic backgrounds and for children of higher intelli- 
gence. Both of these factors vary directly with maternal education. Further- 
more, it seems reasonable to expect the parents’ expectations to coincide with 
the child’s. The Princeton teachers, however, appeared pessimistic in expecting 
less of the children than did the parents, although the pattern of their
expectations was similar.

This study found that maternal labor force participation was not a major 
influence affecting children’s IQ, aptitude, and academic achievement. This
finding reinforces the conclusions of Nye (10, 11), Hitchcock (3), Siegel and 
Haas (15), Frankel (2), Banducci (1), and Nelson (8). The exceptions 
to this observation were apparently related either to the age of the student, 
or to the child variable tested, or to the educational attainment of the mother. 
As far as age of student was concerned, this study observed that elementary 
school students performed significantly better on academic achievement tests 
when their mothers did not work. On the other hand, high school students 
were usually unaffected by maternal employment, and occasionally they per- 
formed better when their mothers worked part-time. This evidence supports 
the observations of Myrdal and Klein (7), Nolan and Tuttle (9), and 
Stolz (16) that the effects of maternal employment may vary with the age 
of the student. It also suggests that, for younger students, academic
achievement may be more affected by mothers’ working than performance on IQ or
aptitude tests. 

The level of maternal educational attainment showed some influence on the 
relationship between maternal employment and child ability and performance. 
Mothers who were college graduates, or who had pursued graduate study or 
professional training after college, typically worked without affecting their 
children’s test results adversely, while in some instances, particularly for high 
school students, part-time working by these mothers was associated with 
better child performance. But if mothers were high school graduates, their 
children more often than not did better on the tests if the mothers were not 
working. This result may reflect lower family economic resources and less ade- 
quate child care for the children of the working high school graduate mother. 
For the highly educated woman, family economic resources were presumably 
larger, and working may have presented fewer child care problems. Employ- 
ment may have provided the educated mother with needed stimulation which 
was reflected in the child’s attitude and performance, especially at the high
school level.

The finding that, in general, parent and teacher expectations for the child’s
future education were lower when the mother worked is puzzling. Banducci 
(1) observed that the children of working mothers, except in professional 
families, tended to expect and aspire to more education than the children of 
nonworking mothers. This is true of parents’ and teachers’ expectations in 
the Princeton study in a few instances at the lowest maternal educational 
levels. For most of the Princeton school group, the Banducci observation that 
the children in professional families, particularly boys, aspire to and expect 
more education if their mothers do not work was a more accurate description 
of parent and teacher expectations for their students. Perhaps the Princeton 
mothers worked primarily because of family economic need, and this need 
lowered their expectations for their children’s future education. This relation- 
ship between maternal employment and expectations prevailed, however, for 
the college graduate maternal educational levels where, presumably, families 
were at higher income levels and where educated mothers might have been 
expected to work because they wanted to, not because work was an economic 
necessity. 

On the whole, the Princeton study discovered that maternal employment 
was not inevitably, nor even typically, associated with lowered student 
academic ability and performance. It did find that the age of the child, the 
type of academic ability being measured, and, more importantly, the educa- 
tional attainment of the mother appeared to be factors affecting the relationship 
between maternal labor force participation and child performance. 


E. Summary 


This study of more than 3500 children in the Princeton, New Jersey, public
school system investigated how mother’s working and education affected the
child’s IQ, scholastic aptitude, and academic achievement, as well as the effect
of these maternal variables on parental and teacher expectations for the child’s
future education. Results showed a strong positive correlation between the
mother’s educational attainment and the child variables. The effect on the
student variables of mother’s working full-time, part-time, or not at all was
analyzed at each of seven maternal educational levels. Mother’s working was
not found to be significantly related to child ability and performance in
four-fifths of the comparisons made. Working was, however, typically associated
with significantly lowered parental and teacher expectations and parental
aspirations for the child. The instances of significant negative relationships
between mother’s working and child scores were concentrated in three
categories: children whose mothers had completed but not gone beyond high school,
children in elementary school, and child scores in academic achievement tests.


A NOTE ON SEX DIFFERENCES IN ADOLESCENTS’
DIVERGENT THINKING* 


Rider College 


HELEN OLIVE


A. Introduction


Guilford (2) reports that females score significantly higher than males on 
three of the isolated factors of divergent production: divergent symbolic units, 
divergent semantic units, and divergent semantic systems (Table 16.1, 
p. 404). Since females have often been reported to be more fluent than males, 
the emergence of their superiority on these three fluency factors contained
apparent support for prior evidence. Guilford also proposes that many of the
structure-of-intellect tests “be given to the two sexes in order to make more 
substantial tests of the generalizations that have been suggested and possibly 
to find others” (p. 406). 


B. Method


The present investigation utilized a sample of 434 high school students 
(M = 197, F = 237) with a broad representation of both social class and 
general intelligence (5, 6). Grade representations were as follows: 9th—50; 
10th—147; 11th—133; and 12th—104. All students were administered seven
of Guilford’s verbal tests of divergent production: Word Fluency—Divergent 
Symbolic Units; Expressional Fluency—Divergent Semantic Systems; As- 
sociational Fluency—Divergent Semantic Relations; Ideational Fluency— 
Divergent Semantic Units; Alternate Uses—Divergent Semantic Classes;
Consequences-Obvious—Divergent Semantic Units; and
Consequences-Remote—Divergent Semantic Transformations. The factor loadings for the
structure-of-intellect model reported for each test are derived from Guilford 
and Hoepfner (3). A test of general intelligence, the Otis Quick Scoring
Mental Abilities Test, New Edition, 1954, had already been given to all
the students as part of the school’s group testing program.


C. Results


Tests for the means of males vs. females for all of the divergent thinking 
tests and for IQ are reported in Table 1. Girls did significantly better than
boys at the .01 level of statistical significance on all four of the fluency tests— 
Word, Expressional, Associational, and Ideational—and also on the Con- 
sequences-Obvious test. There were no significant differences between males 
and females on general intelligence as measured by the Otis IQ, nor on the
remaining two tests (see Table 1). 


TABLE 1
MEANS, STANDARD DEVIATIONS, AND SIGNIFICANCE OF SEX DIFFERENCES FOR
TESTS OF DIVERGENT THINKING AND IQ
(M = 197, F = 237)


                        Guilford
                        factor          Mean              SD
Test                    symbol*       M       F        M       F        t
Word Fluency            DSU         38.25   42.88    12.60    9.79    4.30**
Expressional Fluency    DMS          4.81    5.87     3.22    3.06    5.57**
Associational Fluency   DMR         13.40   15.22     5.61    5.59    3.37**
Ideational Fluency      DMU         44.77   52.05    12.64   12.15    6.07**
Alternate Uses          DMC         18.18   19.49     7.74    7.66    1.78
Consequences-Obvious    DMU         39.76   46.11    15.26   16.44    4.09**
Consequences-Remote     DMT          9.03    9.60     6.20    6.10     .97
IQ—Otis                            103.17  105.46    12.65   12.71     .00

* Explanation of symbols: DSU = Divergent Symbolic Units; DMS = Divergent
Semantic Systems; DMR = Divergent Semantic Relations; DMU = Divergent
Semantic Units; DMC = Divergent Semantic Classes; DMT = Divergent Semantic
Transformations.
** p < .01.
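The comparison of male and female means can be illustrated from the summary statistics alone. The sketch below assumes a pooled-variance Student's t for independent groups; the text does not state which form of the t test was used, and the function name `pooled_t` is hypothetical. It is applied to the Word Fluency row of Table 1.

```python
import math


def pooled_t(mean_1, sd_1, n_1, mean_2, sd_2, n_2):
    """Student's t for two independent groups, computed from summary statistics.

    Assumes equal population variances (pooled estimate); this is an
    assumption of the sketch, not something stated in the article.
    """
    df = n_1 + n_2 - 2
    pooled_var = ((n_1 - 1) * sd_1 ** 2 + (n_2 - 1) * sd_2 ** 2) / df
    standard_error = math.sqrt(pooled_var * (1 / n_1 + 1 / n_2))
    return (mean_2 - mean_1) / standard_error, df


# Word Fluency row of Table 1: males first, females second.
t_value, df = pooled_t(38.25, 12.60, 197, 42.88, 9.79, 237)
print(f"t = {t_value:.2f}, df = {df}")
```

Under these assumptions the Word Fluency value comes out close to the tabled t, and a positive sign indicates a higher female mean.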


D. Discussion 


Thus, Guilford’s (2) results were confirmed for sex differences on three 
of his extracted factors: Divergent Symbolic Units—Word Fluency; Diver- 
gent Semantic Units—Ideational Fluency; and Divergent Semantic Systems 
—Expressional Fluency. Moreover, the present study reports female superi- 
ority on two additional tests not reported by Guilford previously: Divergent 
Semantic Relations—Associational Fluency; and Divergent Semantic Units 
—Consequences-Obvious. Since Consequences-Obvious shares a loading on 
Divergent Semantic Units with Ideational Fluency, we have further confir- 
mation of female superiority on this factor. The additional female edge on 
Divergent Semantic Relations, a new female superior factor, may be related 
to the feminine ability to excel at Convergent Semantic Relations, which has
been reported by Guilford on such tests as Opposites and Verbal Analogies
(2). Since the sample in the present study was quite large and representative
of a wide distribution of both social class and general intelligence (5, 6), 
the reported female superiority on five of the seven tests of divergent thinking 
(representing four divergent factors) may plausibly be generalized to other 
adolescent groups of similar social class and IQ distributions.

These results, seen together with Guilford’s (2) data, begin to present a 
stronger picture of female superiority on a number of the verbal divergent 
production factors. One is left puzzled, however, by the characterization of 
females as capable of divergent thought on the one hand and, conversely, as
rather “mediocre” in their creative efforts thus far, at least as these are 
evaluated via productivity and eminence (4). The temptation to explain the 
discrepancy in terms of dependent female role behavior (1) becomes too 
enticing to dismiss entirely. Nevertheless, further studies of sex differences 
in divergent thinking, especially with figural and behavioral content, should 
be undertaken before we leap to premature environmental and/or hereditary 
explanations. 


E. Summary 


An exploration of sex differences in verbal divergent production in a sample
of 9th through 12th grade adolescents (M = 197, F = 237) is reported. 
Seven of Guilford’s factor-analyzed verbal tests of divergent thinking were 
used. Girls were demonstrated to be superior to boys on five of the seven 
measures of divergent thinking at the .01 level of statistical significance, 
although there were no significant differences between these samples of males 
and females in general intelligence as measured by the Otis IQ. The tests
revealing female superiority, along with their Guilford factors, were as 
follows: (a) Word Fluency—Divergent Symbolic Units; (b) Expressional
Fluency—Divergent Semantic Systems; (c) Associational Fluency—Diver- 
gent Semantic Relations; (d) Ideational Fluency—Divergent Semantic 
Units; and (e) Consequences-Obvious—Divergent Semantic Units.


AN EXPERIMENTAL INVESTIGATION
OF HITCHHIKING* **


Department of Psychology, Southern Methodist University 


PETER CRASSWELLER, MARY ALICE GORDON, AND W. H. TEDFORD, JR.


A. Introduction


There are many areas of social concern in which theorizing has been 
done primarily on the basis of personal experience rather than systematic 
investigation. Hitchhiking is such an area. The literature reveals a number
of variables that individual authors have felt to be important. For example, 
it has been claimed that female hitchhikers receive rides more quickly than 
males (2). Other strategies alleged to produce better results include the 
following: hitchhiking during the day rather than at night (2, 3, 8); carrying
as little baggage as possible (3, 8, 10); using a sign to indicate one's
destination (3); and traveling alone (3). The only attempt at experimental
investigation (1) supports the values of being female, traveling alone, and wearing
traditional clothing. 

The present study was undertaken primarily to verify several hypotheses 
about hitchhiking, but also to demonstrate that strict experimental procedures 
could be maintained in a fluid social situation. The variables selected for
investigation were (a) clothing of the hitchhiker, (b) city he attempted to
hitchhike from, and (c) socioeconomic section of the city. Drivers who offered
rides were analyzed according to liberalism, age, sex, number of passengers, 
and reason for offering the ride. 


B. Method


A factorial analysis of hitchhiking was undertaken in Dallas, Texas, and 
Stamford, Connecticut. The lower, middle, and upper socioeconomic sections 
were defined by use of the Dallas Population Handbook: 1960 (6) for Dallas 
and the U.S. Census of Population and Housing: 1960 (9) for Stamford as
guides. Socioeconomic sections were selected by obtaining the median family 
income for each census tract. Some census tracts constellated together into 
certain economic ranges, and these constellations formed the basis for
designating socioeconomic sections. In Dallas, where the median family income is
$5976, the following ranges were selected to represent the lower, middle, and
upper socioeconomic classes, respectively: $2631-3938; $4476-7646;
$10,526-25,000. In Stamford, where the median family income is $7728, the
corresponding ranges were $5283-5953; $7651-8196; $14,300-17,560.

The same hitchhiker was used for all conditions. Two types of clothing 
were worn. The hip hitchhiker’s clothing consisted of bell-bottoms, a peace 
shirt, beads, and a bandana around the forehead to keep shoulder-length hair 
in place. The hip hitchhiker wore no shoes and socks, and had a cigarette. The 
well dressed hitchhiker’s clothing consisted of socks, shoes, pressed slacks, 
shirt, tie, and jacket. The hair looked short and well groomed. (Long hair 
became short hair by pulling back the hair behind the ears, making a pony 
tail, and tucking the pony tail inside the shirt.) 

Only one-lane streets were chosen. Only pleasant days were chosen to
control for weather; to control for the effects of rush-hour and week-end traffic,
hitchhiking was scheduled Monday through Friday from 9:00 to 12:00 a.m.

The hitchhiker went to a particular socioeconomic section under one 
clothing condition and counted the number of cars (excluding taxicabs and 
police cars) that went by before someone offered him a ride. This procedure 
was repeated eight times within each clothing condition, socioeconomic condi- 
tion, and city condition. When a car stopped, the hitchhiker stepped into the 
car and verbally administered McClosky’s conservatism-liberalism scale (5). 
Additionally, the following information was recorded: age and sex of driver, 
number of passengers, and the reason why the driver offered a ride to the 
hitchhiker. The purpose of the experiment was explained to the driver. The 
hitchhiker then left the car and returned to his starting point. In each city 
and socioeconomic section, 100 passing cars were observed to get the expected
frequencies of the sex of the driver and the number of passengers, if any. 

Data were collected from Dallas in April, 1971, and from Stamford in
May, 1971; the total sample consisted of 96 observations.


C. Results and Discussion


A three factor, independent measures design was calculated to test for the 
effects of different clothing, socioeconomic sections, and cities. The dependent 
variable was the number of cars that went by before one stopped. The average 
values for this variable are presented in Table 1. 
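The degrees of freedom reported in the analyses that follow can be checked against the design described in the Method section: two clothing conditions, three socioeconomic sections, two cities, and eight repetitions per cell. The sketch below is only a bookkeeping illustration; the function name `factorial_anova_df` and the factor labels are hypothetical.

```python
from math import prod


def factorial_anova_df(levels, n_per_cell):
    """Degrees of freedom for a fully crossed, independent measures design
    with an equal number of observations in every cell."""
    cells = prod(levels.values())
    total_n = cells * n_per_cell
    return {
        "total_n": total_n,
        # Each main effect has (number of levels - 1) degrees of freedom.
        "main": {name: k - 1 for name, k in levels.items()},
        # Error df is the total sample minus the number of cells.
        "error": total_n - cells,
    }


design = factorial_anova_df({"clothing": 2, "section": 3, "city": 2}, n_per_cell=8)
print(design)
```

With these inputs the total sample is 96 and the error term has 84 degrees of freedom, which agrees with the df = 1/84 and df = 2/84 values reported for the main effects.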

There was a significant difference between styles of clothing (F = 8.92,
df = 1/84, p < .01). The well dressed hitchhiker received rides significantly
more quickly than the hip hitchhiker. This result is consistent with the finding
that drivers are more amenable toward offering a ride to the well dressed
hitchhiker (1).


TABLE 1
MEAN NUMBER OF CARS PASSING WELL DRESSED AND HIP HITCHHIKERS IN DALLAS
AND STAMFORD BY SOCIOECONOMIC SECTION OF CITY


              Well Dressed Hitchhiker        Hip Hitchhiker
City          Lower   Middle   Upper     Lower   Middle   Upper
Dallas         45.3    25.6     28.6      84.6    51.8     19.2
Stamford       35.6    23.2      6.2      68.6    45.0      6.3

There was a significant difference between socioeconomic sections (F =
16.38, df = 2/84, p < .01). The hitchhiker received rides significantly more
quickly when he was in the upper socioeconomic section, followed by the 
middle and the lower socioeconomic sections. One possible reason for having 
to wait the longest when hitchhiking in the lower socioeconomic section is 
that the hitchhiker was white. In Dallas, approximately 85% of the drivers 
in this section were black, whereas only 70% of those who stopped to offer 
rides were black. In Stamford, the comparable figures were 60% and 20%. 

There was a significant interaction between different styles of clothing and 
different socioeconomic sections (F = 3.80, df = 2/84, p < .05). In the
lower and middle socioeconomic sections of Dallas and Stamford the hitch- 
hiker waited longest for a ride when he was dressed hip. In the upper socio- 
economic section of Dallas, the hitchhiker waited longest for a ride when he 
was well dressed. Perhaps this result is due to a larger number of hippies and 
activists coming from upper socioeconomic sections (4). Over all conditions, 
there was a tendency for the hitchhiker in Stamford to be offered a ride more 
quickly than the hitchhiker in Dallas (F = 3.53, df = 1/84, p < .10).
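The main effects and the interaction described above can be read off the cell means in Table 1. Because every cell holds the same number of observations, a marginal mean is simply the unweighted average of the relevant cell means. The sketch below illustrates this; the dictionary layout and the helper name `marginal_mean` are hypothetical.

```python
from statistics import mean

# Cell means from Table 1: cars passing before a ride was offered.
cells = {
    ("Dallas", "well dressed"): {"lower": 45.3, "middle": 25.6, "upper": 28.6},
    ("Dallas", "hip"): {"lower": 84.6, "middle": 51.8, "upper": 19.2},
    ("Stamford", "well dressed"): {"lower": 35.6, "middle": 23.2, "upper": 6.2},
    ("Stamford", "hip"): {"lower": 68.6, "middle": 45.0, "upper": 6.3},
}


def marginal_mean(city=None, clothing=None, section=None):
    """Average the cell means that match the given factor levels.

    A factor left as None is averaged over. Valid as a marginal mean
    only because all cells have equal n.
    """
    values = [
        value
        for (cell_city, cell_clothing), row in cells.items()
        for cell_section, value in row.items()
        if city in (None, cell_city)
        and clothing in (None, cell_clothing)
        and section in (None, cell_section)
    ]
    return mean(values)


for style in ("well dressed", "hip"):
    print(style, round(marginal_mean(clothing=style), 1))
for level in ("lower", "middle", "upper"):
    print(level, round(marginal_mean(section=level), 1))
```

Fixing two factors at once, for example `marginal_mean(city="Dallas", section="upper", clothing="hip")`, returns a single cell, which is how the reversal in the upper section of Dallas shows up.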

By means of a chi-square goodness of fit, with Yates’ correction formula,
results from the conservatism-liberalism questions were compared to a
national distribution of replies for the first six items of McClosky’s scale (7).
These drivers gave answers to each item that were significantly more liberal
than those of the average person over all (χ² values, all with 1 df, were 61.3,
p < .001; 19.3, p < .001; 7.3, p < .01; 23.0, p < .001; 14.3, p < .001;
17.6, p < .001). Thus, the authors infer that drivers who offer rides to the
hitchhiker will be liberal.
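The goodness-of-fit statistic with Yates' correction subtracts 0.5 from each absolute difference between observed and expected frequencies before squaring. The sketch below shows the computation for a two-category item (1 df). The counts and the national proportions are hypothetical and are not taken from the study; the function name `yates_chi_square` is likewise an assumption of this illustration.

```python
# Upper-tail critical values of chi-square with 1 df.
CRITICAL_1DF = {0.05: 3.841, 0.01: 6.635, 0.001: 10.828}


def yates_chi_square(observed, expected):
    """Chi-square goodness of fit with Yates' continuity correction.

    Intended for the two-category (1 df) case.
    """
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((abs(o - e) - 0.5) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical example: 30 liberal and 10 conservative replies to one item,
# compared with an assumed 50/50 national split.
observed = [30, 10]
national_share = [0.5, 0.5]
expected = [share * sum(observed) for share in national_share]

chi2 = yates_chi_square(observed, expected)
smallest_alpha = min(
    (alpha for alpha, cutoff in CRITICAL_1DF.items() if chi2 > cutoff),
    default=None,
)
print(f"chi-square = {chi2:.3f}, significant at alpha = {smallest_alpha}")
```

The same function would apply to the later comparisons of driver sex and number of passengers, with the expected frequencies taken from the 100 passing cars observed in each section.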

A three factor, independent measures design was calculated to find if there 
were any significant differences in the degree of liberalism among drivers who 
offered a ride to the hitchhiker in regard to different styles of clothing,
socioeconomic sections, and cities. There was a significant difference between styles
of clothing (F = 6.02, df = 1/84, p < .05). Although the drivers were
more liberal than the average person over all, the driver who offered a ride 
to the hip hitchhiker was significantly more likely to have a greater degree of 
liberalism than the driver who offered a ride to the well dressed hitchhiker. 

There was a significant interaction in the degree of liberalism between dif- 
ferent cities and different socioeconomic sections (F = 4.65, df = 2/84, 
p < .05). In Dallas the most liberal drivers came from the middle
socioeconomic section, while in Stamford the most liberal drivers came from the
lower socioeconomic section. 

A three factor, independent measures design was calculated to test for dif- 
ferences among the ages of drivers who offered a ride to the hitchhiker in 
regard to different styles of clothing, socioeconomic sections, and cities. There 
was a significant difference among the ages of drivers in regard to different 
styles of clothing (F = 6.85, df = 1/84, p < .05). Drivers who offered a 
ride to the hip hitchhiker were younger than drivers offering a ride to the 
well dressed hitchhiker. Sixty-six per cent of the drivers who offered a ride to 
the hitchhiker were 29 years old or younger. 

A chi-square goodness of fit, with Yates’ correction formula, was calculated 
to find if there was a significant difference between the sexes of the drivers 
who offered a ride to the hitchhiker. There was a significantly greater
probability of male drivers offering a ride to the hitchhiker (χ² = 65.03, df = 1,
p < .001). This might have been influenced by the sex of the hitchhiker. 

A chi-square goodness of fit, with Yates’ correction formula, was calculated 
to find if there was a significant difference due to the number of passengers 
with the driver when the driver offered a ride to the hitchhiker. A significant 
number of drivers had no other passengers in the car (χ² = 13.45, df = 1,
p < .001). Perhaps the major reason for this occurrence is that the driver
had more time to think about the plight of the hitchhiker and how it feels 
to be hitchhiking. This hypothesis is consistent with the most frequent (34%) 
reason cited for the driver’s offering a ride to the hitchhiker: “I have hitch- 
hiked before and know how it feels.” The second most frequent reply was the 
way a person looked (22%). For example, one reply was, “You were dressed 
nicely . . . I don’t pick up any kids who are shabbily dressed.” The third 
most frequent reply was that the driver had a habit of picking up
hitchhikers (11%).


D. Summary


A parametric analysis of hitchhiking was undertaken to assess the effects of
differences in styles of clothing, socioeconomic sections, and cities.
Characteristics of drivers offering rides to the hitchhiker were explored by
examining the following variables: liberalism, age, sex, number of passengers, and
the reason why drivers offer a ride to the hitchhiker. The same person did
the hitchhiking for all conditions. Results indicate that a hitchhiker will
receive a significantly quicker ride if he is well dressed (p < .01) and if he
hitchhikes in the upper socioeconomic section of the city (p < .01). Drivers
who offer a ride to the hitchhiker are more liberal than the average person
over all (p < .01). The hitchhiker who is not well dressed is more likely
to receive rides from a liberal driver (p < .05) than is the well dressed
hitchhiker. The hitchhiker will receive a quicker ride if the driver is alone
(p < .001) and if the driver is a male (p < .001). The major reason cited for
the driver’s offering a ride to the hitchhiker was that the driver had
hitchhiked before and knew how it felt (34%).


A. Introduction

In 1960 the first named author began a study (2) the stated purpose of 
which was to determine the ability of some persons over the age of 65 to 
learn oil painting and, further, to note whether any changes in other activities 
and interests would appear to be related to the engagement in painting.
Although the study was conducted under the announced purpose, the present
writers had in mind a longer range goal: the possibility of learning at later 
points in time whether the older persons who were the experimental sub- 
jects might be found to have fared any differently in certain important
respects from their carefully matched counterparts in a comparison group. 
This article will report the findings of the follow-up investigation with only 
as much reference to the original study as is needed for clarity of background. 

The writers began the original study with little doubt about the ability of 
older persons to gain proficiency in such a learning experience as oil painting. 
Earlier studies (e.g., 3, 5, 9) had shown that older people “can learn quickly, 
thoroughly, and well those things they want to learn if the learning situations 
are not inconsistent with physical, mental, and emotional capacities” (9, 
p. 117). The principal question was, therefore, whether any significant dif- 
ference would be found between the experimental subjects and the older com- 
parison subjects in the kinds and number of interests and activities in which 
they engaged during the course of the experiment. For example, would the
experimental subjects engage more frequently in various planned events (con- 
certs, literary presentations, and the like) of the apartment complex in which 
certain of them lived? Would there be a difference in the frequency and kinds 
of informal visitation, or the arranging of parties, or attendance at events 
(theatre, for example)?


B. The Oil Painting Experience


Oil painting was selected for the experimental learning task because it 
provided opportunity for engagement in potentially creative activity and
because it was not beyond the physical ability of most persons whose age was
over 65. Arrangements were made to have the instruction given by two mem- 
bers of the faculty of the art department of the University of Omaha. Classes 
were held in the regular classrooms of the art department of the University. 
All classes met once per week. The period of instruction covered 18 weeks 
beginning the first week in February, 1960.


1. Selection of Subjects 


Actually, there were three groups of subjects in the study. One was the 
experimental group. The other two served as comparison (control) groups. 

a. The experimental subjects and senior controls. Men and women over 
the age of 65 who had no previous instruction in oil painting were invited 
to participate in the study. The invitation was presented at a meeting of the
Omaha Retired Teachers Association. It was also presented to administrators 
of retirement homes. As a result of the invitation 58 older persons (54 
women and four men) volunteered for enrollment in the study. Thirty-five 
women and two men were included in the experimental group. Nineteen 
women and two men who volunteered for the study constituted the senior 
control group. 

One fact about the senior control group imposes a shortcoming on the 
design of the study. It is that the members of this group cannot be said to 
have demonstrated the same level of motivation to enroll in the art class 
as the members of the experimental group. Their reasons for choosing not to 
enroll were numerous; they included, for example, the conflict in time be- 
tween the art classes and certain other planned activities. The reasons did 
not, however, include health factors. 

For the purpose of securing as much evidence of comparability between 
the two senior groups as possible, all persons in both groups were given four 
different art-related measurements. These were administered before the final 
selection for the experimental group. The art-related measurements that were 
given before the experiment began included (a) the Meier Art Tests: I—Art 
Judgment (7); (b) the Graves Design Judgment Test (4); (c) two parts 
of the Allport, Vernon, and Lindzey Study of Values (1); and (d) a
questionnaire on interests and activities. The Allport, Vernon, and
Lindzey scales and the questionnaire were repeated with both senior groups
at the close of the experiment.

Analysis of variance was employed to test the significance of the differ- 
ences between the two senior groups on the scores for the Meier Art Test 
and the Graves Design Judgment Test. The differences were not statistically 
significant. The median test described by Siegel (8, pp. 111-115) was used to
determine whether significant differences existed between the scores of the 
two groups on the aesthetic and social values scales of the Allport, Vernon, 
and Lindzey instrument. The differences were not statistically significant. 
The sign test (8, pp. 68-75) was employed for the same purpose to compare 
the two scales before and after the experiment. No statistically significant dif- 
ferences were evident for the pre-experimental measurements. 

b. Age of the senior subjects. The median age of the subjects was the 
same for both groups; it was 70. The range of age in the experimental group 
was 65 to 89; in the control group it was 65 to 80. 

c. Student control group. One of the principal questions concerned the
calibre of the older persons’ learning of oil painting. This question
led to the inclusion of two classes of undergraduate students at the
University who had enrolled in oil painting as part of their program of
courses. There was a total of 33 students in the two classes. Their instructors,
the place and kind of instruction, and the length of it were the same as were 
provided for the older persons. All the measurements reported above for the 
senior groups were given at the beginning of the term to the student classes. 
The questionnaire was not given to the students. Tests of significance of 
difference between the scores of the student group and the groups of older 
persons, prior to the experiment, showed no differences to be statistically 
significant. 


2. Results of the Original Study 


Six main findings were extracted from the data of the original study. 
Briefly summarized they are as follows: 

1. In comparison with the student controls, the older subjects earned a 
higher average score—not statistically significant—on midterm ratings of 
performance. At the close of the instruction their average rating was lower 
(statistically significant difference) than the average for the student group. 
However, the two most highly rated paintings were done by members of the 
older group. The judges (experienced, widely known, art critics) reported 
that the two paintings would have competed very favorably in a regional art 
exhibit. 

2. The mean score of the experimental group on the Meier art judgment 
test at the end of the term was significantly higher (statistically so) than at 
the beginning of the experiment. The same was true for the student group. 
There was no significant difference between the two groups. The Graves test 
was not given to the college students at the end of the semester. 

3. The experimental group significantly exceeded the student control group
in the measures of satisfaction derived from the study of art and in subjective 
ratings of their accomplishments. Twenty-three of the 30 experimental sub- 
jects who completed the instruction reported more satisfaction than they had 
expected, and 19 stated that they accomplished more than they had expected. 
Of the 33 college students, 15 reported more satisfaction than expected; 10 
stated they accomplished more than expected. 

4. At the close of the instruction the experimental group showed a greater 
increase than the senior control group in activities that gave them satisfaction 
although, with the exception of one activity, the differences were not statis- 
tically significant. The exception: the experimental subjects were markedly 
more inclined to attend art exhibits. 

5. There was at the close of the term no significant difference between the 
experimental subjects and the senior controls in their involvement in social 
activities. 

6. At the end of the experiment all but one of the experimental subjects 
planned to continue the study of art, and all recommended such a study for 
persons of their age. These older persons’ claims of satisfaction were con- 
firmed when, at a later date, they planned and personally conducted an 
exhibit in which two paintings of each of the 30 subjects were shown; more 
than 300 persons attended the exhibit. Further confirmation of their interest 
and satisfaction was given in the fact that the older persons arranged, on 
their own, for continuance of the course in oil painting. When the findings 
of the original study were being written, 15 of the experimental subjects had 
enrolled for an eight week course that would start in September, 1960.
Twenty had asked to be enrolled in a similar term of instruction beginning
early in 1961.


3. Physical Fitness 


It was not part of the original study to attempt any assessment of dif- 
ferences in change in the health and physical condition of the experimental 
subjects vis-à-vis the senior control subjects. The relatively limited time
during which the experiment would run seemed to make such an objective 
less than likely to produce meaningful results. Addition of this objective 
would have called for professional services or the sharing of information that 
would be difficult to procure. Only some quite subjective observations were 
made (and carefully recorded) without being entered into the final report.
Such data proved to be of value, however, in the follow-up study. Two 
summary statements related to physical condition must suffice at this point. 


Only three of the original 37 experimental subjects had to withdraw from
the study because of poor health. Four more withdrew from the study for
such reasons as the severity of the weather in getting to and from classes, and 
the care required by a sick grandchild. 


C. Findings from the Follow-Up Studies


The material in this part of the report will pertain to the older subjects 
of the study; it does not involve the college student groups. More specifically, 
the findings pertain to the 30 older experimental subjects who completed the 
art instruction of the original study plus the 21 senior control subjects. 
Follow-up investigations were conducted in 1962, in 1965, and in 1971. 

The principal foci of study at the three different time periods were (a) 
number of persons living; (b) health status; (c) continuance of involvement 
in painting and involvement in art exhibits; (d) extent of social activity; and
(e) involvement in creative activity other than painting. 


1. Number of Persons Living and Their Age 


The data of Table 1 invite special attention. Twenty-four months after 
the completion of the experiment there were two deaths in the combined 
study group—the experimental and senior control subjects. One member of 
the experimental group had died and one of the controls. But three years 
later the picture changed considerably. There had been no additional deaths
in the experimental group; in the control group six persons, or 29%, had
died. By 1971, 67% of the experimental subjects were living and 38% of the
control subjects were known to be living. Three control subjects could not
be located although much effort was expended to find them.


                               TABLE 1
        COMPARISON OF DEATH RATES AT THREE FOLLOW-UP INTERVALS
                      FOR SUBJECTS OF THE STUDY

                                  1962         1965         1971
Subjects                        N    %       N    %       N    %

Experimentals known living     29   97      29   97      20   67
Experimentals known deceased    1    3       1    3      10   33
Controls known living          19   90      13   62       8   38
Controls known deceased         1    5       6   29      10   48
Controls not located            1    5       2   10       3   14

The median age of the experimental subjects who were living in 1971 was
79.5. For the control subjects it was 82. The ranges in years for the two
groups, respectively, were 76 to 87 and 78 to 89. 
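
The percentages in Table 1 follow directly from the counts and the original group sizes (30 experimental subjects who completed instruction, 21 senior controls). The short Python sketch below recomputes the "known living" percentages; the dictionary and function names are illustrative only and are not part of the original report.

```python
# Counts of subjects known to be living, transcribed from Table 1.
GROUP_SIZE = {"experimental": 30, "control": 21}

KNOWN_LIVING = {
    "experimental": {1962: 29, 1965: 29, 1971: 20},
    "control": {1962: 19, 1965: 13, 1971: 8},
}


def percent_living(group, year):
    """Percentage of the original group known to be living, as a whole number."""
    return round(100 * KNOWN_LIVING[group][year] / GROUP_SIZE[group])


for year in (1962, 1965, 1971):
    print(year, percent_living("experimental", year), percent_living("control", year))
```

Control subjects who could not be located are counted in the denominator here, which is how the 38% figure for 1971 is obtained (8 of 21).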


2. Health Status 


The information pertaining to the health of the subjects at the several 
follow-up dates has, of necessity, a degree of subjectivity about it. But sub- 
jective as the information was, the judgments were no more subjective in the 
case of one group than the other. The comparisons are presented in Table 2. 


                                   TABLE 2
                      HEALTH STATUS OF ELDERLY SUBJECTS

                             1962                 1965                 1971
                       Exper.   Control     Exper.   Control     Exper.   Control
                       (N=29)   (N=17)      (N=29)   (N=13)      (N=20)   (N=8)
Health status          N    %   N    %      N    %   N    %      N    %   N    %

Excellent              a        a          [?]  [?] [?]  [?]     4   20   0    0
Good                   a        a           16   55   6   46     9   45   1   12
Fair                   a        a            6   20   3   23     7   35   5   62
Poor                   a        a          [?]    4   1    8     0    0   2   25

Mentally alert        29  100  17  100      28   96  11   84    20  100 [?]  [?]
Not restricted
  to bed              29  100  16   94      29  100  13  100    20  100   5   62

a No report for 1962.
[?] Entry illegible in the source.


3. Continued Involvement in Painting 


Of the 29 experimental subjects who were living in 1962, 21 (72%) were 
still involved in painting. Seventeen of them (60%) were enrolled in instruc- 
tional classes. By 1965 the numbers had dropped to 17 (60%) who were still 
involved in painting and four (14%) who were enrolled in instruction. 

In 1971 there were 11 (55%) who were continuing their oil painting and 
three (17%) who were taking instruction. Of the nine persons who were no 
longer involved in painting in 1971, five discontinued because of poor health,
one turned to color photography, one became “too busy with club activities,”
and two no longer cared to paint. 


4. Participation in Art Exhibits 


In 1962, 21 persons (72%) exhibited paintings in such places as a senior
citizens center, a retirement home, several churches, the Art Department of
the University of Omaha, and in a show sponsored by the American Association
of University Women. The only subject who was deceased in 1962 had held
an exhibit of her paintings at her own home only a few weeks prior to her
death at age 92. 

No systematic record could be kept of the paintings exhibited between 
1962 and 1965. In 1971 there were six individuals who were involved in 
exhibiting paintings. One of the six is currently (August 1971) in charge 
of an art exhibit. Two expected to exhibit paintings at some later date (later 
than the date of this writing). Four or five of the elderly persons had sold 
numerous paintings.


5. Extent of Social Activities 


At the three follow-up dates, evaluations were made of the extent to which 
the members of the senior subject groups were involved in social (“social- 
izing”) activities. The evaluations were based on a scale devised by Robert 
J. Havighurst and titled Uses of Leisure Time (6). Table 3 presents data 
related to the social activities.


                                  TABLE 3
          NUMBER OF SENIOR SUBJECTS INVOLVED IN SOCIAL ACTIVITIES
                       AT DIFFERENT FOLLOW-UP DATES

                               1962               1965               1971
                          Exper.  Control    Exper.  Control    Exper.  Control
Extent of involvement     (N=29)  (N=17)     (N=29)  (N=13)     (N=20)  (N=8)

High                         6       1          5       1          5       0
Above average                9       5          8       4          5       0
Medium                      10       8         13       4          5       3
Below average                4       2          2       3          5       2
Low                          0       1          1       1          0       3

The most noteworthy statistic in Table 3 is the consistently greater percentage
of the experimental persons who rated “high” or “above average” in
comparison with such ratings for the control persons. This advantage persists
throughout the 11 years of the study. For the dates 1962, 1965, and 1971, the
percentages for the experimental and control groups were 52% vs. 35%,
45% vs. 39%, and 50% vs. 0%.
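
The comparison just described can be recomputed from the counts in Table 3. The sketch below is illustrative only (the names `TABLE_3` and `percent_high_or_above` are not from the original report); it sums the "High" and "Above average" rows for each group and divides by the group size at that follow-up date.

```python
# Counts transcribed from Table 3, ordered High, Above average, Medium,
# Below average, Low.
TABLE_3 = {
    1962: {"experimental": [6, 9, 10, 4, 0], "control": [1, 5, 8, 2, 1]},
    1965: {"experimental": [5, 8, 13, 2, 1], "control": [1, 4, 4, 3, 1]},
    1971: {"experimental": [5, 5, 5, 5, 0], "control": [0, 0, 3, 2, 3]},
}


def percent_high_or_above(counts):
    """Share of a group rated High or Above average, in percent."""
    return 100 * (counts[0] + counts[1]) / sum(counts)


for year, groups in TABLE_3.items():
    exper = percent_high_or_above(groups["experimental"])
    control = percent_high_or_above(groups["control"])
    print(f"{year}: experimental {exper:.1f}%, control {control:.1f}%")
```

The 1965 control value computes to roughly 38.5% (5 of 13), which the text reports as 39%; the other five figures round to the percentages given above.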


6. Participation in Creative Activity 


For the three follow-up periods it was possible to secure information about 
the subjects with regard to creative activities other than the involvement in 
painting. The data would help answer the question whether the experimental 
subjects tended to restrict themselves to painting-related activities. 

What kinds of activities were identified, for the purposes of the study, as
“definitely creative”? Examples are the following: traveling with a camera
to make slides for lectures, work in ceramics, giving assistance to the blind, 
teaching needlework, teaching French on a part-time basis, serving as trea- 
surer of the local Artists Association. 

Seventeen of the 29 experimental subjects for whom data were secured in
1962 were engaged in these kinds of interests. Some of these 17 were also
among the persons who were continuing at that time with their painting. Of 
the 19 controls who were known to be living in 1962, seven were engaged in 
creative activities on the order of the examples given. These figures mean that 
approximately 60% of the experimental persons were taking part in creative 
activities; 36% of the controls were thus engaged. Had the painting activity 
not been omitted from the summary for the experimental persons, the differ- 
ence between the two groups would have been still greater. Equally impressive 
as the experimental subjects’ 1962 record of creative activity is their record for 
1965. Although there were four fewer persons involved in the “definitely crea- 
tive activities,” the kinds of things these persons were doing seemed quite re- 
markable. Some examples were the following: director of two choirs and state 
director of a national organization (one person), president of a women’s club, 
regular guide for the Joslyn Art Museum art tours, active member of the 
local Great Writers group. In 1965 there were five of the control subjects 
who were involved in activities that rated “definitely creative.” 

Perhaps the 1971 record of creative activity for the experimental subjects
is even more noteworthy. As of that follow-up date there were 10 persons
(50%) who were engaged in such activities as (a) local Artists Club, PEO
Sisterhood, Eastern Star, Daughters of the Nile (quite active in each of
these); (b) organization and direction of a schedule of art exhibits for
American Association of University Women; (c) direction of a choir of
persons over 65 years of age with occasional concerts given out of the city;
(d) presentation of numerous book reviews; and (e) photography: preparation
of slides made in every region of the United States. By contrast only two
of the eight control subjects known to be living in 1971 were involved in any 
activity that qualified as “creative.” One of these held office in an organization, 
but was able to do very little of the work related to the office. 


D. Discussion 


Acceptance of the findings of the study at face value invites the conclusion 
that a set of conditions was generated by the engagement in painting that
fostered several salutary results. The favorable results are indicated in the
data that have been summarized.

There is a note of caution in the expression “at face value.” Much of the
credibility of the findings hinges on the assumption that the two groups of
older persons were indeed quite comparable in the most important ways. 
Clearly, there would need to be relatively little imbalance in, say, health 
factors at the beginning of the experiment to void some of the principal 
findings. The authors can only reassert the fact that they tried to take every 
precaution to secure two groups that were closely similar, not only in motiva- 
tion related to enrollment in painting but also in physical fitness. If there were 
any disadvantage for either group, it might be the greater extension of age in 
the experimental group at the outset of the study. The oldest of the experi- 
mental subjects were 84 and 89, respectively, whereas the oldest control sub- 
ject was 80. The efforts to control for motivation and the assurance of “no 
difference” in the material of the questionnaire when initially given were 
described earlier in this report. 

The words “set of conditions” underscore the idea that learning to paint,
and even continuing to paint, were not necessarily the main therapeutic factors
that were at work with these elderly persons. As they engaged in painting, 
their associations with other persons took on added dimensions and new 
qualities, and helped to provide considerable validation of their own self- 
significance. In a variety of ways many of the experimental subjects expressed 
to different confidants this sort of assessment of their experience. 

Regardless of the limitations that are present in the study, there is, in any 
case, considerable evidence that a group of elderly persons found much satis- 
faction in a creative learning experience, the spread effect of which was to 
add greatly to the breadth and vitality of their interests and, apparently, to 
the number of their years. 


E. Summary 


The substance of this report evolved from a study initiated in 1960 and 
aimed at determining not only how well persons 65 years of age and older can 
learn oil painting but what other interests might emerge as the art experience 
progressed. Three groups of subjects were involved in the study. Thirty-seven 
men and women whose age was over 65 years comprised the experimental 
group; they engaged in oil painting. Nineteen men and women of similar age 
constituted a control group; they did not participate in oil painting. Two 
classes of college students totalling 33 in number served as a comparison group 
for judging the experimental subjects’ performance in oil painting; the 
college students were enrolled in classes as nearly identical as possible to that 
of the experimental group. 

The present report is focused on the later life experiences of the two groups
of elderly persons. In comparison with the college students’ performance in 
painting, the experimental subjects proved to be slightly less proficient at the 
end of one year’s work. However, they manifested greater interest in art than 
the college students; a much larger percentage of the elderly persons con- 
tinued the instruction in art. 

The experimental subjects showed numerous advantages over the older 
control subjects. With the passing of time, the evidence suggests significant 
relationships between engagement of elderly persons in creative endeavors and 
their condition of health and their longevity. In 1971, 67% of the experi- 
mental subjects were living, as compared to 38% of a control group. In 1971, 
65% of the experimental persons rated “excellent” or “good” on an evaluation 
of health in comparison to 12% of the control group. 


A STUDY OF VISUAL PERCEPTION AMONG AGED PEOPLE*


Southern Connecticut State College; and Masonic Home and Hospital, 
Wallingford, Connecticut 


Louis Lerea and Ulrich K. Rathey


A. Introduction


Perception has been viewed as the intersect of sensation and cognition. This 
complex act refers to the process of selecting, analyzing, and synthesizing 
sensory stimuli so that interpretation may follow. Gestalt psychologists, early 
in this century, focused attention upon visual perception and described this 
function as “the mental fingerprint.” Since that time, tests of visual perception 
have been produced to aid the physician in his diagnosis of cerebral pathologies. 
The Developmental Test of Visual Perception (3), the Visual Motor Gestalt 
Test (1), the Memory-for-Designs (4), and the Visual Retention Test (2) 
are representative of the clinical tools that have been employed for differential 
diagnosis of central deficits among children and adults.

With the geriatric population, Pelz, Pike, and Ames (7) reported that the 
Monroe Reading Readiness Test, Visual III, was a valuable addition to the 
battery of examinations for assessing chronic brain syndrome. The Visual III 
is described by Ilg and Ames (5) and consists of four 11 × 8½ inch cards
with four black designs printed on each plate. The aged patient is given a 
pencil and a blank sheet of white paper. Each of the cards is shown to the 
examinee for 10 seconds and then removed. Following each presentation, he 
is instructed to draw as many of the four designs as he can remember. The
accuracy of the patient’s reproductions is rated from zero to 3; perfect recall 
of the 16 designs is scored as 48 points. 

The Visual III and the tests mentioned previously require the examinee 
to draw. The aged person may not possess the fine motor control necessary 
for negotiating pencil and paper. Furthermore, these psychological examina- 
tions do not provide for a systematic analysis of the patient’s perceptual 
difficulties. The nature of the individual’s visual imperceptions, as well as
his total score, may be of significance in diagnosis and remediation. 

The present study was concerned with the visual perceptual abilities of 
the elderly. More specifically, the intent of this investigation was to deter-
mine whether intact, presenile, and senile groups differed in visual perception 
skills. Is there a characteristic profile of visual perception for each of these 
groups? What is the relationship between visual recognition and recall among 
geriatric people? The prime consideration, of course, was whether a visual 
perception task could be developed that would (a) minimize eye-hand coordi- 
nation, (b) yield profiles of the aged’s visual perception, and (c) remain 
compatible with the Visual III, a test that has been adapted expressly for 
geriatric populations. 


B. Method

1. Participants


Sixty-three members, residing at the Masonic Home and Hospital, were 
selected to serve in this study. The intact, presenile, and senile groups consisted 
of 21 individuals each (13 men and eight women) who were matched for 
education and previous occupation levels. The majority of these elderly people 
had completed seventh grade and were classified, according to the Minnesota 
Scale for Parental Occupations (6), as having been semiskilled workers. They 
ranged in age from 67 to 96; the median ages for the intact, moderately 
deteriorated, and severely deteriorated groups were 84, 83, and 84, respectively.
The intact participants were living in the dormitory where they enjoyed
a large measure of independence. With the exception of preparing their
own meals, these members were free to engage in all programs both within and 
outside the institution. The presenile were so designated by the medical staff. 
These moderately deteriorated people required nursing home care because 
they were no longer able to function in several activities of daily living. Meals 
and medications were brought to them, although they continued to be involved 
in the recreational life of the institution. The presenile included in this investi- 
gation manifested some disorientation in concepts of time. They could not 
remember recent dates or hours when activities were being conducted. The 
senile patients required total care. These aged retained a concept of self and 
could write their names. They were confused, however, regarding both space 
as well as time. The most deteriorated failed to answer appropriately when 
questioned concerning the location of the institution or their living quarters. 
All of the members could read a randomized series of alphabet letters, one 
inch in height, which were printed individually on white cards.


2. Procedure 


To eliminate the need for writing and to obtain a profile of the aged’s 
visual perception skills, a multiple choice exercise was prepared. The Visual 
III is administered routinely at the institution and, therefore, was adopted as
the standard. 

The elderly person was shown a Visual III plate. Following a five second
interval, a second card was exposed in which one of the four standard designs
had been changed: i.e., a design may have been rotated 180 degrees, a part
may have been added to or omitted from one of the original figures, or a 
design was distorted to depict a more familiar configuration. Also included in 
the series were four different sets of identical figures. Each of the four Visual 
III cards was presented five times in random order, and a variable set was 
shown following each exposure of the standard. The aged’s initial task was 
to report whether the paired cards were the “same” or “not the same.” If 
different, he was instructed to support his decision by pointing to the figure 
on the second card that he perceived to vary from the standard set. A practice 
session was given in which three sample sets of geometric forms, two different 
and one the same, were shown to the participant, and the task was explained.
The instructions were as follows: 

I will show you a card which has four designs. Please look at all of the 
pictures until I remove the card. Then I will show you another card. You
are to tell me whether the designs on the second card are the same as 
those you saw on the first card. If they are the same, say “same”; if they 
are different, say “not the same” and point to the design that you think 
has been changed. 


Unweighted and weighted composite scores, as well as profiles for perceiving 
changes in rotation, omission, addition, and distortion, were tabulated. The 
unweighted total score represented the sum of the individual’s correct same- 
different judgments (20 points) and his accurate identifications of those 
designs that were modified on the second card (16 points). 

Consistent with the Visual III, which yields a total score of 48 points for
correct reproductions of the 16 designs, the weighted composite score for a
perfect performance on the multiple choice Monroe was 48. Again, 20 points
were given if the member's decisions regarding the identity of the 20 sets of
designs were accurate. Differential weighting was predetermined for each of
the four variations from the standard. Eight young adults were requested to
rank, in terms of difficulty level, the paired variations. The consensus was that
distortion, addition, omission, and rotation changes were increasingly difficult
to recognize. The following weights, therefore, were applied: rotations
(R), 3 points; omissions (O), 2 points; additions (A), 1 point; distortions
(D), 1 point. The weighted profile scores for selecting all of the designs
that deviated from the standard within each of the four categories (R, O, A,
D) were 12, 8, 4, and 4, respectively. The composite weighted score included
the number of correct decisions regarding the match of the paired cards plus
differential credit for the appropriate selection of the altered design. An
error-free performance received 48 points. Four members from each group were
randomly selected and were readministered the multiple choice Monroe
following a 10 day interval. The reliability coefficient was .89.

                                  TABLE 1
ANALYSIS OF UNWEIGHTED AND WEIGHTED PROFILE SCORES FOR THE THREE AGED GROUPS

Source                                SS       df       MS        F

Unweighted scores
  Between subjects                  150.94     62
    Groups (A)                       82.73      2     41.36     36.28*
    Subjects within groups           68.21     60      1.14
  Within subjects                   222.00    189
    Profile (B)                     127.13      3     42.38     83.10*
    A × B                             2.32      6       .39       .76
    B × Subjects within groups       92.55    180       .51
Weighted scores
  Between subjects                  412.71     62
    Groups (A)                      234.45      2    117.22     39.47*
    Subjects within groups          178.26     60      2.97
  Within subjects                   298.00    189
    Profile (B)                      36.41      3     12.14      9.87*
    A × B                            40.09      6      6.68      5.43*
    B × Subjects within groups      221.50    180      1.23

* p < .05.
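
The scoring rules for the multiple choice exercise can be summarized in a few lines of Python. This is a minimal sketch rather than the authors' procedure: the function and variable names are invented for illustration, and the figure of four altered cards per category is an inference from the stated maximum profile scores (12, 8, 4, and 4 at weights of 3, 2, 1, and 1).

```python
# Weights reported in the text for each kind of alteration.
WEIGHTS = {"R": 3, "O": 2, "A": 1, "D": 1}
# Assumption: four altered cards per category, inferred from the stated
# maximum weighted profile scores of 12, 8, 4, and 4.
ITEMS_PER_CATEGORY = 4
TRIALS = 20  # 16 altered pairs plus 4 identical pairs


def composite_scores(correct_judgments, identified):
    """Return (unweighted, weighted) composite scores for one participant.

    correct_judgments: number of correct same / not-the-same decisions (0-20).
    identified: mapping of category (R, O, A, D) to the number of altered
        designs correctly pointed out in that category (0-4 each).
    """
    if not 0 <= correct_judgments <= TRIALS:
        raise ValueError("correct_judgments must be between 0 and 20")
    if set(identified) != set(WEIGHTS):
        raise ValueError("identified must have exactly the keys R, O, A, D")
    for category, count in identified.items():
        if not 0 <= count <= ITEMS_PER_CATEGORY:
            raise ValueError(f"count for {category} must be between 0 and 4")
    unweighted = correct_judgments + sum(identified.values())
    weighted = correct_judgments + sum(
        WEIGHTS[category] * count for category, count in identified.items()
    )
    return unweighted, weighted


# An error-free performance: 36 unweighted points, 48 weighted points.
print(composite_scores(20, {"R": 4, "O": 4, "A": 4, "D": 4}))
```

A participant with 18 correct judgments who identified two rotations, three omissions, and all additions and distortions would score 31 unweighted and 38 weighted under these rules.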

The Visual III also was given to the 63 participants. This test was administered
20 to 30 days before or after the individual had completed the
multiple choice exercise. The order of presentations of the latter and the Visual
III was counterbalanced. Each of the four plates was shown to the elderly
for 25 seconds, and in this way the exposure times for both the Visual III
and the multiple choice form were the same.


C. Results


The mean unweighted and weighted total scores for the three geriatric 
groups are shown in Figure 1; their mean performances for R, O, A, and D 
are represented graphically in Figure 2. The .05 level of confidence was the 
criterion for testing the significance of differences between all group means. 
To determine whether these three populations differed meaningfully in visual 
perception, the total unweighted and weighted multiple choice scores were 
subjected to a simple analysis of variance and a t test for independent measures.
The statistical computations provided corroborative evidence for the
obvious differences between groups as reflected in Figure 1. The unweighted
and weighted F values (28.97 and 33.11, respectively) were significant
beyond chance expectation, as were all t comparisons. The intact, moderately
deteriorated, and severely deteriorated members varied markedly in their
visual perception abilities.


FIGURE 2
PERFORMANCES FOR ROTATION, OMISSION, ADDITION, AND DISTORTION

The subscore profiles of the three aged groups were treated statistically
via a 3 × 4 repeated measures design (8). Summaries of the pertinent
statistics for the unweighted and weighted data are given in Table 1. The F
ratio for the two factors, groups and profiles, far exceeded the five percent
level in both analyses. The Newman-Keuls procedure was selected to compare
mean performances between the subscores comprising the two profiles. As
was pre-established, alterations in R, O, A, and D were progressively easier
to perceive by these aged people, and yielded decidedly different results. Of
particular import was the significant interaction of groups and profiles (F =
5.43) for the weighted visual perception scores. Individual F ratios were
computed to determine whether the three groups were homogeneous in their
responses to changes of R, O, A, and D. All critical ratios were significant
and, therefore, the null hypothesis was rejected. The intact, presenile, and
senile members presented unique weighted profiles.
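
The F ratios in Table 1 can be checked from the tabled sums of squares and degrees of freedom. The sketch below assumes the usual error terms for a groups × profile repeated measures layout: the groups effect is tested against subjects within groups, and the profile and interaction effects against B × subjects within groups. The names `TABLE_1` and `f_ratio` are illustrative and do not come from the original report.

```python
# (sum of squares, degrees of freedom) pairs transcribed from Table 1.
TABLE_1 = {
    "unweighted": {
        "groups": (82.73, 2),
        "subjects_within_groups": (68.21, 60),
        "profile": (127.13, 3),
        "groups_x_profile": (2.32, 6),
        "profile_x_subjects": (92.55, 180),
    },
    "weighted": {
        "groups": (234.45, 2),
        "subjects_within_groups": (178.26, 60),
        "profile": (36.41, 3),
        "groups_x_profile": (40.09, 6),
        "profile_x_subjects": (221.50, 180),
    },
}


def f_ratio(effect, error):
    """F = (SS_effect / df_effect) / (SS_error / df_error)."""
    ss_effect, df_effect = effect
    ss_error, df_error = error
    return (ss_effect / df_effect) / (ss_error / df_error)


for label, rows in TABLE_1.items():
    f_groups = f_ratio(rows["groups"], rows["subjects_within_groups"])
    f_profile = f_ratio(rows["profile"], rows["profile_x_subjects"])
    f_interaction = f_ratio(rows["groups_x_profile"], rows["profile_x_subjects"])
    print(f"{label}: groups {f_groups:.2f}, profile {f_profile:.2f}, "
          f"interaction {f_interaction:.2f}")
```

The recomputed values agree with the tabled F ratios to within about one percent; the small differences are consistent with the table's ratios having been formed from mean squares already rounded to two decimals.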

Pearson product-moment correlation coefficients were derived to assess the
degree of association between the multiple choice Monroe and the Visual III
performances for the three groups. Significant correlations of .45 and .48 were
obtained at the .05 level for the intact people’s unweighted and weighted
recognition scores and their recall performances, respectively. For the senile 
group, the relationships were .54 and .58. The two multiple choice Monroe 
scores of the preseniles, when correlated with their Visual III data, were 
found to be insignificant (r = .07 and .19). The small sample size and 
the dissimilar manner of administering the two visual perception tasks make 
these findings merely suggestive. Nevertheless, the unweighted and weighted 
multiple choice (recognition) performances of the intact and senile members 
were related to their ability to reproduce (recall) the Visual III designs. 
The correlations between the scores of the moderately deteriorated people 
were inconsequential. Parenthetically, the disparity between the recognition-recall
performances for the presenile was greater than for either the most
deteriorated or intact participants.
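
For readers who wish to see how a recognition-recall correlation of this kind is obtained, the following Python sketch computes a Pearson product-moment coefficient from paired scores. The individual scores of the study are not reported, so the two lists below are hypothetical values invented purely for illustration, and the function name is likewise an assumption.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical scores, invented for illustration only (not study data).
recognition = [44, 39, 41, 35, 30]  # multiple choice (recognition) totals
recall = [40, 31, 36, 33, 22]       # Visual III (recall) totals
print(round(pearson_r(recognition, recall), 2))
```

With one such coefficient per group, the recognition-recall relationship can be compared across the intact, presenile, and senile participants as in the paragraph above.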


D. Discussion 


The multiple choice format seemed to be an effective method for evaluating 
the aged’s visual perception performances. Consistent with the judgments of
the medical personnel, the composite scores, both unweighted and weighted, 
of the three geriatric groups decreased progressively. The weighted profiles of 
these three populations were identifiable. From high to low, the most alert 
people’s weighted profiles were in the order O-R-D-A; the rank order of
the moderately deteriorated individual performances was O-D-A-R; the most 
deteriorated patient’s mean scores decreased in a D-A-O-R pattern. The use- 
fulness of these results as they may relate to the elderly requires further study. 

An individual who elects to reside in a multipurpose institution for the 
aged must alter his life style. The new admission’s well-being is often con-
tingent upon his relocation in an environment that is compatible with his
physical and psychological needs. A test that will assist the medical and 
paramedical staff in classification and placement of the elderly would be of 
considerable value. Research is now in progress to validate a multiple choice 
examination of visual recognition for geriatric populations that may be ad-
ministered by nurses and social workers.

Significant correlation existed between the multiple choice Monroe and 
the Visual III performances of the intact and senile participants. The failure 
to discover such associations between recognition and recall for the presenile 
population is worthy of speculation. These moderately deteriorated people 
may be the most fragile of the three. Indeed, the presenile individual is 
precariously straddling two worlds; physically unstable and psychologically
irresolute, his competencies are less predictable. It is the moderately deterio-
rated person, therefore, who requires particular care and consideration.


E. Summary


The intent of this investigation was to determine whether a multiple choice 
exercise was a feasible corollary to a visual perception test that requires the 
aged to use paper and pencil.

Three comparable groups of 21 elderly people each (intact, presenile, and
senile) participated in this exploratory investigation. The data seemed to
suggest that the multiple choice format was a viable procedure for assessing
visual perception among the aged. The unweighted and weighted composite
scores decreased progressively with the degree of deterioration characteristic
of each group. The rotation of one of the figures in the variable sets was the
most difficult change for these people to perceive, followed by omission,
addition, and distortion changes. The weighted profiles of the intact,
presenile, and senile members were distinctively different. Moreover,
significant correlations were found between recognition and recall
performances of the intact and senile; such was not the case for the
moderately deteriorated (presenile) people. The clinical implications of
these findings were mentioned.
A. INTRODUCTION


There is a small but expanding stream of literature that reports on the 
cross-cultural measurement of personality structures through the use of 
paper-and-pencil tests developed in the United States. Several studies have
attempted to determine the generality of translated versions of American tests 
by using factor analytic techniques to compare the factor structures of instru- 
ments across culturally different samples. In this paper we will first review 
briefly the major empirical studies in this stream of research and then report 
the results of a cross-cultural comparison of the factor structure of the 
Shure-Meeker (11) Personality/Attitude Schedule (PAS). 


1. Purpose 


When a test instrument is translated, one naturally strives for concept trans- 
lation in such a way that all forms of the instrument measure the same con- 
struct. Gordon and Kikuchi (6) emphasize the importance of obtaining 
functionally equivalent translated versions of paper-and-pencil personality 
tests for use in cross-cultural research. They advocate using the construct- 
oriented approach when developing personality tests for comparing respondents 
across nationalities. In addition, Brislin (1) has recently demonstrated the 
technique of back-translation for obtaining functionally equivalent transla- 
tions of test instruments.

However, the fact that items in one language version of an instrument are 
translated into another language does not assure the measurement of the same 
construct in the second language version. Generally, an analysis of the results
obtained with the translated version is required to determine the construct 
equivalence of the items across different linguistic samples. One method of 
statistical evaluation of the translated items of an instrument is through the 
use of factor analysis. With this approach, the factor structure of the original 
test results can be compared with the factor structure of the translated ver-
sions in attempting to determine if the dimensions underlying the items are the
same in the various translations of the instrument. 

In the present study we compare the factor structure of a modified version 
of the Shure and Meeker (11) Personality/Attitude Schedule (PAS) based 
on a sample of United States respondents with the factor structures of three 
groups of respondents representing multinational samples. Factor structures
for the following three multinational samples were obtained through the factor
analytic procedure used by Shure and Meeker (11): (a) respondents whose 
native language is English and who completed the English language version of 
the PAS; (b) respondents whose native language is not English but who
completed the English language version of the PAS; (c) respondents who 
completed a translated version of the PAS in their native language. The PAS 
was administered as a part of a research program examining the relationship
between personality and bargaining behavior across cultures.


2. Overview of the Literature 


Cattell and Warburton (3) examined the stability of the factor structure 
of the Sixteen Personality Factor Questionnaire between 604 American and 
204 British male and female university undergraduate students. They derived 
the second-order factors of anxiety and extraversion-introversion by factoring 
the correlations among first-order factors. Cattell and Warburton concluded
[...] appropriateness of using the concepts of
anxiety and extraversion-introversion in these cultures. They further
[...] consistency. He administered the HSPQ
to 209 boys and girls (ages 12 to 14) in Australia and to a matched sample
of 296 in the United States. The 14 scales of the test were intercorrelated and
factored separately in each sample. An independent rotation of the five emer-
gent factors yielded four factors matched across cultures. These were the same
factors as recognized in adults: extraversion-introversion; anxiety-adjustment;
cortertia-pathemia; and independence. Cattell concluded that the primary
practical utility of the results of this study was the weights derived, making 
it possible to get scores from the HSPQ on second-order factors as exvia and 
anxiety. 

Comrey et al. (4) administered an Italian version of an English-language 
factorially developed personality test to 507 Italian males and females, about 
half of whom were university students. Factor analysis was used to select the 
items best defining each of the 28 personality dimensions measured. The total 
scores of these item groups were intercorrelated along with two control scales: 
age and sex. The resulting correlation matrix when factor analyzed produced 
a factor solution with substantial similarity to the factor solution for a pre- 
viously tested American sample. The factors that emerged in both cultural 
samples were hostility, compulsion, neuroticism, dependence, and social desir- 
ability. However, the authors state that the correspondence between the 
factors was “not perfect.” 

Kikuchi and Gordon (10) identified the Survey of Interpersonal Values 
(SIV) scales of support, conformity, recognition, independence, benevolence, 
and leadership by means of an item factor analysis on an American sample. 
Twenty-four items from a translated version of the SIV were administered to
132 Japanese college students, and the results factor analyzed. Five factors 
emerged from the orthogonally rotated factor matrix with only a few items 
having their largest loadings on a factor other than the ones originally hypo- 
thesized. The results of the factor analysis indicated that the constructs incor- 
porated in the SIV do emerge as factors in the Japanese culture. The majority 
of the items in the translated version of the test instrument had their principal 
loadings on the same factors as did the corresponding original items. After 
determining that the translated scales were reliable, independent, and mean- 
ingful, evidence regarding construct validity was obtained by correlating 
the scales with those of two other personality tests and comparing the mean 
scores of groups which differed in known respects relevant to the characteristics 
measured by the instrument.

Gordon (5) reported a multinational study of the SIV involving student 
samples tested in Hong Kong, Taiwan, India, Japan, Samoa, the Philippines, 
and the United States. The author used Q-typing for identifying personality 
types and describing individuals and groups. Trait level data were obtained 
for the samples where each group had some distinguishing characteristic. The 
mean trait scores of these groups were then intercorrelated. Finally, the
resultant correlational matrix was factor analyzed and rotated to simple 
structure with use of the varimax criterion, and the emergent factors were 
used to identify the personality types. The resultant three factors, which 
accounted for 95.4 percent of the variance, were found to distinguish clearly 
among the cultures. The Oriental student samples had interpersonal values 
in common which differentiated them from their American counterparts. Also, 
clustering by nationality appeared within the Oriental grouping. 

Jamison and Comrey (8) reported a study comparing the factor structure 
similarity of the Comrey Personality Scales (CPS) between a British and an 
American sample. Their samples consisted of 250 male and female American 
university students and 232 British male and female university students. The 
version of the CPS administered to the two samples was designed to measure 
seven factors: shyness, dependence, empathy, neuroticism, compulsion, hostility, 
and socialization. Each of these seven factors was made up of six to nine 
factored homogeneous item dimensions (FHID), defined as several items 
developed to measure the same variable and shown to be homogeneous through 
actual data analysis. The total scores for the 52 FHIDs were intercorrelated,
and the resulting matrix factor analyzed and the factors rotated to simple 
structure. The emergent factors were dependence, empathy, neuroticism, 
socialization, compulsion, hostility, and shyness (which split into two factors 
in both samples). The factor loadings in the British and American factors
were correlated. Jamison and Comrey found a high correlation between the
British and American factor structures, six of the eight factors having a corre-
lation of over .85. Significant differences did exist, however, in the mean
FHID and factor scores between the two samples. 


B. Method

1. Subjects 


The subjects used in this study were managers from the following nations: 
Belgium, Denmark, England, Finland, France, Greece, Spain, Switzerland, 
and the United States. At the time of data collection the respondents, all of 
whom were managers employed in middle and upper level positions in private
enterprises, were enrolled in executive development courses in the countries
identified above.

The PAS was administered under one of three different translation-language
conditions. The first category of respondents [...] native language was
English and who received [...] translated into their respective native
language (Danish, Finnish, Flemish, French, Spanish, and Greek). This
category of managers is designated as the nonEngL-nonEngQ category.


Selected demographic characteristics of the managers in each of the three
categories, as well as the sizes of each group, are as shown in Table 1.


2. The Instrument and Scales 


Each subject completed a 53 item revised version of the PAS, originally
developed by Shure and Meeker (11).4 Only four of the original six scales
were administered, as seven-point and three-point scaled items and as forced-
choice questions, as described below.


a. Conciliation vs. Belligerence in Interpersonal Relations. High scorers
advocate responding to the needy and less fortunate (or even to un-
friendly, quarrelsome, provocative, and hostile persons) with under-
standing, help, and friendliness. They urge admitting their own wrongs,
and refuse to use threats or belligerent means or to be motivated by
revenge. Instead they advise a diplomatic and constructive response
guided by considerations of humanitarianism and cooperation (11,
p. 240).

b. Risk Avoidance vs. Risk Taking. High scorers are unadventuresome,
exhibit a low activity level, and are unwilling to expose themselves to
dangers or to hazard risks of either a material or physical character
(11, p. 240).

c. External vs. Internal Control. High scorers believe that events are
controlled by external forces (fate, chance, events). Low scorers believe that
they can exercise some control over the events around them (11, p. 241).

d. Suspiciousness vs. Trust. High scorers are characterized by paranoid-
like traits of selfishness, projection of hostility, excitability, and tenseness.
Low scorers are characterized by a trusting, unselfish, calm, and optimistic
orientation (11, p. 241).


Copies of the revised PAS used in this study are available from the first author
at the address shown at the end of this article.


TABLE 1

                                                  Category
Sample characteristics             EngL-EngQ   NonEngL-EngQ   NonEngL-NonEngQ
Number                                181           93             172
Mean age                              40.0          37.0           35.3
  SD                                   6.3           5.7            7.6
Mean number years
  business experience                 17.1          12.7           11.4
  SD                                   6.3           5.8            5.4
Mean number years experience
  in present firm                     11.9           8.7            7.3
  SD                                  12         [illegible]        5.8
Mean number years formal
  education beyond age 18              3.8           4.1            3.9
  SD                                   1.8           2.0            2.6
Mean number years residence
  in present location                 37.7          23.0           33.3
  SD                                  10.9          15.6           10.1

Note: Bivariate subsample method used for missing data.


The PAS was translated from its English language version into the 
required languages by means of the partial use of a back-translation procedure 
advocated by Brislin (1) for its applicability to cross-cultural research (the 
Brislin paper appeared subsequent to the development of our instrument). 
For each country in which the questionnaire was administered, a native of the 
country translated the PAS questions from the English language version into 
his native language. This person was instructed to concentrate on translating 
the meaning of the question, rather than on a word-by-word literal translation. 
This translated version of the instrument was then independently translated 
back into English by another native of the country where the questionnaire 
would be administered. The two translators were then brought together and 


L. L, CUMMINGS, ET AL, 73 


instructed to resolve jointly any difference in the two English language ver- 
sions. Finally, the questionnaire for each country was carefully reviewed by 
a third native of the target country who was familiar with the backgrounds 
of the prospective managerial respondents. 


3. Procedure 


Responses from each of the three categories of respondents were factor 
analyzed by use of the same procedure employed by Shure and Meeker (11) 
in the original development of the PAS. For each category, this involved 
constructing a correlation matrix for the 58 PAS item scores. The resulting 
matrix was then factored by use of the principal axes solution and rotated by 
means of a varimax program (9). Although Shure and Meeker (11) reported
item loadings as low as .25, only item loadings of .4 or above are reported in 
our analysis, to permit clearer interpretation of the factors. 
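
The reporting rule just described (keep an item on a factor only when its loading reaches .40 in absolute value) can be sketched as follows. This is an illustration only, not the program used in the study. The loadings are the legible rows of the Shure-Meeker Risk Taking-Risk Avoidance column of Table 5; rows that are garbled in this copy are omitted, and the function name is hypothetical.

```python
SALIENT = 0.40  # reporting cutoff adopted in the present analysis

# item number -> loading, legible rows of the Shure-Meeker risk column (Table 5)
shure_meeker_risk = {
    41: -0.60, 34: -0.57, 37: -0.54, 42: -0.51, 30: -0.48,
    31: -0.45, 45: -0.44, 49: -0.39, 40: -0.38, 33: -0.38,
    36: -0.37, 46: -0.35, 38: -0.29, 35: -0.27, 48: -0.22,
}


def salient_items(loadings, cutoff=SALIENT):
    """Return item numbers with |loading| >= cutoff, largest |loading| first."""
    kept = {item: value for item, value in loadings.items() if abs(value) >= cutoff}
    return sorted(kept, key=lambda item: abs(kept[item]), reverse=True)


print(salient_items(shure_meeker_risk))               # items kept under the .40 rule
print(salient_items(shure_meeker_risk, cutoff=0.25))  # items kept under the .25 rule
```

The stricter cutoff retains far fewer items than the .25 level used by Shure and Meeker, which is one reason fewer items appear on each factor in the present tables.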

We will report the factors in order of the percentage of total variance and 
the percentage of total factor variance accounted for by each factor. The 
amount of variance explained by each of the first seven factors across the 
three categories is shown in Table 2. These data cannot be compared with 
the Shure and Meeker (11) findings, since comparable analyses were not 
reported in that paper. Table 3 shows the number of each item loading on 
each of the first seven factors extracted in our analysis. This table also shows 
the factors across the three categories (EngL-EngQ, nonEngL-EngQ, and 
nonEngL-nonEngQ). Tables 4 through 7 report our first four orthogonal 
factors for each category along with one of the PAS factors obtained in the 
Shure and Meeker (11) cross-validation factor analysis. Table 8 reports the 
next three factors that emerged from our analysis. These factors appear to 
represent elaboration and subdivisions of the original Shure-Meeker factors.


C. Results


The first seven factors that emerged from our factor analysis of the revised 
PAS accounted for 50.0 percent of the total factor variance and 33.2 percent 
of the total variance in the EngL-EngQ category (see Table 2). In the 
nonEngL-EngQ category, Factors I through VII accounted for 47.7 percent 
of the total factor variance and 35.4 percent of the total variance. The least 
amount of variance was accounted for in the nonEngL-nonEngQ category. 



5 The correlation matrices for the different nonEnglish forms of the questionnaire
were examined and found to be generally similar. The samples completing the non-
English versions were combined in order to generate a sample size sufficient for
factor analysis.
A STATJOB FACTOR 1 program was used on a UNIVAC 1108 computer.

Here the first seven factors accounted for 45.0 percent of the total factor
variance and 30.2 percent of the total variance. Thus, as we move toward 
less similarity, in terms of language form of the PAS and native language of 
the sample, less variance of both types in the responses to the PAS is accounted 
for. 


TABLE 2
Variance Analysis by Factor: Rotated Factor Matrix

                                              Category
Factor                          EngL-EngQ   NonEngL-EngQ   NonEngL-NonEngQ
I        % factor variance        13.7         13.1             9.0
         % total variance          9.1          9.7             6.0
II       % factor variance         8.1          7.0             8.6
         % total variance          5.4          5.2             5.8
III      % factor variance         6.2          6.3             5.8
         % total variance          4.1          4.7             3.9
IV       % factor variance         6.0          5.7             5.6
         % total variance          4.0          4.2             3.8
V        % factor variance         6.0          5.5             5.5
         % total variance          3.9          4.1             3.7
VI       % factor variance         5.1          5.1             5.4
         % total variance          3.4          3.8             3.6
VII      % factor variance         4.9          5.0             5.1
         % total variance          3.3          3.7             3.4
Above    % factor variance        50.0         47.7            45.0
seven    % total variance         33.2         35.4            30.2
factors
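
As a consistency check on Table 2, the per-factor percentages for Factors I through VII can be summed and compared with the cumulative figures reported for the seven factors. The sketch below does only that arithmetic; the variable names are illustrative, and the total-variance entry for Factor II in the EngL-EngQ category is taken as 5.4, the value given in the discussion of Factor II.

```python
# Per-factor percentages for Factors I-VII, by category (Table 2).
factor_variance = {
    "EngL-EngQ":       [13.7, 8.1, 6.2, 6.0, 6.0, 5.1, 4.9],
    "NonEngL-EngQ":    [13.1, 7.0, 6.3, 5.7, 5.5, 5.1, 5.0],
    "NonEngL-NonEngQ": [9.0, 8.6, 5.8, 5.6, 5.5, 5.4, 5.1],
}
total_variance = {
    "EngL-EngQ":       [9.1, 5.4, 4.1, 4.0, 3.9, 3.4, 3.3],
    "NonEngL-EngQ":    [9.7, 5.2, 4.7, 4.2, 4.1, 3.8, 3.7],
    "NonEngL-NonEngQ": [6.0, 5.8, 3.9, 3.8, 3.7, 3.6, 3.4],
}
# Cumulative figures reported for the seven factors: (factor variance, total variance).
reported_sums = {
    "EngL-EngQ":       (50.0, 33.2),
    "NonEngL-EngQ":    (47.7, 35.4),
    "NonEngL-NonEngQ": (45.0, 30.2),
}


def column_sum(values):
    """Sum a column of percentages and round to the one decimal used in the table."""
    return round(sum(values), 1)


for category, (factor_sum, total_sum) in reported_sums.items():
    consistent = (
        column_sum(factor_variance[category]) == factor_sum
        and column_sum(total_variance[category]) == total_sum
    )
    print(category, "consistent" if consistent else "inconsistent")
```

All three columns reproduce the reported cumulative percentages, so the per-factor entries and the seven-factor totals agree with one another.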


In comparing the item composition of the factor structures of the Shure- 
Meeker analysis with our analysis, we find that several of the Shure-Meeker 
factors divided into more differentiated factors (see Table 3). The Con-
ciliation vs. Belligerence factor, for example, segmented into another factor
indicative of a propensity toward helping others. Also the Risk Avoidance 
factor split into physical and monetary risk avoidance factors in each of the 
three categories analyzed. The Suspiciousness vs. Trust factor held up across 
all three of our categories, though a lesser number of items loaded on this 
factor in our analysis than in the Shure-Meeker analysis. However, the Ex- 
ternal Control factor did not emerge as a distinct factor in any category. Table 
3 indicates the items loading on each factor and the corresponding Shure- 
Meeker designation for each factor. Later (see Table 9) an attempt will be 
made to label these factors as derived from our analysis.

We will now analyze our results by factor and, where possible, compare 
them with the four factors presented by Shure and Meeker (11). 


TABLE 3
Item Numbers by Factor and Corresponding Shure-Meeker Factor Designation

                                          Category
Factor  EngL-EngQ                   NonEngL-EngQ                  NonEngL-NonEngQ
I       [C-B] 1, 7, 13, 15, 24,     [C-B] 1, 4, 7, 9, 15, 20,     [R-A] 31, 32, 37, 41,
              27, 28, 29                  22, 27, 28, 29                42, 44
                                    [S-T] 8, 16, 25
II      [R-A] 31, 34, 37, 41,       [R-A] 31, 32, 34, 37,         [C-B] 1, 3, 9, 11, 18,
              42, 44                      41, 42                        26, 28, 29
III     [S-T] 2, 6, 8, 10           [S-T] 19                      [C-B] 20, 22
                                    [R-A] 36, 49                  [S-T] 16, 25
IV      [C-B] 3, 9, 11, 18          [S-T] 2, 10, 12               [R-A] 30, 33, 38
V       [R-A] 30, 33, 38            [R-A] 33, 38                  [R-A] 36, 49
                                    [C-B] 17
VI      [S-T] 16, 25                [E-I] 53, 56                  [S-T] 2, 4, 14
        [C-B] 5
VII     [E-I] 53, 56, 57            [E-I] 57                      [C-B] 5
                                    [S-T] 6, 8                    [R-A] 40
                                    [C-B] 3                       [E-I] [C-B] 15

Note: The key to Shure-Meeker factors is as follows: [C-B] Conciliation vs. Bellig-
erence; [R-A] Risk Taking-Risk Avoidance; [S-T] Suspiciousness vs. Trust; [E-I]
External vs. Internal Control.


1. Factor I


Table 4 shows Factor I as extracted in our analysis for each of the three 
categories and compares these factors with the Shure-Meeker factor of Con- 
ciliation vs. Belligerence. 

For the EngL-EngQ category, all eight of the items (1, 7, 13, 15, 24, 27, 
28, and 29) that loaded on Factor I also loaded on the Shure-Meeker Con- 
ciliation vs. Belligerence factor. Thus it appears that Factor I in this category 
refers to the dimension that Shure and Meeker have labelled “conciliation vs. 
belligerence”. Factor I accounts for 13.7 percent of the total factor variance 
and 9.1 percent of the total variance of the EngL-EngQ category. 

For the nonEngL-EngQ category, nine (out of 13) of the items (1, 7, 9, 
15, 20, 22, 27, 28, 29) that loaded on our Factor I also loaded on the Con-
ciliation vs. Belligerence factor. However, three of the items (8, 16, and 25)
loading on this factor loaded on the Shure-Meeker Suspiciousness vs. Trust
factor. Thus, Factor I of the nonEngL-EngQ category, which accounts for
13.1 percent of the total factor variance and 9.7 percent of the total variance, 
again appears to tap the conciliation vs. belligerence dimension. 

For the nonEngL-nonEngQ category, all six items (31, 32, 37, 41, 42 and 
44) that loaded on Factor I also loaded on the Shure-Meeker Risk Avoidance 
factor. These items, which refer only to a physical risk concept, account for
9.0 percent of the total factor variance and 6.0 percent of the total variance 
of the nonEngL-nonEngQ category. Thus, when the questionnaire was 
translated and administered in a language other than English, the first factor 
to emerge from our analysis shifted. In addition, once again with the single
exception of the nonEngL-EngQ total variance, the amount of variance
accounted for decreased as we moved toward less similar forms of the PAS and
more different samples.
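
The item-by-item comparison made in this section amounts to counting how many of the items loading on our Factor I belong to the Shure-Meeker Conciliation vs. Belligerence scale. A minimal sketch of that count follows. The item sets are those listed in the text; the scale membership is read from the Shure-Meeker column of Table 4, with item 11 treated as a Conciliation item on the strength of the text (its row is illegible in this copy). The names are illustrative.

```python
# Items on the Shure-Meeker Conciliation vs. Belligerence scale (Table 4).
# Item 11 is an inference: its table row is illegible, but the text treats it
# as a Conciliation vs. Belligerence item.
CONCILIATION_ITEMS = {1, 3, 5, 7, 9, 11, 13, 15, 17, 18, 20, 22, 24, 26, 27, 28, 29}

# Items loading on our Factor I, as enumerated in the text for each category.
factor_one = {
    "EngL-EngQ": {1, 7, 13, 15, 24, 27, 28, 29},
    "nonEngL-EngQ": {1, 7, 9, 15, 20, 22, 27, 28, 29, 8, 16, 25},
    "nonEngL-nonEngQ": {31, 32, 37, 41, 42, 44},
}


def overlap(items, scale):
    """Return (number of items that belong to the scale, number of items)."""
    return len(items & scale), len(items)


for category, items in factor_one.items():
    shared, total = overlap(items, CONCILIATION_ITEMS)
    print(f"{category}: {shared} of {total} Factor I items are Conciliation items")
```

The count falls from all of the listed items in the EngL-EngQ category, to most of them in the nonEngL-EngQ category, to none in the nonEngL-nonEngQ category, which is the shift in the first factor described above.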


TABLE 4
Comparison of Three Language Samples and Shure-Meeker Scale: Factor I

        Shure-Meeker                               Category
        Conciliation-
        Belligerence       EngL-EngQ       NonEngL-EngQ      NonEngL-NonEngQ
Rank    Item  Loading     Item  Loading    Item  Loading     Item  Loading
 1        1    -.63        29    .761       15    .849        31    .75
 2       15    -.58        24    .754        4    .79         41    .754
 3        7    -.56        15    .734       27    .690        32    .726
 4       13     .53        28    .682        9    .632        42    .725
 5        3     .50         7    .679   [illegible] .591      37    .639
 6       29     .50        27    .651       28    .559        44    .509
 7       11    -.47        13    .647       24    .522
 8       17    -.46         1    .578        8    .486
 9        9  [illegible]                    29    .483
10       28    -.44                         16    .452
11       18  [illegible]                    22    .437
12       24     .38                         20    .411
13        5     .37                         25   -.401
14       27     .37
15       22     .33
16       26    -.29
17       20     .28
% factor
variance   Not reported        13.7             13.1               9.0
% total
variance   Not reported         9.1              9.7               6.0


2. Factor II 


Table 5 shows Factor II as extracted from each of the three categories
and compares these factors with the Shure-Meeker factor of Risk Avoidance.
For the EngL-EngQ category, all six of the items (31, 34, 37, 41, 42, and
44) that loaded on Factor II also loaded on the Shure-Meeker Risk factor
with loadings greater than .40. The items loading on this factor are almost
identical to those that loaded on Factor I of the nonEngL-nonEngQ cate-
gory. Factor II in the EngL-EngQ category therefore also refers to
physical risk avoidance and accounts for 8.1 percent of the total factor
variance and 5.4 percent of the total variance.

For the nonEngL-EngQ category, all six items (31, 32, 34, 37, 41, and 
42) that loaded on Factor II also loaded on the Shure-Meeker Risk factor. 
As in the previous category, this factor taps a physical risk avoidance dimen- 
sion. Factor II accounts for 7.0 percent of the total factor variance and 5.2 
percent of the total variance of the nonEngL-EngQ category. 

As was true for Factor I, the nonEngL-nonEngQ category generated dif- 
ferent items loaded on Factor II. The eight items (1, 3, 9, 11, 18, 26, 28, 
and 29) that loaded on Factor II of the nonEngL-nonEngQ category also 
loaded on the Shure-Meeker Conciliation vs. Belligerence factor. However, 
it appears that these items refer to helping behavior and satisfactions derived 
from exhibiting a helping attitude. This factor accounts for 8.6 percent of the
total factor variance and 5.8 percent of the total variance of the nonEngL- 
nonEngQ category. 


TABLE 5
Comparison of Three Language Samples and Shure-Meeker Scale: Factor II

        Shure-Meeker                               Category
        Risk Taking-
        Risk Avoidance     EngL-EngQ       NonEngL-EngQ      NonEngL-NonEngQ
Rank    Item  Loading     Item  Loading    Item  Loading     Item  Loading
 1       44    -.60        31    .804       34    .814        28    .728
 2       41    -.60        41    .746       42    .638         9    .653
 3       34    -.57        37    .634       31    .585        26    .598
 4       37    -.54        34    .619       37    .558         3    .587
 5       42    -.51        42    .524       41    .518        11    .567
 6       30    -.48        44    .452       32    .472        18    .561
 7       32     .47                                           29    .472
 8       31    -.45                                            1    .451
 9       45    -.44
10       49    -.39
11       40    -.38
12       33    -.38
13       36    -.37
14       46    -.35
15       38    -.29
16       35    -.27
17       48    -.22
% factor
variance   Not reported         8.1              7.0               8.6
% total
variance   Not reported         5.4              5.2               5.8
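
The similarity of the physical risk avoidance factor in the two English-questionnaire categories can also be expressed as a single index. The sketch below computes Tucker's congruence coefficient for the five items (31, 34, 37, 41, and 42) that load on Factor II in both the EngL-EngQ and nonEngL-EngQ categories, using the loadings in Table 5. This index is not part of the original analysis; it is offered only as an illustration, and because it is computed on salient items alone, all with positive loadings, it will tend to overstate the agreement of the full factors.

```python
import math


def congruence(x, y):
    """Tucker's congruence coefficient: sum(xy) / sqrt(sum(x^2) * sum(y^2))."""
    numerator = sum(a * b for a, b in zip(x, y))
    denominator = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return numerator / denominator


# Factor II loadings from Table 5, keyed by item number.
engl_engq = {31: 0.804, 41: 0.746, 37: 0.634, 34: 0.619, 42: 0.524}
nonengl_engq = {34: 0.814, 42: 0.638, 31: 0.585, 37: 0.558, 41: 0.518}

# Only items that are salient in both categories can be compared here.
shared_items = sorted(set(engl_engq) & set(nonengl_engq))
x = [engl_engq[item] for item in shared_items]
y = [nonengl_engq[item] for item in shared_items]

phi = congruence(x, y)
print(f"shared items: {shared_items}, congruence = {phi:.3f}")
```

A fuller comparison would require the complete loading matrices for both categories, which are not reported here.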


3. Factor III


Factor III as extracted in each of our categories is shown with the Shure- 
Meeker External vs. Internal Control factor in Table 6. For the EngL- 
EngQ category, the four items that loaded on Factor III (2, 6, 8, and 10) 
loaded on the Shure-Meeker factor of Suspiciousness vs. Trust. Factor III 
of the EngL-EngQ category accounts for 6.2 percent of the total factor 
variance and 4.1 percent of the total variance. For the nonEngL-EngQ cate- 
gory, Factor III consists of items 36 and 49, both of which loaded on the 
Shure-Meeker Risk Avoidance factor. In addition, item 19 from the Shure- 
Meeker Suspiciousness vs. Trust scale also loaded on Factor III. This factor 
appears to be a segment of the physical risk avoidance factor as it specifically 
refers to driving risk. Factor III accounts for 6.3 percent of the total factor
variance and 4.7 percent of the total variance
of the nonEngL-EngQ category. For the nonEngL-nonEngQ category, Factor
III is composed of items 20, 22, and 25 which loaded on the Shure-Meeker 
Conciliation vs. Belligerence factor. However, item 16 loaded on the Shure- 
Meeker Suspiciousness vs. Trust factor. Factor III, Suspiciousness vs. Trust,
accounts for 5.8 percent of the total factor variance and 3.9 percent of the 
total variance of the nonEngL-nonEngQ category. 

Clearly, Factor III is not easily interpretable in terms of the original 
Shure-Meeker factor structures. It clearly does not fit the label “external vs.


TABLE 6
Comparison of Three Language Samples and Shure-Meeker Scale: Factor III

        Shure-Meeker                               Category
        External vs.
        Internal Control   EngL-EngQ       NonEngL-EngQ      NonEngL-NonEngQ
Rank    Item  Loading     Item  Loading    Item  Loading     Item  Loading
1 51 67 2 784 36 
4 i 9682S 724 
2 3 63 pak oes namie) 766 = 16 704 
s a - a 503 19 = —690 20 —<A49 
5 a eae ee ae. 
6 50 AT 
7 57 45 
8 56 43 
oD 55 37 
10 33 25 
il 15 —.25 
% factor
variance   Not reported         6.2              6.3               5.8
% total
variance   Not reported         4.1              4.7               3.9


internal control.” Furthermore, there is no overlap in the items loaded across
the three categories. Thus, the external-internal control factor does not appear 
to hold up across language differentials in the PAS format. 


4. Factor IV 


For each of the three categories, Factor IV and the Shure-Meeker Suspi- 
ciousness-Trust factor are shown in Table 7. 

For the EngL-EngQ category, items 3, 9, 11, and 18 loaded on Factor IV 
and also loaded on the Shure-Meeker Conciliation vs. Belligerence factor. As 
with Factor III of the nonEngL-nonEngQ category, this factor indicates a 
helping dimension. Factor IV accounts for 6.0 percent of the total factor 
variance and 4.0 percent of the total variance of the EngL-EngQ category.
For the nonEngL-EngQ category, items 2, 10, and 12 loaded on Factor IV
and were from the Shure-Meeker Suspiciousness vs. Trust factor. Factor IV,
Suspiciousness vs. Trust, accounts for 5.7 percent of the total factor variance 
and 4.2 percent of the total variance of the nonEngL-EngQ category.


TABLE 7
Comparison of Three Language Samples and Shure-Meeker Scale: Factor IV

        Shure-Meeker                               Category
        Suspiciousness
        vs. Trust          EngL-EngQ       NonEngL-EngQ      NonEngL-NonEngQ
Rank    Item  Loading     Item  Loading    Item  Loading     Item  Loading
 1        2  [illegible]    9    .675        2    .746        30    .768
 2       12  [illegible]    3    .636       12    .544        38    .729
 3       14     .44        11    .630       10    .489        33    .691
 4        4     .43        18  [illegible]
 5        6     .43
 6        8     .41
 7       10     .40
 8       47    -.40
 9       35    -.38
10       43    -.35
11       19    -.35
12       21    -.35
13       16     .34
14       25     .34
15       39    -.33
16       23     .32
17       22     .28
18       30     .27
% factor
variance   Not reported         6.0              5.7               5.6
% total
variance   Not reported         4.0              4.2               3.8


For the nonEngL-nonEngQ category, items 30, 33, and 38 loaded on Factor IV;
these items also loaded on the Shure-Meeker Risk Avoidance factor. How- 
ever, these three items refer only to monetary risk rather than the general 
risk taking and risk avoidance factor. Factor IV accounts for 5.6 percent of 
the total factor variance and 3.8 percent of the total variance of the nonEngL- 
nonEngQ category. Again there appears to be considerable sample variability 
in the order of factor emergence and no overlap in the items across the three 
categories. Only the nonEngL-EngQ category appears similar to the
original Shure-Meeker Suspiciousness-Trust factor.


5. Factor V 


For the EngL-EngQ category, items 30, 33, and 38 loaded on Factor V 
(see Table 8). This is the same composition as Factor IV of the nonEngL-
nonEngQ category as shown in Table 7. Factor V, monetary risk avoidance, 
accounts for 6.0 percent of the total factor variance and 3.9 percent of the 
total variance of the EngL-EngQ category. 

For the nonEngL-EngQ category, items 17, 33, and 38 loaded on Factor V. 
Of these, items 33 and 38 are from the Shure-Meeker Risk Avoidance factor, 
and item 17 from the Conciliation vs. Belligerence factor. It appears that 
Factor V of this category measures primarily a monetary risk taking-risk 
avoidance dimension. This factor accounts for 5.5 percent of the total factor 
variance and 4.1 percent of the total variance of the nonEngL-EngQ cate-
gory.

For the nonEngL-nonEngQ category, items 36 and 49 of the Shure-Meeker 
Risk Avoidance factor loaded on Factor V. As with Factor III of the 
nonEngL-EngQ category (see Table 6), this factor assesses a physical risk 
avoidance dimension, since both items refer to fast driving. This factor ac- 
counts for 5.5 percent of the total factor variance and 3.7 percent of the 
total variance of the nonEngL-nonEngQ category. 


6. Factor VI 


For the EngL-EngQ category, items 5, 16, and 25 loaded on Factor VI
as shown in Table 8. Items 16 and 25 are from the Shure-Meeker Suspicious- 
ness vs. Trust factor, and item 5 is from the Conciliation vs. Belligerence 
factor. Factor VI appears to tap a propensity for confrontation. This factor 
accounts for 5.1 percent of the total factor variance and 3.4 percent of the 
total variance of the EngL-EngQ category. 

For the nonEngL-EngQ category, items 53 and 56 from the Shure-Meeker 
External vs. Internal Control factor loaded on Factor VI. This factor 
accounts for 5.1 percent of the total factor variance and 3.8 percent of the 
total variance in the nonEngL-EngQ category. 

TABLE 8 
COMPARISON OF THREE LANGUAGE SAMPLES: FACTORS V, VI, AND VII 

                        EngL-EngQ       NonEngL-EngQ    NonEngL-NonEngQ 
Rank                    Item  Loading   Item  Loading   Item  Loading 

Factor V 
  1                      30    .797      33    .742      49    .857 
  2                      33    .768      38    .730      36    .785 
  3                      38    .721      17    .541 
  % factor variance            6.0             5.5             5.5 
  % total variance             3.9             4.1             3.7 

Factor VI 
  1                      16    .600      53    .801       4    .759 
  2                       5   -.574      56    .579      14    .677 
  3                      25    .571                       2    .413 
  % factor variance            5.1             5.1             5.4 
  % total variance             3.4             3.8             3.6 

Factor VII 
  1                      57    .757      57    .762       5    .768 
  2                      53    .681       3   -.644      15    .476 
  3                      56    .478       8    .506      40   -.401 
  4                                       6    .417 
  % factor variance            4.9             5.0             5.1 
  % total variance             3.3             3.7             3.4 

For the nonEngL-nonEngQ category, items 2, 4, and 14 loaded on Factor 
VI, and were from the Suspiciousness vs. Trust factor. Factor VI, Suspicious- 
ness vs. Trust, accounts for 5.4 percent of the total factor variance and 3.6 
percent of the total variance of the nonEngL-nonEngQ category. 


7. Factor VII 


Factor VII as extracted in each of the three categories is shown in Table 
8. Items 53, 56, and 57 from the Shure-Meeker External vs. Internal Con- 
trol factor loaded on Factor VII of the EngL-EngQ category. This factor 
accounts for 4.9 percent of the total factor variance and 3.3 percent of the 
total variance of this category. 

For the nonEngL-EngQ category, four items from three different Shure-Meeker 
factors loaded on Factor VII. Item 57 is from the Shure-Meeker 
External vs. Internal Control factor; items 6 and 8 are from the Suspicious- 
ness vs. Trust factor; and item 3 is from the Conciliation vs. Belligerence 
factor. Factor VII of the nonEngL-EngQ category accounts for 5.0 percent 
of the total factor variance and 3.7 percent of the total variance. For this 
category, Factor VII is clearly not interpretable. 

For the nonEngL-nonEngQ category, items 5, 15, and 40 loaded on Factor 
VII. Item 5 is from the Shure-Meeker Conciliation vs. Belligerence factor; 
item 15 is from the External vs. Internal Control factor; and item 40 is from 
the Risk Avoidance factor. Factor VII accounts for 5.1 percent of the total 
factor variance and 3.4 percent of the total variance of the nonEngL- 
nonEngQ category. Again, Factor VII does not appear to be interpretable. 
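
The two percentages reported for each factor can be related by a short calculation. The sketch below assumes (the text does not define the terms) that "percent of total factor variance" is a factor's share of the variance accounted for by all extracted factors, and that "percent of total variance" is its share of all item variance. Under that reading, the ratio of the two estimates the proportion of total variance captured by the extracted factors in each category. The figures are those reported above for Factors V, VI, and VII; the names `REPORTED` and `implied_common_share` are illustrative only.

```python
# (percent of total factor variance, percent of total variance) for
# Factors V, VI, and VII, as reported in the text for each category.
REPORTED = {
    "EngL-EngQ": [(6.0, 3.9), (5.1, 3.4), (4.9, 3.3)],
    "nonEngL-EngQ": [(5.5, 4.1), (5.1, 3.8), (5.0, 3.7)],
    "nonEngL-nonEngQ": [(5.5, 3.7), (5.4, 3.6), (5.1, 3.4)],
}


def implied_common_share(pairs):
    """Average ratio of total-variance percent to factor-variance percent.

    Assumption: this ratio approximates the proportion of total variance
    accounted for by all extracted factors together.
    """
    ratios = [total_pct / factor_pct for factor_pct, total_pct in pairs]
    return sum(ratios) / len(ratios)


for category, pairs in REPORTED.items():
    print(category, round(implied_common_share(pairs), 2))
```

Because the reported percentages are rounded to one decimal place, the ratios within a category agree only approximately.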


D. Discussion 


Table 9 compares the seven emergent factors across the three language 
classifications. The results are only partially supportive of the generality 
of the Shure-Meeker PAS instrument. In each of the three classifications, 
Conciliation vs. Belligerence, Risk Taking vs. Risk Avoidance, and 
Suspiciousness vs. Trust emerged as identifiable factors. 

However, certain important differences do emerge when these results are 
compared to the Shure-Meeker 1967 extraction. First, the internal-external 
control factor did not emerge in any of the samples, with the exception of 
the weak emergence as Factor VII in the EngL-EngQ category. Second, in 
each sample, the risk taking-risk averting factor differentiated into two 
components; namely, physical risk taking and monetary risk taking, and in each 
case the physical risk taking factor emerged prior to the monetary risk taking 
factor. This breakdown of the risk factor into subfactors seems to parallel 
the earlier results of Thurstone (12) and Guilford et al. (7) in the 
construction of the scale used by Shure and Meeker in their original work. 
Third, if one examines only the first three factors, it is only with the EngL- 
EngQ sample that the three Shure-Meeker factors (Conciliation-Belligerence, 
Risk Taking, and Suspiciousness-Trust) emerged. Thus the order of factor 
emergence was similar to that of Shure and Meeker only in the grouping 
where both native language and questionnaire were English. Four factors 
must be considered in order to pick up each of the three Shure-Meeker factors 
in the nonEngL-EngQ sample, while six factors are necessary in the nonEngL- 
nonEngQ sample. Thus, as we move toward more dissimilar conditions (in 
terms of the native language of the respondents and the language version of 
the questionnaire), the factor structure of the PAS increases in dissimilarity 
compared to the original Shure-Meeker instrument. 
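
The counts of three, four, and six factors can be checked against the factor labels listed in Table 9. The following is a minimal sketch, assuming that any label containing the word "risk" stands for the Shure-Meeker Risk Taking factor; the names `TABLE_9`, `TARGETS`, and `factors_needed` are illustrative and not part of the original analysis.

```python
# Factor labels in order of emergence (Factors I to VII), from Table 9.
TABLE_9 = {
    "EngL-EngQ": [
        "Conciliation vs. Belligerence", "Physical risk taking",
        "Suspiciousness vs. Trust", "Helping", "Monetary risk taking",
        "Confrontation", "Job order giving",
    ],
    "NonEngL-EngQ": [
        "Conciliation vs. Belligerence", "Physical risk taking",
        "Driving fast", "Suspiciousness vs. Trust", "Monetary risk taking",
        "Job order giving", "Not interpretable",
    ],
    "NonEngL-NonEngQ": [
        "Physical risk taking", "Helping", "Conciliation vs. Belligerence",
        "Monetary risk taking", "Driving fast", "Suspiciousness vs. Trust",
        "Not interpretable",
    ],
}

# Keywords standing for the three Shure-Meeker factors (an assumption).
TARGETS = ("conciliation", "risk", "suspiciousness")


def factors_needed(labels):
    """Smallest number of leading factors that covers all three targets."""
    first_ranks = []
    for target in TARGETS:
        rank = next(
            position
            for position, label in enumerate(labels, start=1)
            if target in label.lower()
        )
        first_ranks.append(rank)
    return max(first_ranks)


for category, labels in TABLE_9.items():
    print(category, factors_needed(labels))
```
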
TABLE 9 
PAS FACTORS ACROSS THREE LANGUAGE GROUPINGS 

Factor   EngL-EngQ                 NonEngL-EngQ              NonEngL-NonEngQ 
I        Conciliation vs.          Conciliation vs.          Physical risk 
         Belligerence              Belligerence              taking 
II       Physical risk taking      Physical risk taking      Helping 
III      Suspiciousness            Driving fast              Conciliation vs. 
         vs. Trust                                           Belligerence 
IV       Helping                   Suspiciousness            Monetary risk 
                                   vs. Trust                 taking 
V        Monetary risk taking      Monetary risk taking      Driving fast 
VI       Confrontation             Job order giving          Suspiciousness 
                                                             vs. Trust 
VII      Job order giving          Not interpretable         Not interpretable 


In general, the results reported here are mildly encouraging regarding the 
cross-cultural use (with the associated language differences) of the Shure- 
Meeker PAS instrument. The differences between the original PAS and the 
version reported here are largely ones of differentiation within the original 
factor structure. The risk taking-risk aversion factor consistently split into 
physical and monetary components. In two of the three samples, a factor 
labeled “helping” emerged as a differentiation of the original Conciliation- 
Belligerence scale. 


E. Summary 


This paper reports a cross-cultural and cross-language comparison of a 
personality instrument originally developed for use in experimental bargaining 
studies (11). The factor structures of the instrument in three different 
language classifications are compared. Four hundred forty-six subjects, from 
Belgium, Denmark, England, Finland, France, Greece, Spain, Switzerland, 
and the United States, all of whom were adult males employed in private 
enterprises, each completed a modified version of the Shure and Meeker 
questionnaire. The factor structure of the questionnaire was extracted for 
each of three subgroups: (a) native English language respondents completing 
an English version of the questionnaire; (b) nonnative English language 
respondents completing an English version; and (c) nonnative English 
language respondents completing a native language version. Three of four scales 
originally identified by Shure and Meeker (11) emerged in each of the three 
subgroups; the three scales were measures of conciliation-belligerence, risk 
taking-risk avoidance, and suspiciousness-trust. The order in which the factors 
emerged differed across the subgroups. In addition, the risk taking-risk avoid- 
ance factor, as well as the conciliation-belligerence factor, segmented into 
subcomponents. Implications are discussed regarding the cross-cultural and 
cross-language generality of the Shure-Meeker instrument, as well as the 
general issue of the cross-cultural measurement of personality. 
A NORMAL CURVE METHOD FOR COMBINING 
PROBABILITY VALUES FROM INDEPENDENT 
EXPERIMENTS* 


The University of Calgary 


Eugene S. Edgington 


A. Introduction 


A number of independent experiments may test the same null hypothesis, 
and an experimenter may want to pool or combine the probability values from 
the experiments to determine whether the values in general are significantly 
smaller than would be expected by chance. The experiments whose probability 
values are to be pooled may be performed by different experimenters or by an 
individual experimenter. An individual experimenter may need to pool a 
number of his experiments because of the insensitivity of each experiment 
as a result of the small number of subjects that can be used at any given time. 
Or he may desire to run a series of experiments where he modifies later 
experiments on the basis of what he learns from earlier ones. Instead of 
discarding the statistical results from the pilot studies, however, he may 
prefer to pool the probability values from those experiments with the values 
from the later experiments. 

The method familiar to psychologists for combining probability values 
is that which involves the determination of the significance of a set of prob- 
ability values by reference to chi-square tables (2, pp. 248-250; 3; 4, pp. 
231-233; 5, pp. 43-45). The significance of the overall probability associated 
with the combination is the probability under the null hypothesis of getting 
such a small product of probability values as an obtained product. An al- 
ternative method, called the additive method, determines the significance of 
a sum of probability values and has been shown to be more powerful than 
the conventional method (1). 
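
As a point of comparison, the conventional product-based procedure can be sketched in a few lines. The text does not spell out the computation, so the sketch below is an assumption: it uses the familiar chi-square procedure commonly attributed to Fisher, in which minus twice the sum of the natural logarithms of n probability values is referred to chi-square with 2n degrees of freedom. For an even number of degrees of freedom the upper-tail area has a closed form, so no tables or external libraries are needed. The function name is illustrative.

```python
from math import exp, factorial, log


def product_method_probability(p_values):
    """Overall probability for the product of independent p-values.

    Assumption: chi-square = -2 * sum(ln p) with 2n degrees of freedom.
    For even degrees of freedom the upper-tail area is
    exp(-x/2) * sum((x/2)**k / k!) for k = 0 .. n-1.
    """
    n = len(p_values)
    half_chi_square = -sum(log(p) for p in p_values)
    series = sum(half_chi_square ** k / factorial(k) for k in range(n))
    return exp(-half_chi_square) * series


print(round(product_method_probability([0.05, 0.05]), 4))
```

With a single probability value the function returns that value unchanged, which is a quick sanity check on the closed form.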

The additive method consists in adding the probability values and deter- 
mining the overall probability of the smallness of the sum by the following 
formula: 

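$$\frac{S^n}{n!} - \binom{n}{1}\frac{(S-1)^n}{n!} + \binom{n}{2}\frac{(S-2)^n}{n!} - \binom{n}{3}\frac{(S-3)^n}{n!} + \cdots \qquad [1]$$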

* Received in the Editorial Office on May 30, 1972, and published immediately at 
where S is the sum of the probability values and n is the number of proba- 
bility values added. The minus and plus signs preceding the terms alternate, 
and additional terms are used as long as the number subtracted from S in 
the numerator is less than S. For example, if the sum of four probability 
values was 2.75, the numerator of the first term would be (2.75)^4, the 
numerator of the second term would be (1.75)^4, and the numerator of the 
third term would be (.75)^4. Only three terms would be used. 
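
The rule for how many terms to use can be made concrete with a short sketch of the additive method. The binomial weights on the later terms are an assumption here: they follow from the standard distribution of a sum of independent uniform variables and are not quoted from the printed formula. The function name is illustrative.

```python
from math import comb, factorial


def additive_probability(p_values):
    """Probability of a sum of n p-values as small as the obtained sum S.

    Terms alternate in sign, and term k is used only while k < S.
    The comb(n, k) weights are assumed from the distribution of a sum
    of independent uniform variables.
    """
    n = len(p_values)
    s = sum(p_values)
    total = 0.0
    k = 0
    while k < s:
        total += (-1) ** k * comb(n, k) * (s - k) ** n / factorial(n)
        k += 1
    return total


# Four probability values summing to 2.75: three terms are used.
print(round(additive_probability([0.75, 0.5, 0.75, 0.75]), 4))
```

When the sum is 1 or less only the first term is evaluated, so two values summing to .50 give (.50)^2/2! = .125.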

When only a few probability values are to be added, Formula 1 presents 
no difficulty, but when there are many probability values, it may be awkward 
to apply the formula. This paper presents a normal curve method to be used 
when a large number of probability values are to be added. It is computa- 
tionally simpler than Formula 1 in such cases and will be shown to provide 
an overall probability value that closely approximates the true value given 
by Formula 1 when there are several probability values to be added. 


B. Normal Curve Method 


The normal curve method uses the mean of the added probability values 
and determines the probability of such a small mean by the use of normal 
curve tables, after determining the z-score value of the obtained mean by the 
following formula: 

$$z = \frac{\bar{X} - .50}{.2887/\sqrt{n}} \qquad [2]$$

where $\bar{X}$ is the mean of the n probability values added. The z score is ex- 
pected to be small when there are treatment effects, and so the overall proba- 
bility is estimated by the proportion of area under the normal curve below 
the obtained z-score value. 

For instance, suppose the addition of probabilities associated with nine 
independent experiments gave a sum of 3.15. The mean probability, $\bar{X}$, is 
then 3.15/9 = .35, and n is 9. Substitution into Formula 2 provides a z-score 
value of -1.56. Normal curve tables show that the proportion of area below 
a z-score value of -1.56 (i.e., the proportion of area more than 1.56 
standard deviations below the mean) is .0594, and so the overall probability 
associated with the nine pooled probability values is .0594. That is, the 
probability of nine probability values providing a mean as small as .35 (or, 
alternatively, a sum as small as 3.15) when the null hypothesis is true for all 
nine experiments, is .0594. 
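
The worked example can be reproduced with a minimal sketch of Formula 2. It uses the Python standard library's normal distribution in place of printed tables, so the result differs slightly from the tabled .0594 because the z score is not rounded to two decimals first. The function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist

UNIFORM_SD = sqrt(1 / 12)  # standard deviation of p-values under the null, about .2887


def normal_curve_probability(p_values):
    """Return (z, overall probability) for the mean of the p-values (Formula 2)."""
    n = len(p_values)
    mean_p = sum(p_values) / n
    z = (mean_p - 0.50) / (UNIFORM_SD / sqrt(n))
    # Area under the standard normal curve below the obtained z score.
    return z, NormalDist().cdf(z)


# Nine probability values with a mean of .35 (a sum of 3.15).
z, overall_p = normal_curve_probability([0.35] * 9)
print(round(z, 2), round(overall_p, 4))
```
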


C. Rationale for the Normal Curve Method 


When the null hypothesis of no treatment effect is true for each of n experi- 
ments providing a probability value, the n probability values can be regarded 
as randomly selected, with replacement, from a continuous uniform population 
of probability values that ranges from zero to 1 and has a mean of .50. A 
continuous uniform (rectangular) population has a standard deviation of 
$\sqrt{R^2/12}$, where R is the range of the population, and so the standard devia- 
tion of the population of probability values is $\sqrt{1/12}$ = .2887. 
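
The standard deviation quoted above can be verified directly. The short derivation below is a sketch for the unit range used here, with p denoting a probability value drawn from the uniform population on the interval from zero to 1.

```latex
\sigma^2 = \int_0^1 \left(p - \tfrac{1}{2}\right)^2 dp
         = \left[\frac{\left(p - \tfrac{1}{2}\right)^3}{3}\right]_0^1
         = \frac{1}{24} + \frac{1}{24}
         = \frac{1}{12},
\qquad
\sigma = \sqrt{\frac{1}{12}} \approx .2887
```

For a uniform population of range R, the same integral taken over an interval of length R gives a variance of R^2/12, which reduces to 1/12 when R = 1.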

The theoretical sampling distribution of means from any population has a 
mean equal to the population mean and a standard deviation equal to the 
population standard deviation divided by the square root of the sample size. 
Thus, the theoretical distribution of means of n probabilities under the null 
hypothesis has a mean of .50 and a standard deviation of $.2887/\sqrt{n}$. Thus, 
Formula 2 gives the z-score value of an obtained mean. 

Reference of the z-score value to normal curve tables is based on the 
assumption of approximate normality of the theoretical distribution of means. 
The population of probability values is a rectangular, not a normal, distri- 
bution, and so the distribution of sample means for any sample size is not 
perfectly normal, but the central limit theorem states that the sampling dis- 
tribution of means for populations of any shape approaches normality as the 
sample size increases. One can therefore expect the overall probabilities ob- 
tained by the normal curve method to provide a closer approximation to the 
true overall probabilities given by Formula 1, the larger the number of prob- 
abilities to be combined. It will now be shown that with even a relatively 
small number of probability values, the sampling distribution of means is close 
enough to normal for the normal curve method to give an overall probability 
that closely approximates that given by Formula 1. 

Table 1 shows the true probability values based on Formula 1 that are 
associated with probabilities obtained by the normal curve method, for various 
numbers of probability values added. An example will explain the derivation 
of the table. Consider the tabled value, .051, which falls at the intersection 
of the row for a normal curve probability of .05 and the column for five 
probability values. Five percent of the area under the normal curve falls 
below a z score of -1.645, and so the normal curve method assigns an over- 
all probability of .05 to a mean that falls 1.645 standard deviations below .50 
in the sampling distribution of means of probabilities. For n = 5, the standard 
deviation of the distribution of means is $.2887/\sqrt{5}$ = .1291. The mean of 
five probability values that would be assigned a normal curve probability of 
.05 therefore is (1.645) (.1291) units below the mean of .50; i.e., the mean 
value would be about .2876. Thus, the sum of five probability values for 
which the overall normal curve probability is .05 would be (.2876) (5) = 
1.438. Substitution of 1.438 into Formula 1 gives the true overall probability 
associated with a mean of .2876 for five probability values; namely, .051. 

TABLE 1 
TRUE PROBABILITY VALUES (BASED ON FORMULA 1) ASSOCIATED WITH 
NORMAL CURVE PROBABILITY VALUES 

Normal curve            Number of probability values added 
probability values       2       3       4       5       9 
       .10              .11     .11     .10     .104    .101 
       .09              .10     .10     .09     .094    .091 
       .08              .09     .08     .08     .082    .081 
       .07              .08     .07     .07     .072    .071 
       .06              .07     .06     .06     .061    .061 
       .05              .05     .05     .05     .051    .050 
       .04              .04     .04     .04     .040    .040 
       .03              .03     .03     .03     .029    .030 
       .02              .01     .02     .02     .019    .020 
       .01              .00     .01     .01     .008    .009 

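
The derivation of a tabled value described above can be sketched as a short calculation. The additive formula is repeated so the block is self-contained; its binomial weights are assumed from the distribution of a sum of independent uniform variables, and the function names are illustrative.

```python
from math import comb, factorial, sqrt
from statistics import NormalDist


def additive_probability_from_sum(s, n):
    """Formula 1 evaluated for a sum s of n probability values."""
    total = 0.0
    k = 0
    while k < s:  # use term k only while k is less than the sum
        total += (-1) ** k * comb(n, k) * (s - k) ** n / factorial(n)
        k += 1
    return total


def true_probability(normal_curve_p, n):
    """True overall probability behind a given normal curve probability."""
    z = NormalDist().inv_cdf(normal_curve_p)   # e.g. about -1.645 for .05
    sd_of_mean = sqrt(1 / 12) / sqrt(n)        # about .1291 for n = 5
    mean_p = 0.50 + z * sd_of_mean             # about .2876 for n = 5
    return additive_probability_from_sum(mean_p * n, n)


print(round(true_probability(0.05, 5), 3))
```

Calling `true_probability` over each row and column of Table 1 is one way to check the tabled entries.
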
For n = 5 and n = 9, the overall probability values from Formula 1 
shown in the table are given to three decimal places to show the closeness of 
the normal curve probabilities to the true probabilities. All of the tabled true 
probability values for n = 4, carried out to the second decimal place, cor- 
respond exactly to the normal curve probability values, and so the normal 
curve method provides a very close approximation to the overall probability 
given by Formula 1 when the number of probability values is four or larger. 
Although the discrepancy between the normal curve probability values and 
the values from Formula 1 is not great for cases where as few as two or 
three probability values are combined, there is no practical value in using a 
normal curve approximation in such cases. If the sum of three probability 
values is greater than 1, necessitating computation of more than the first 
term of Formula 1, the overall probability value that is obtained will be 
greater than 1/6 and therefore will be of little interest to many investigators. 
Results providing overall probabilities small enough to be of interest to the 
typical experimenter therefore will require computation of only the first term 
of Formula 1, which is S^n/n!, when the number of probability values being 
combined is two or three, and such computation is easy. 


D. Summary 


Although the additive method of combining probability values from in- 
dependent experiments has been shown to be more powerful than the con- 
ventional method which determines the significance of results from chi-square 
tables, it is computationally awkward when there are a large number of 
probability values to be combined. The normal curve method given in this 
paper provides a close approximation to the true overall probability for the 
additive method when there are four or more probability values to be com- 
bined, and it is computationally simple. 
LANDMARKS IN THE HISTORY OF CLINICAL 
PSYCHOLOGY FROM ITS EARLY 
BEGINNINGS THROUGH 
1971* 


Department of Psychology, Indiana State University 


M. Mike Nawas 


It has often been said that the lessons of history of a discipline are an 
unmatched source of learning. This maxim has guided my efforts since I 
started this project several years ago. The present brief version, which I 
decided to share with the reader, is a distillation of the essentials about clinical 
psychology as I saw it emerge through the years. 

In covering the span of two centuries, errors are almost inevitable. Omis- 
sions are due in part to selectivity. This is dictated by the necessity of keeping 
the size of the paper within reasonable limits. Of course, my personal con- 
victions and orientation have influenced the selection process, and I make no 
pretense at being impartial in a field that I am strongly committed to and 
which happens to be quite controversial. But I did make a special effort to 
state my views very clearly in order not to blur the distinction between facts 
and opinions. Further, I attempted to cover all major facets of clinical psy- 
chology as a science, as a profession, and as a discipline with many inter- and 
intragroup frictions. 

Dates have been checked in most cases in at least two different sources, but 
this is no guarantee of complete accuracy. I shall be grateful to those who care 
to correct me where I erred and to call my attention to what they consider 
to be significant events of which I may be totally unaware or which I failed to 
include in this brief version. 

1752—The first institution for the mentally ill is established at Philadel- 
phia (1). The Pennsylvania Hospital was equipped for what we now call 
“occupational therapy” activities. 

1776—Mesmer publishes his work expounding mesmerism, or animal mag- 
netism, called “hypnotism” by Braid in 1843 (5). 

1777—Cullen publishes in London First Lines of the Practice of Physik 
theorizing that mental illness is organically determined. Paranoia was de- 
scribed in the book too (12). 


* Received in the Editorial Office on May 30, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
1801—P. Pinel publishes a book marking the beginning of the humanitarian 
treatment of the insane (15). In 1793 Pinel had already removed the chains 
from his mental patients (14). 

1812—The first American textbook of psychiatry is written by Benjamin 
Rush (15). This was a standard text for seven decades (1). 

1831—Upham introduces his Elements of Mental Philosophy. This was 
the classic textbook of psychology used in America before the publication in 
1890 of William James’ Principles of Psychology. In addition, it was the first 
psychology textbook to present a section on abnormal psychology (14). 

1832—Twenty-nine different associations of phrenology made their ap- 
pearance in Britain (15). 

1840—Dorothea Lynde Dix begins her reform movement for the better- 
ment of treating the mentally ill (7). 

1841—Jean Pierre Falret begins in Paris the “aftercare” movement for 
aiding patients who have been released from mental hospitals (15). See 
related entry in 1947. 

1844—First antecedent of American Psychiatric Association (Association 
of Medical Superintendents of American Institutions) is founded with Samuel 
B. Woodward as its first president. In 1893 the name was changed to Ameri- 
can Medico-Psychological Association. This name lasted till 1921, then be- 
came American Psychiatric Association. In 1844 the Association founded 
The American Journal of Insanity, later renamed The American Journal of 
Psychiatry (1). 

1853—The American Psychological Journal is founded. This journal 
lasted only one year. It was devoted chiefly to mental pathology and the 
medical jurisprudence of insanity (15). 

1866—Liebault introduces his theory of “suggestion therapy” (7) which 
was later extended to “autosuggestion” therapy by Bernheim and Coué. 

1875—Dugdale develops the thesis that mental deficiency is the result of 
heredity (15). Related works are Galton’s Hereditary Genius (1869) where 
he advocated the heritability of intelligence, and Goddard’s famous Kallikak 
family study in 1912 (7, 12). 
———Michon introduces “modern” graphology (15). 

1879—Psychology is born as a science when Wundt opens the first psy- 
chological laboratory (3). 

1880—Breuer cures Anna O. with “talking out” therapy (19). 

1882—Child Psychology is expounded by Wilhelm Preyer in his book 
The Mind of The Child (15). 

1883—Publication of Kraepelin’s book, Psychiatry, outlining his classifica- 
tions of mental illnesses (1). 
———Galton publishes Inquiries into Human Faculty and its Develop- 
ment. Much emphasis is laid on individual differences and assessment of man’s 
capacities. Among many other distinctions Galton is considered the first psy- 
chometrician (5). 

1886—James McKeen Cattell and Joseph Jastrow receive the first two 
doctorates in psychology (14, 15). 

1888—First professorship of psychology in the world is established at the 
University of Pennsylvania. James McKeen Cattell, who was a “lecturer in 
psychology” at both Pennsylvania and Bryn Mawr, became the first professor 
of psychology (14). 

1889—The first International Congress of Psychology is held in Paris 
(19). 

1890—James McKeen Cattell introduces the term “mental tests” in his 
book Mental Tests and Measurements (5). 

1892—The American Psychological Association is founded. G. S. Hall was 
its first president. The APA was incorporated in 1925 (6). 

1893—August Hoch is selected and given the title of psychologist by 
Cowles at McLean Hospital in Waverly, Massachusetts (19). 
———At the suggestion of J. Mark Baldwin the American Psychological 
Association establishes “The Committee on Physical and Mental Tests” with 
J. McKeen Cattell as its head, for the purpose of standardizing the data from 
mental and physical tests (19). 

1895—Breuer and Freud publish Studies on Hysteria marking the be- 
ginning of psychoanalysis (15). 

1896—Witmer (1867-1956) establishes the first psychological clinic, ini- 
tially for serving children, at the University of Pennsylvania (5). The Uni- 
versity of Pennsylvania is considered as the birthplace of clinical psychology. 

1897—Definite procedures are adopted by the American Psychological As- 
sociation for nomination to membership (6). 

1898—R. T. Wylie introduces psychological testing in a state institution 
for the feebleminded in Minnesota (16). 
———Mescaline is discovered. The year 1903 marks the discovery of the 
sedative effects of barbiturates, and 1943 the discovery of LSD-25 (1, 12, 
15, 19). See years 1928, 1934, and 1935 for related entries. 

1899—The negative findings of Sharp (buttressed by those of Wissler in 
1901) impede for many years further investigation of intelligence at the 
college level (19). 
1904—Adolf Meyer develops the psychiatric interview (12). Meyer mi- 
grated to the United States in 1892 (1). 
———Witmer succeeds in persuading the University of Pennsylvania to 
begin offering courses in clinical psychology (12). 
———Pfaundler, a German pediatrician, invents an apparatus resembling 
and antedating that of Mowrer for treating enuresis via conditioning to a 
buzzer (12). 

1905—Binet and Simon develop their intelligence test in France (16). 

1906—A training school for the feebleminded is established at Vineland, 
New Jersey, with Henry H. Goddard as its director (16). 
———In his presidential address before the APA, Angell introduces the 
functionalist position enunciated earlier by Angell’s senior colleague, John 
Dewey (14). 
———Morton Prince founds the Journal of Abnormal Psychology (12). 
In 1921, the journal was renamed Journal of Abnormal and Social Psy- 
chology, and in 1965 the APA decided to reinstate the earlier name. 
———The term “psychoanalysis” appears for the first time in an article 
by Putnam in the Journal of Abnormal Psychology (15). 

1907—Witmer founds the journal Psychological Clinic (16). 
———Witmer presents an extended definition of clinical psychology 
stressing that it is a science and an art (12). This is reminiscent of the so- 
called scientist-professional model. Witmer also felt that each patient should 
be considered as a research study, the task of the clinician being the assessment 
of the effects of recommendations and procedures. This view characterizes the 
approach of behavior therapists. 
———S. I. Franz began the practice of using a routine psychological 
examination for all new admissions to a mental hospital (19). 
———The APA establishes a committee to look into the methods and 
procedures of psychological testing. This marks the serious appreciation of the 
profound implications of testing and psychological explorations. The com- 
mittee continues its work until 1919 (6). 

1908—Manhattan State Hospital begins using psychoanalysis on a regular 
basis. This is the first such step taken in the United States (15). 
———G. S. Hall begins giving courses in Freudian Psychology (19). 
———H. H. Goddard offers the first internship for psychologists at Vineland 
Training School (19). 
———William McDougall publishes Introduction to Social Psychology 
(5), which gives the concept of instincts such a central role as to lead to a near 
ban against the use of the concept for several decades. 
———Clifford Beers founds the first mental hygiene society in Connecticut 
(12), a movement that marks the beginning of what is now referred to as 
community psychiatry, psychology, and mental health. This is the same year 
he published A Mind That Found Itself (16). 

1909—Charles Parsons publishes Choosing A Vocation, the first book on 
vocational guidance (15). 
———William Healy opens the Juvenile Psychopathic Institute in Chi- 
cago, now called Institute of Juvenile Research (5), and accepts graduate 
student trainees (16). 
———Freud and Jung visit the United States at Hall’s invitation (5). 

1910—The term “Dynamic Psychology” is introduced by R. Woodworth 
(15). 

1911—Alfred Adler breaks away from Freud and founds the Society for 
Individual Psychology (7). 
———Bleuler introduces the term “schizophrenia” as a substitute for 
Morel’s term of “dementia praecox” (1). 
———The APA establishes the “Committee on the Relations of Psy- 
chology and Medical Education.” A questionnaire sent to various medical 
schools disclosed that many physicians feel that psychology is important to 
medical education and recommended that a course in general and clinical 
psychology be offered (6). 
———Ernest Jones founds the American Psychoanalytic Association (15). 
———Wallin defines the roles of the clinical psychologist as the diagnosis 
and prognosis of emotional disturbance (18). The current roles of therapy, 
consultation, and research are not included in Wallin’s definition. 

1912—Establishment of the American Association for the Study of Feeble- 
mindedness together with its organ, The Journal of Psychoaesthenics (15). 
———Wertheimer discovers the “phi phenomenon,” marking the birth of 
Gestalt psychology (5). 

1913—Jung breaks with Freud and founds Analytical Psychology (7). 
———R. M. Yerkes initiates the first internship in a psychiatric institution 
for adults, the Boston Psychopathic Hospital (19). 
———John B. Watson publishes Psychology as the Behaviorist Views It 
(5). 


1915—On the motion of Whipple the APA officially discourages the use 
of mental tests by people who are “not qualified” to use them (19). The same 
year the APA sets up a committee to study the necessary qualifications of 
mental examiners (17). 

1916—Terman introduces the Stanford Revision of the Binet-Simon Test 
(14). The test was revised again in 1937 and later in 1960. Terman’s effort 
in 1916 came after those of Healy and Goddard who, in 1911, published 
separate versions of the Binet Scale (12). 
———W. D. Scott becomes the first professor of Applied Psychology in 
the United States. The first department of “Applied Psychology” was estab- 
lished a year earlier at the Carnegie Institute of Technology, and was then 
dissolved in 1924 (12). 
———Jung publishes his work on psychological types in English. 
Kretschmer publishes his body-type classification in 1921 (15). It should be noted 
that Galton spoke also of the “visile, audile, tactile, etc.” types of people 
many years earlier. The earliest typology is attributed to Galen. 

1917—Mental Hygiene, a quarterly, is established (16). The term “mental 
hygiene” was coined in 1843 by Sweetser, a physician from the University of 
Vermont (1). 
———A committee headed by Yerkes develops the Army Alpha and Army 
Beta tests, the first group intelligence tests (12). 
———August Hoch introduces the “open-door policy” for treating the 
mentally ill (15) as outpatients or inpatients. 
———Woodworth develops the Personal Data Sheet to be used to detect 
psychoneurotic recruits (16). 
———Legislation is enacted in Illinois allowing a psychologist to serve as 
a member of a two-man commission to determine if a person should be com- 
mitted to an institution (12). 


psychological examiner or expert (12). It is not clear if this is the same 
committee set up in 1915 or a different one. 


clinicians (Maxwell, Miner, Wallin, Mitchell, Pinter, Hollingsworth, 
Schmitt) tentatively establish the A. A. C. P., the American Association of 
Clinical Psychologists (6). 
1918—The 27th APA convention at Baltimore devotes most of its pro | 
ceedings to a discussion of the pros and cons of the development of the 
A. A.C. P. (6). 
Woodworth publishes his book, Dynamic Psychology (12). 
The committee set up in 1915 for considering the qualifications 
examiners reports in favor of the certification of mental examiners (17). 
1919—Moreno founds the “Spontaneity Theater” (otherwise called “Impromptu
Theater”) evolving into a technique subsequently called “psychodrama”
(15). Reisman (12) dates this development at 1922.
At the APA convention at Cambridge, Massachusetts, the
A. A. C. P. is dissolved and is incorporated as the “Clinical Section” of 
the APA (6), thereby marking the founding of the first of the now 32 
divisions of the APA. 

1920—A committee on the certification of psychologists is established by 
the APA (17). Watson (19) dates this event at 1919. 

1921—Rorschach introduces his inkblot test of personality. The same year 
David Levy introduces the test to the United States (12). 
The APA establishes a procedure for the certification of psy- 
chologists, and the first certificates are granted (16). 
An APA committee recommends the dissolution of the “Section of
Clinical Psychology.” Instead, it recommends the formation of the “Section 
of Consulting Psychology” and reaffirmation that the APA’s objective is to 
advance psychology as a science. A Ph.D. in psychology plus publications of 
“acceptable” nature became prerequisites for a “member” status in the APA.
Dissatisfied clinical psychologists reacted by founding the New York State 
Association for Consulting Psychology (5, 6, 12). The polarization of psy- 
chology as a science and as a profession is taking definite form. 
Founding of the Psychological Corporation by James McKeen 
Cattell (12). 

1923—The Otis Classification Test and Assembly Test of General Mechanical
Ability are developed (12).


1924—The “Section of Consulting Psychology” was named Division of 
Consulting Psychology (12). The “Clinical Section” continued to exist, 
however, despite the 1921 recommendation for its dissolution. It is worth 
noting that the Division of Consulting is attempting, in 1971, to define what 
“consulting” actually means and what its scope is. 
Analysis of personality through finger painting is initiated by R. F. 
Shaw (15). 
A. Adler introduces his book, Practice and Theory of Individual 
Psychology (15). Earlier forms of the theory appeared in 1907 and 1912.

——The American Orthopsychiatric Association holds its first conven- 
tion (19) and selects William Healy as its first president (1). The first 
psychologist to be selected president of the Association is Augusta F. Bronner
in 1931 (12). 

1925—L. Crane makes a plea for a “doctor of psychology” degree program
for clinical psychologists (12). See relevant entry in 1968.

1926—New York legislature passes a bill declaring that psychoanalysis
carried out by nonmedical individuals is illegal. The American Medical
Association warns its members against cooperating with lay analysts (12). A
year later, Freud publishes The Problem of Lay Analysis, wherein he declared
that “Psychoanalysis is a part of psychology . . . [and] not a particular branch
of medicine.” See 1969 for a relevant entry.

The Associate, a new class of membership in the APA, is begun (6). 
Development of the Miller Analogies Test and Goodenough’s 
Draw-a-Man Test (16). 

1927—F. L. Wells publishes Mental Tests in Clinical Practice. In his
book he objects to the rigid psychometric approach (19).
Morton Prince founds the Harvard Psychological Clinic for the 
purpose of bringing academic and clinical psychology together (12). 

The APA discontinues its certification practice on grounds of insufficient
interest of applicants for certification (17).
1928—Rudolf Dreikurs introduces family counseling to the United 

States (12). 

Bain publishes Thought Control in Everyday Life, a book that 

advocates the “thought stopping” technique currently in use by behavior 

therapists. 

1930—Carl Murchison edits the first of a series of volumes titled A History
of Psychology in Autobiography (5).
International Congress of Mental Hygiene and the White House 
Conference on Child Health and Protection are held (1). Preventive men- 
tal health, latent since 1917, is beginning to take form as a significant move- 
ment. 


Establishment of an official classification of 21 psychiatric illnesses 
by the German Psychiatric Association. The classification was adopted by
the Austrians in 1940 (15). 

1931—The Allport-Vernon scale for measuring values is presented in A
Study of Values (15).

The first residential treatment center for children, Emma Pendleton
Bradley Home, is established (12). 

The Clinical Section of the APA establishes the Committee on
Standards of Training for Clinical Psychologists (16). 

1932—S. Beck conducts the first study in the United States using the 
Rorschach (19). This was his doctoral dissertation titled “The Rorschach
Test as applied to a feeble-minded group.” 

Dunlap publishes Habits: Their Making and Breaking, wherein he
advocates the method of “negative practice,” a currently used behavior therapy

technique. 

Tolman publishes Purposive Behavior in Animals and Man. 

The Executive Committee of the Association of Consulting Psy- 

chologists establishes a subcommittee on psychotherapy (12). 

Horney migrates to the United States; Fromm follows in 1933, 

and Rank in 1934 (12). 

Moreno introduces the term “group therapy” (12).
1933—Adoption and publication by the American Psychiatric Association 

of a classification of mental illness comprising 22 major and 36 minor 

groups (15). The classification was revised in 1934, again in 1942; and the 

final revision was made in 1951 and published in 1952. 
1934—Slavson introduces the term “activity group therapy” (12). 

Metrazol is introduced for treating schizophrenics. In 1938 insulin

shock was introduced in the United States, and electroconvulsive shock treat- 

ment (ECT) in 1939 (1, 12, 15). The discovery of ECT dates back to 

1756 (1). 
1935—The APA Committee on Standards of Training for Clinical Psy- 

chologists publishes its report in which clinical psychology is defined as “that 

art and technology which deals with the adjustment problems of human 

beings” (3). 

Lobotomy is introduced by Fulton and Jacobson in England and in 

the United States by Freeman and Watts in 1937 (12, 15, 19). Moniz, a

Portuguese psychiatrist, performs the first successful lobotomy (1). 

The Thematic Apperception Test (TAT) is developed by Chris- 

tiana D. Morgan and H. Murray. 

1936—A tentative curriculum, the first of its kind, for the clinical psy- 
chology specialty is developed at the Psychology Department of Columbia 
University (16). 

1937—The American Association of Applied Psychology (A. A. A. P.) is 
founded for the purpose of guaranteeing professional competence and controlling
professional standards. It absorbed the sections of Clinical and the
Division of Consulting, and became an affiliate of the APA (12, 16). 

The California Test of Mental Maturity is introduced (12). 
———S. Beck publishes his widely used manual for the Rorschach.
1938—The American Psychoanalytic Association becomes autonomous from 

the International Psychoanalytic Association (1). 

Lauretta Bender develops the Bender-Gestalt Test, and Thurstone
his Primary Mental Abilities Test (12).

Publication by Kallmann of his first work on the heritability of
schizophrenia (7). 

B. F. Skinner publishes the Behavior of Organisms. 

Publication of the first edition of Mental Measurements Yearbook 
under the editorship of O. K. Buros (15). 

Masserman and Carmichael publish in Journal of Mental Science 
a report calling attention to the deleterious effects of psychiatric hospitaliza- 
tion. Many similar reports have been written since, some documenting a 
highly significant association between deterioration, as well as death, and 
psychiatric hospitalization. Some suggest causative relations. 

The APA sets up a Committee on Scientific and Professional
Ethics (3). 

1939—The term “projective methods” is introduced by L. K. Frank (12). 
The Wechsler-Bellevue WB-I is developed (12). Because of its
limitations the WB-I was supplanted by the Wechsler Adult Intelligence 
Scale (WAIS) developed in 1955, which is being revised in 1971. 

1940—The Minnesota Multiphasic Personality Inventory (MMPI) is 
introduced by the psychologist Hathaway and psychiatrist McKinley. 

The “Psychodrama Institute” is formed in the United States by
Moreno (15). 

Harry Stack Sullivan founds the “interpersonal theory of psy- 
chiatry” which accords significant attention to environmental forces in shaping 
abnormal behavior. The overemphasis on organic-chemical determinants of, 
for example, schizophrenia is shaken up. 

Carl Rogers describes nondirective or client-centered therapy (12)
in the 1940 volume of the Journal of Consulting Psychology. 

1942—The American Group Psychotherapy Association is founded (12). 
The APA and the American Association of Applied Psychology
(A. A. A. P.) begin making tentative plans for a constitution that would 
unite the two groups (12). 

1944—A joint committee, consisting of members of the APA and the 
A. A. A. P., meets at Vineland to consider the topic of “Graduate Intern- 
ship Training in Psychology” (16). 

The A. A. A. P. joins the APA under a new constitution which
allows for the establishment of 18 divisions (16). Of these, one division was 
called Division of Clinical and Abnormal Psychology which, in 1954, was 
renamed Division of Clinical Psychology. Very significantly the new organiza- 
tion defined its functions as dedication “to advance psychology as a science, 
as a profession, and as a means of promoting human welfare.” The
A. A. A. P.’s organ and legacy, The Journal of Consulting Psychology, continues
to be published but under the management of the APA. The journal’s
name was changed to Journal of Consulting and Clinical Psychology in 1968. 
Over 200 clinical psychologists are commissioned by the Army to 
serve in military hospitals and rehabilitation centers (16). 

1945—A report by the Subcommittee on Graduate Internships recommends 
training programs for clinical psychologists in which the internship would be 
an integral and required part (17). 

A Committee on Graduate and Professional Training is established 
by the APA (17). 

Connecticut passes laws governing the title and functions of psy- 
chologists (3). 

The Veterans Administration appoints George A. Kelly as a con- 
sultant to initiate a new clinical psychology program (3). 

1946—An internship program in clinical psychology is instituted by the 
VA and receives approval by the APA (16). James Miller becomes the first 
full-time head of the VA clinical psychology program (3). 

The VA meets with representatives of 22 universities and asks them 
to undertake training of clinical psychologists (3). 
The National Mental Health Act is passed (1). This gives a boost 
to preventive and community mental health efforts. 
F. Alexander, who migrated to the United States in 1930 (12), 
and Thomas French advocate the use of “brief psychotherapy” (1). 
Frankl introduces logotherapy, including the “paradoxical inten- 
tion” technique, as well as the concept of noogenic neurosis—an existential 
perspective on emotional disturbances and treatment (1). 
Organ of the APA, the American Psychologist, begins publication. 
Clinical psychology celebrates its 50th anniversary at the APA’s 
first postwar convention (16) which was held in Philadelphia. See year 1896. 
1947—The American Board of Examiners in Professional Psychology 
(A. B. E. P. P.) is established with the blessings of, but as a separate corpora- 
tion from, the APA (17). In 1968 the Board makes dramatic changes, among 
which one that essentially tells the candidate: “You tell us the specialty that 
you practice and we will examine whether or not you do it with high com- 
petence.” This is a significant departure from the “know-it-all” examination. 
The National Institute of Mental Health begins its program 
of sponsoring training grants in clinical psychology (16). Through 1970, 
NIMH had contributed close to 95 million dollars in support of doctoral 
training programs in clinical psychology.
The Committee on Graduate and Professional Training in Clinical
Psychology (C. T. C. P.) led by Shakow presents its report which firmly 
sets the scientist-professional pattern for the training of clinical psychologists 
(16). See relevant entries in 1907 and 1925. 
A listing of accredited schools is published by the C. T. C. P. This 
was the first step in the direction of accreditation by the APA (3). 
The APA accepts the recommended standards for doctoral educa- 
tion in clinical psychology (3). 
D. Cameron introduces the “day hospital” notion (1) which is 
related to both the “aftercare” movement (see entry in 1841) and the current 
“half-way house” practice. 
The Educational Testing Service is established at Princeton, New 
Jersey (12). 

1948—Inspections, via site visits, are made for the first time by the 
C. T. C. P. of accredited schools. The C. T. C. P. recommended also that 
site inspections be made every three to four years (3). 

1949—The grandfather clause, allowing a member to be certified as a 
clinical psychologist on the basis of experience, expires (17). 
The first written exams of the A. B. E. P. P. are administered 
(16), and the first orals in 1950. 
The Boulder Conference is held. The scientist-professional model
of doctoral training is adopted as the guiding philosophy for clinical train- 
ing. The report on the conference was published in 1950 under the editorship 
of V. Raimy.
The Wechsler Intelligence Scale for Children (WISC) is pub-
lished (12). 

——Salter publishes Conditioned Reflex Therapy, a pioneering work in 
behavior therapy, largely anchored in Pavlovian framework. 

1950—First International Congress of Psychiatry is held in Paris (15). 
It is interesting to note that while the American Psychiatric Association was
founded 48 years before the APA, psychology held its first international
congress in 1889.
Dollard and Miller’s Personality and Psychotherapy is published. 
The book attempts a rapprochement between psychoanalysis and S-R theories. 
Erikson publishes Childhood and Society, a book that extends the 
psychoanalytic developmental stages beyond puberty and explicates the concept 
of ego identity. 


The Interamerican Society of Psychology and the International
Union of Scientific Psychology are founded (15).
The C. T. C. P. publishes its recommendations for standards in
practicum training in clinical psychology (16). 
The Statement of Principles and Practices in Evaluation, which
deals with accreditation, is adopted by the APA (3). 
The Ph.D. is made a requirement for employment as a clinical 
psychologist by the VA (16). 

1951—The Committee on Professional Training Below the Doctoral 
Level presents a report of a survey questionnaire which indicates that the 
quality of training was considered more important than the amount. The 
Committee then requested McTeer to survey graduate psychology depart- 
ments to determine their views toward training at the nondoctoral level 
(20). McTeer publishes the results in 1952. Predoctoral training continued 
to be a neglected stepchild, despite many surveys and discussions, until the 
historic actions of 1971 (which see).
The APA sets up the Committee on Practicum Training. The 
duties of this committee include developing standards and conducting evalua- 
tions of internship agencies (3). Until this date the concern was limited to 
programs of educational institutions. 
A tentative code of ethics is developed for clinical and consulting
psychologists (12). 

1952—The APA Committee on Test Standards attempts to clarify some 
of the issues that have evolved in validating diagnostic techniques (12). 
A resolution is passed by the American Psychiatric Association
favoring the certification of clinical psychologists (12). 
H. Eysenck publishes a controversial report on the effectiveness of 
psychotherapy (12). There have been many replies and rejoinders since, and 
the controversy has abated but is not dead yet.
Five regional conferences are sponsored by the E. T. B. which deal 
with the problems of predoctoral, particularly M. A.-level, training in psy- 
chology (20). See events of 1953. 

——Three alternative plans for the internship are proposed by Foster, 
Benton, and Robin (3). 

———The ethical code is officially adopted by the APA (16). 

1953—Information is surveyed by the Committee on Subdoctoral Educa- 
tion concerning training and education at the subdoctoral level (20). 

———Five regional conferences concerning education and training at the
subdoctoral level are sponsored by the E. T. B. These are different from 
the ones held in 1952 (20). 

———The Taylor manifest anxiety scale is developed (15). 


———Skinner and Lindsley introduce the term “behavior therapy.” See
relevant entry in 1966. 

Maxwell Jones expounds his theory of “milieu psychotherapy” (15). 
Project B, which deals with the occupational, educational, and 
institutional implications of psychology, is begun by the APA’s Advisory 
Committee on the Development and Status of Research and Education in 
Psychology (20). 

A preliminary report is given on Project A by the Advisory Com- 
mittee on the Development and Status of Research and Education in Psy- 
chology. Project A is concerned with the need for workers at the subdoctoral 
level (20). 

The subcommittee on clinical psychology, which is a part of the 
E. T. B., is replaced by the Committee on Evaluation (3).

1954—The Committee on Subdoctoral Education meets and begins to con- 
sider the increasingly pressing issues in subdoctoral education (20). 

A preliminary report on Project B, published by K. B. Clark in 
the American Psychologist, is given by the Advisory Committee on the Status 
of Research and Education in Psychology. In addition, research plans are
made concerning the issue of subdoctoral graduates. The final report was 
published in 1957 (20). 

The American Board of Psychological Services (A. B. P. S.) is
established. In a sense this was a policing, information resource and certifying
agency working closely with the APA (3), but rather at cross-purposes in
some respects with the A. B. E. P. P. The A. B. P. S. was dissolved in 1961.
A summary of the various conferences sponsored by the E. T. B.
is published by B. V. Moore in the American Psychologist. These conferences 
dealt with the issues of training at the subdoctoral level (20). 

Clinical vs. Statistical Prediction is published by Paul Meehl. 
Robert R. Holt’s reply was published three years later (9). 

1955—A joint report is published concerning policy on legislation. This 
report was developed by a special committee of the APA and the Committee 
on Legislation of the Conference of State Psychological Associations (3). 

The Stanford Conference is held. Briefly, this conference dealt with
various aspects of training of a clinical psychologist (3). No change in models
is proposed, but greater emphasis was laid on research, as well as on the need
to attend to sociocultural factors in behavior, than was the case in the Boulder
Conference.


———The practice of giving Gold Medal Awards for outstanding work
in psychology is begun by the APA. The first recipient was Robert Woodworth
(12).

1956—In the annual report of the E. T. B. a dim view is taken of the 
report submitted by the Committee on Subdoctoral Education in 1955 (20).
The E. T. B. is reorganized, and the Committee on Subdoctoral
Education is disbanded and replaced by the Task Force Committee on Sub- 
doctoral Education (20). 

The first APA Distinguished Scientific Contribution Award to be
bestowed upon a clinician is awarded to Carl Rogers (12). 
The first publication is made of approved internship agencies (3). 
1957—The American Psychiatric Association rescinds its 1952 resolution 
supporting the certification of clinical psychologists (12). 
The Task Force Committee on Subdoctoral Education led by Noble
Kelly presents a report about a questionnaire that was sent to various psychology
departments, state associations, and divisions of the APA. In general,
the report revealed that many universities did not wish the APA to tell them 
how to manage their programs (20). Apparently this reaction led to inaction 
and demise of the Task Force. A new one was set up in 1970 (which see). 

1958—A contract is drawn between the APA and VA in which the APA 

agrees to conduct evaluation visits to various VA clinics and hospitals for the 
purpose of evaluating internship programs (3). 
The Miami Conference is held. This conference dealt with the 
evaluation and solutions of problems that many graduate schools are having 
in relation to their graduate education programs in psychology (3). The issue 
of predoctoral training was tackled more seriously than in previous conferences.

——Publication of Wolpe’s Psychotherapy by Reciprocal Inhibition. 
This is a landmark in the development of behavior therapy—a group of 
methods independently proposed in the early fifties in South Africa by Wolpe, 
by Shapiro in England, and by the Skinnerians in the United States. 

———A conference dealing with research in psychotherapy is sponsored 
by the APA and the Division of Clinical. The proceedings were printed in 
1962. A second conference was held in 1961, and its proceedings published in 
1962 as Volume II. Volume III was published (also by the APA) in 1968. 

1959—A moratorium is called by the APA on the further evaluation of 
practicum internship training agencies. In addition, the ad hoc Committee on
Problems of Internship Evaluation is established. This action was prompted 
by such considerations as lack of funds for site visits and problems issuing
from accrediting programs of universities on the one hand and of practicum
agencies on the other (3). 
A report by Kelly and Goldberg indicates that 40% of the clinical
psychologists who were interviewed expressed the view that they wished that 
they had chosen another career (11). 
Founding of the Psychonomic Society (15)—a concrete expression 
of concern, if not protest, over the status of psychology as a science. 
In response to the Miami Conference, the E. T. B. establishes a 
planning committee for a Conference on the Roles and Training of Workers 
in Psychology at the predoctoral level (20). 
The APA Board of Professional Affairs holds a meeting of state 
statutory and nonstatutory examining boards to consider some of the issues 
related to reciprocal endorsement (3). 
1960—The Planning Committee for a Conference on the Roles and Train- 
ing of Workers in Psychology at the predoctoral level submits a report which 
was apparently never taken seriously (20). 
The contract between the APA and VA which had been drawn 
in 1958 concerning evaluation visits is cancelled for the same reasons that 
prompted the moratorium on evaluation of training agencies (3). See 1959. 
1961—The moratorium that had been called in 1959 on evaluation of 
internship programs is now lifted. The APA decides to assess a fee of $200.00 
per day to be paid to members of the Committee on Evaluation who conduct 
site visits (3). 
—The A. B. P. S., founded in 1954, is dissolved. However, the 
American Association of State Psychology Boards is established in order to 
protect the public and the psychologist by maintaining adequate standards (3). 
Concern over laws passed by New York and California permitting
disclosure of psychological information to nonclients (parents and state
authorities) prompts the APA to establish an ad hoc Committee on Confidentiality
of Records to consider these matters (2).

1962—The Princeton Conference on Manpower and Psychology Training
is held (3). Unlike previous conferences this one dealt at length with
practicum training and relations between these agencies and programs of
universities.
The Committee on the Practice of Clinical Psychology is established
by the APA after the Committee on State Examination Procedures was replaced
by the American Association of State Psychology Boards (3). The
purpose of the APA’s Committee is to collect information, make recommendations,
and, hopefully, coordinate work of Boards.
A. Ellis publishes Reason and Emotion in Psychotherapy, the first
complete presentation of his rational-emotive therapy. 

1963—The Gould House Conference on an Ideal Program for Training of 
Psychotherapists is held. The participants were 32 clinical psychologists,
psychiatrists, psychoanalysts, and social workers. For a discussion of the
proceedings and an extended commentary and updating, see Holt (10).

Rioch and her associates publish their report on the comparative 
effectiveness as counselors of housewives and professional psychotherapists— 
to the embarrassment of the latter (13).

In an address to Congress, President Kennedy accords the area of 
mental health focused attention. The same year the federal government passes
the Community Mental Health Center Act which provides funds for the
construction of mental health centers.

The code of ethics of the APA is revised and extended.

The ad hoc Committee on Confidentiality of Records recommends
that the profession itself should be its own guard in order to prevent abuses 
and external interventions. It also recommended that a committee be formed 
to deal with matters of testing; e.g., definitions of roles and responsibilities 
of testers (3). This committee, still in existence, is now named the Committee 
on Social Impact of Psychological Assessment (2). 

1964—At a joint meeting the Connecticut Psychological Association and 
the Connecticut District Branch of the American Psychiatric Association discuss
matters of mutual concern and attempt to work toward more effective
communication between the two professions (8).

———Bradford, Gibb, and Benne edit a volume titled T-Group Theory
and Laboratory Method, which gives a broad perspective on sensitivity training
and other confrontation techniques very much in vogue in the late sixties
and which continue to be quite popular.

1965—The Community Mental Health Staffing Bill is signed into law.

—— Entire issue (No. 5) of volume 22 of the American Psychologist is 
devoted to the congressional hearings on psychological testing and its implications,
including invasion of privacy.

——— The Chicago Conference on the Professional Preparation of Clinical 
Psychologists is held (3). Its proceedings were published by the APA under 
the editorship of Hoch, Ross, and Winder in 1968. The scientist-professional 
model is reaffirmed. Lip service is paid to predoctoral training recommending 
that the matter be referred to a special group for thorough discussion.

———A report by the Committee on the Scientific and Professional Aims 
of Psychology is printed in the American Psychologist. The Committee recommends
that a degree other than a Ph.D. be awarded to graduates of
programs training practising psychologists. A “doctorate of psychology” is 
proposed. Much emphasis is placed in this report on the need for professional- 
ization in psychology, especially training in psychology. See possible implica- 
tions of this in entries in 1969 and 1971. 

1966—NIMH changes its regulations so that any mental health professional 
(not only a psychiatrist) can be a director of an NIMH community mental 
health center. 
Association for Advancement of the Behavioral Therapies is founded 
with Cyril M. Franks as its first president. The Association’s name was 
changed in 1969 to Association for Advancement of Behavior Therapy, and 
its organ, Behavior Therapy, started in 1970. 

1967—Stanford drops its clinical program. Its accreditation by the APA 
continues until 1969 even though Stanford had no program. Harvard dropped 
its clinical program in 1970, and its name disappeared from the list of APA 
accredited institutions (for training in clinical) in 1971. 

1968—The name of the A. B. E. P. P. is changed to the American Board of 
Professional Psychology (12). Other and more significant changes were 
introduced. See 1947. 
An ad hoc committee to consider the standards and criteria for 
accreditation of doctoral programs in clinical psychology is developed (4). 
The University of Illinois introduces the “doctorate of psychology” 
program in clinical. (Illinois’ program received APA “provisional” approval, 
for three years, in 1970.) The first professional school of its kind (California 
School of Professional Psychology) begins instruction at the Los Angeles 
campus in 1969 and at San Francisco in 1970. Despite earlier intentions to 
award a Psy. D., the school decides to award a Ph.D. Baylor is the first 
private university to follow Illinois’ lead by introducing a “doctorate of 
psychology” program in 1971. 
The entire May issue of the American Psychologist is devoted to
the problems of manpower needs in mental health. See the 1962 entry on 
the Princeton Conference. 


1969—The recommendations of the ad hoc committee on accreditation of 
doctoral programs are adopted. The new procedures of accreditation em- 
phasize due process and fair play, and provide a mechanism for appeal of 
APA actions if institutions wish to challenge such actions. See relevant items 
in 1967, and see 1971 for guidelines which were touched upon in this report. 
The American Psychoanalytic Association kills an amendment
which, if passed, would have accorded the nonmedical analysts full membership
status (10, p. 392). 

1970—A Task Force on Master’s Level Training, under the chairmanship 
of Charles Thomas, is named by the E. T. B. to examine all issues relevant 
to master’s level education (20). The Task Force held two regional con- 
ferences, one at Atlanta for the South in 1970, and one for western United 
States in 1971. 

The Information Clearinghouse on Precollege Psychology is developed
by the APA to provide assistance and guidance for the teaching of high
school psychology.

1971—M. M. Nawas presents at the XIIIth Congress of the Interamerican
Society of Psychology, held at Panama, a tentative proposal for an eight-year 
program leading to a “doctorate in mental health” allowing its products to do 
what clinical psychologists as well as psychiatrists do, including the prescrip- 
tion of drugs. 

The APA circulates to departments guidelines for doctoral training
in clinical, school, and counseling psychology. (See relevant entry in 1969.) 
Although the scientist-professional model remains the guiding light for train- 
ing, the APA is more open to innovations. 

The APA Council of Representatives overrules the Board of Direc- 
tors and takes historic action on behalf of master’s degree training. The Coun- 
cil endorses the recommendations of the Task Force (see 1970) which include 
according master’s degree holders full membership status beginning 1976, 
urges funding agencies to support master’s level training, and directs the 
APA staff to take steps for consideration of certification by states. The Coun- 
cil’s action will have to be voted on by the APA membership. Albee, angry 
with this action, predicts that it will weaken the organization and will probably
not pass. Deese, another Board member, favors the action. (See front-page
article in January-February, 1972, issue of APA Monitor.)

———A. Lazarus publishes Behavior Therapy and Beyond, a concise presentation
of techniques used by behavior therapists but of which some are
clearly nonbehavioristic. This may well mark the beginning of rapprochement
with psychotherapy and the ending of the “holier than thou” attitude that
extremists on both sides assume.

THE COMPLETION HYPOTHESIS AND THE
COMPENSATORY DYNAMIC IN 
INTERSEXUAL ATTRACTION 
AND LOVE* 


University of California, Los Angeles 


Ricuarp Crenrers 


A. Inrropucrion 


Freud (12), in one of his comments on the nature of love, indicated that 
at least a part of its origin derives from a completion principle, saying, in 
effect, that one characteristically falls in love with a particular person because 
that individual is seen as representing a perfection which the loving one has
striven, unsuccessfully, to attain.¹ Reik (20, 21), the psychoanalyst who has
written most extensively on the subject of love, has both echoed and amplified 
this particular Freudian view. He points out that love for another begins in 
an unconscious and powerful striving on the part of the individual to complete 
his ego-ideal. “The mate is chosen because he or she is needed for this aim, 
because [he or] she fulfills this image of our ideal” (21, p. 97). More 
concretely, he says, “The object [of love] has qualities which the lover lacks: 
it achieves where he fails; it is successful, strong and self-confident where he 
feels frustrated, weak and doubtful, full of contradictions and dissatisfied” 
(21, p. 152). 

Cattell (7) and Cattell and Nesselroade (8) have more explicitly formulated
the completion principle for mate selection as follows:


Choice in marriage is directed . . . by a desire to possess characteristics
(by sharing them in the possessed partner) which are felt by the individual
to be necessary to his self-concept or to his or her social and
general life adjustment in marriage. For example, a socially awkward
person might especially value a partner who is socially adroit and
poised. Moreover, a person who is not managing his affairs too intelligently
would obviously experience gains from a wiser partner. It is
not part of the principle that this must occur consciously, but only that
it is sought and conduces to marriage stability if achieved (8, p. 351).


The principle is further particularized by Cattell and Nesselroade in their
statement that “every person tends to seek in a partner much the same set
of desirables—good looks, intelligence, emotional stability, etc., but more so
to the extent that he or she lacks them” (8, p. 356).²

It was to the success or nonsuccess of marriages that Cattell and Nessel- 
roade’s own research was primarily addressed, but in it, in their comparisons 
of the personality responses of stably and unstably married couples, no ap- 
preciable supporting evidence for the completion principle was found. Al- 
though, it is thought, the completion hypothesis is more strictly appropriate 
to accounting for mate selection than to marital stability and satisfaction, no 
work in the former context which has expressly tested the theory has been 
found in the psychological and sociological literature in social psychology. 

Careful consideration of the implications of Cattell’s formulation as applied 
to mate selection, however, leads the writer to regard it as having quite 
dubious validity in terms of the principles he himself has seen as applicable 
to explaining marital choice. It leads to predictions of relationships between 
the personality characteristics of partners in intersexual dyads, particularly 
fiancés and spouses, that are contraindicated both by the writer’s own theory 
of intersexual attraction and by the empirical findings resulting from tests 
of it. 

In an as yet unpublished monograph³ the writer has developed and tested
an exchange theory⁴ of intersexual attraction and love, which holds, in effect,
that we are maximally attracted to and come to experience love feelings most
strongly toward that person or those persons who are maximally instrumental
to the gratification of our most important needs.⁵ More formally stated, the
theory proposes that “in intersexual dyad formation each person seeks, among
his circle of acquaintances within the compass of his self-acknowledged compeers,
to form a relationship with that person or those persons whose behavioral
attributes and other resources provide, or are perceived or are
expected to provide, maximum gratification and minimum punifications⁶ for
his needs. Where reciprocal gratification of needs obtains, mutual attraction,
affection and love are expected to result.”


² Italics this writer’s.
³ “Sexual attraction and love: An instrumental theory.”
⁶ Because our language contains no exact opposite to the term gratification and one
was needed, this term was invented to serve that purpose in instrumental theory.
It may be used in any form in which its opposite appears: e.g., as a verb, to punify;
as adverb, punifyingly; as adjective, punified; etc.

The empirical findings in previous tests of the basic postulate of the instrumental
theory have been exclusively concerned with the relations between
the needs of intersexual dyadic partners rather than with resources. However,
one thing they indicate quite clearly is that there are varying degrees of
similarity in the need strengths of these partners on a large variety of needs.
It is, in terms relating needs to resources (i.e., abilities and attributes), hardly
conceivable that the attraction between persons with similar needs would
obtain if it were not for the circumstance that the partners also possess the
resources that would serve to gratify the needs in question. What is more, the
resources to gratify the needs would almost certainly have to be similar rather
than different. Hence, in keeping with this reasoning, to the above postulate
a corollary may be tentatively added: To the extent that the needs of such
attracted persons are similar, then those persons selected as lovers or spouses
in virtue of their provision of the gratifications for the needs will be found
to be similar in actual or perceived resources as well. In order for the corollary
to be valid it must, it is thought, also be assumed that the resources that will
serve to gratify a given need are relatively specific to it, so that not just
any ability or attribute in another person could serve the same function. This
is an easy assumption to make, however, unless a very stringent specificity of
need and relevant resource is demanded, and it is not believed that it is. At
any rate, the proposed corollary may be employed as an hypothesis in the
context of the present indagation. It may be specifically restated to the effect
that persons of opposite sex who have already behaviorally indicated attraction
to and love for each other by becoming engaged to be married will have
chosen as partners those with behavioral resources relevant to need gratification,
but, more particularly, personally and socially valued abilities and attributes,
which are similar to, rather than different from, their own. Still
more specifically, the interfiancé correlations on indices of such abilities and
attributes should be positive in direction and of a magnitude significantly
greater than zero or whatever value might be expected on the basis of the
operation of chance factors.
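
The prediction that an interfiancé correlation is positive and reliably greater than zero can be checked with the usual t test for a Pearson coefficient. The following is a minimal sketch only, not the writer's own computation: the function names are hypothetical, the sample size of 50 comes from the 50 couples described under Subjects and Procedure, and the one-tailed critical values of t for 48 degrees of freedom are standard tabled values rather than figures reported in the paper.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length rating lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def t_from_r(r, n):
    """t statistic (df = n - 2) for testing a correlation against zero."""
    return r * math.sqrt((n - 2) / (1 - r * r))

def min_significant_r(t_crit, n):
    """Smallest positive r that reaches a given one-tailed critical t."""
    return t_crit / math.sqrt(t_crit ** 2 + (n - 2))

# Assumed tabled one-tailed critical values of t for df = 48 (n = 50 raters).
T_CRIT_05 = 1.677
T_CRIT_01 = 2.407

r_needed_05 = min_significant_r(T_CRIT_05, 50)
r_needed_01 = min_significant_r(T_CRIT_01, 50)
```

Under these assumptions a coefficient of roughly .24 or more would be significant at the .05 level and roughly .33 or more at the .01 level, which gives a yardstick for reading the coefficients discussed in the text.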

This prediction is directly opposite to that deriving from the Cattellian
completion hypothesis: “every person tends to seek in a partner much the
same set of desirables—good looks, intelligence, emotional stability, etc., but
more so to the extent that he or she lacks them” (8, p. 356).⁷ What is clearly
specified is that persons are attracted to those whose resources are different
from their own and, hence, negative interfiancé correlations should obtain.
The identical prediction, of course, follows from Reik’s formulation: “The
object has qualities which the lover lacks: it achieves where he fails . . .”
(21, p. 152).

In the light of the principles of instrumental theory, as well as a matter of
common sense, it must be reiterated, the completion theory is extremely
implausible and seemingly irreconcilable with what the former theory maintains.
More specifically, it is unacceptable in that it cannot readily be reconciled
with the proposition of the instrumental theory that requires reciprocal
gratification of needs as a condition of attraction and love. The person
deficient in need gratification resources does not have much to offer in exchange
for the need gratification potentialities afforded by one highly endowed
or equipped with such resources. To use one of Cattell’s examples: a “socially
awkward person,” even if strongly attracted to a “socially adroit and poised”
one, is not very likely to have his attraction reciprocated and thus to win
such a person as a mate because there is every likelihood that the latter would
be punified by his behavior and repulsed by it. To be sure, if he has other
need gratifying resources of such high exchange value in the mate and
marriage market that the “socially adroit and poised” person can find the
punifications incurred and the gratifications foregone in interacting with the
awkward one offset or compensated for by them, he could possibly gain his
affection on the basis of such an exchange. Perhaps such bargains are struck
occasionally, perhaps with some frequency, but the data indicative of this
appear wanting.

The main purpose of the investigation to be described is to compare the
predictive efficacy of the discrepant formulations of completion theory and
instrumental theory, respectively. This can be accomplished quite simply, it
is thought, for, whereas the completion theory predicts negative correlations
to obtain between the abilities and attributes of love partners, the latter theory
predicts these to be positive.

Before proceeding with this, however, a brief discussion clarifying the role
of perceptual processes in each theory is required.

Although not explicitly stated in the completion theory, it is clearly indicated
that both the postulated phenomenon of felt lack of adequacy in a
given socially valued attribute of personality and the attraction to another
who possesses a high degree of that attribute are matters mediated by perception
and judgment. That is, the person’s self-evaluation, based on
perceptual comparisons of himself with others, is presumably what generates
his feelings of lack in the first place, and his attraction to another must likewise
be based upon his perception that the other has in high degree the valued
attribute that would gratify his need for self-completion. He may, of course,
be wrong in both his perception of self and perception of the attractive other.
But that is of no matter to the theory. What is required for confirmation of
it is that his perception and judgment of the value of his own attributes be
different from his perception and judgment of the value of the attributes of
the other. More concretely, if he rates himself very low on an attribute, the
affianced partner should be rated very high by him on that same attribute.
Thus, as previously indicated, the interfiancé correlation should be negative.

In the instrumental theory, which is constituted of a system of postulates, 
one of these explicitly posits a mediating, essentially perceptual, process called 
need-resource resonance: “Every person is assumed to be consciously or un- 
consciously sensitized by his needs to the perception and subception of re- 
sources in others that have potentialities or actualities for gratifying them. 
In encounters with others he will respond to them with either feelings of 
attraction or repulsion in keeping with his conscious or unconscious sensing 
of their actual or potential resources for his gratification or punification.”


B. Research Method
1. Subjects and Procedure 


One hundred persons constituting 50 engaged couples served as subjects.
All were students at UCLA and were preponderantly undergraduates. Representatives
of no minority group were excluded, but Ss were required to be
native born and not more than 25 years of age. The age range obtained was
from 18 to 25 years, with the modal age being 21. Ss were recruited over a
three-year period by various devices. Some elected to participate as a means of
partial fulfillment of their subject time service in courses in psychology that
required this. Most responded to notices placed on bulletin boards about the
campus and inserted into the campus newspaper which, in addition to
appealing to their motivation to cooperate in a scientific research enterprise,
offered as compensation for their time a sum of $1.00 per couple. Some were
induced to participate by being directly approached for this purpose by the
writer’s research assistant. Ss were informed beforehand in each case that they
would be spending about an hour or less in taking some tests of personality
which were being developed and for which the investigator required some
normative response data from couples. The hope was expressed that such
tests might be useful in future premarital counselling, but, it was indicated,
the only immediate benefit participants could expect to derive from taking
them was an interpretation of their individual test scores.

Engaged partners were not required to report to the testing place as a pair,
but when they did so they were placed in separate rooms to preclude any
influence that interaction or even each other’s physical presence might have
had on their test responses. Each S, besides independently completing one or
more tests dealing with motivational variables, which are not of concern
in the present report, responded to an Abilities Inventory which required him
to rate both himself and his partner on a seven-point scale with values ranging
from one through seven, for each of 22 abilities and attributes. To balance
out any sequence effects, half the Ss were given forms requiring them to rate
first themselves and then their partner, and the other half received forms
requiring the reverse.


2. Instrument 


Inasmuch as the ego ideal has been featured in the completion hypothesis, 
and Ss’ average self-ratings on the Abilities Inventory have been found in
previous research (16) to be substantially (.65) correlated with this variable, 
it was regarded as especially suited to the present need. It was, however, 
augmented by addition of seven new items to the original 15. These were 
Domestic, Affectional, Occupational, Sartorial Abilities, Common Sense, 
Achievement, and Physical Attractiveness. 

Previous to use in this project, in order for the investigator to ascertain the 
contemporary social desirabilities of the items, as well as the sex for which 
each item was consensually considered more desirable, the inventory was ad- 
ministered to 296 upper division males and females in a course in social psy- 
chology. They were requested to rate each item on the seven-point scale for 
its desirability for each sex. Only those 21 items rated above the average 
step on the scale for both sexes were used in the present investigation. The 
items are listed in Table 1 in accordance with their relative desirability for 
the sex group for which each was considered to be the more desirable. 
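
The screening rule just described, retaining only items whose desirability falls above the average step of the seven-point scale for both sexes, can be sketched as a simple filter. This is an illustration under stated assumptions: the midpoint of 4 is taken as the "average step," the function name is hypothetical, and the mean ratings in the example are invented placeholders, not the values obtained from the 296 raters.

```python
SCALE_MIDPOINT = 4.0  # assumed "average step" of a scale running from 1 to 7

def retained_items(mean_desirability):
    """Keep items whose mean desirability exceeds the midpoint for both sexes.

    mean_desirability maps an item name to a pair:
    (mean desirability for males, mean desirability for females).
    """
    return [
        item
        for item, (for_males, for_females) in mean_desirability.items()
        if for_males > SCALE_MIDPOINT and for_females > SCALE_MIDPOINT
    ]

# Invented placeholder means, for illustration only.
example_means = {
    "Leadership": (6.1, 4.8),
    "Mechanical Ability": (5.2, 3.4),  # below the midpoint for females
    "Social Ability": (5.5, 6.0),
}
kept = retained_items(example_means)
```

With these placeholder values the item judged below average for one sex is dropped, which mirrors the way one of the 22 inventory items was excluded to leave 21.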


C. Results


In Table 2 are listed the coefficients of correlation between (a) the self-ratings
of males and their ratings of their fiancées and (b) the self-ratings
of females and their ratings of their male partners.

In only three instances are the negative correlations expected on the basis 
of the completion hypothesis in evidence, and in each case they are small and 
nonsignificant. Thirty-nine of the 42 correlations, on the other hand, are 
in the positive direction predicted on the basis of postulates of the instru- 
mental theory, and of these 39, 35 are significant at or beyond the .05 level. 
Thirty of the latter are significant at or beyond the .01 level. It would seem


TABLE 1
ABILITIES AND ATTRIBUTES CONSIDERED BY UNIVERSITY STUDENTS OF BOTH SEXES AS
SOCIALLY DESIRABLE FOR PERSONS OF EITHER SEX, BUT AS SOMEWHAT MORE
DESIRABLE FOR PERSONS OF A GIVEN SEX

Abilities and attributes considered     Abilities and attributes considered
by both male and female raters          by both male and female raters
to be more desirable                    to be more desirable
for males                               for females

 1. Achievement                          1. Physical Attractiveness
 2. Leadership                           2. Erotic Ability
 3. Occupational Ability                 3. Affectional Ability
 4. Economic Ability                     4. Social Ability
 5. Entertaining Ability                 5. Domestic Ability
 6. Intellectual Ability                 6. Sartorial Ability
 7. Observational Ability                7. Interpersonal Understanding
 8. Common Sense                         8. Art Appreciation
 9. Athletic Ability                     9. Moral-Spiritual Understanding
10. Theoretical Ability                 10. Art-Creative Ability
11. Scientific Understanding
12. Mechanical Abilityᵃ

ᵃ Mechanical Ability is an exception in that it was considered by both sexes to be
below average in desirability as an ability for a female.


TABLE 2
CORRELATIONS OF RATINGS OF SELF AND PARTNER OF MALE AND FEMALE FIANCÉS
ON VARIOUS ABILITIES AND ATTRIBUTES

Columns: (1) male’s self-rating with his rating of his fiancée;
(2) female’s self-rating with her rating of her fiancé.

Variables: Athletic Ability, Leadership, Social Ability, Economic Ability,
Erotic Ability, Entertaining Ability, Observational Ability, Intellectual
Ability, Scientific Ability, Interpersonal Understanding, Art Appreciation,
Art-Creative Ability, Theoretical Ability, Moral and Spiritual Understanding,
Domestic Ability, Affectional Ability, Common Sense, Achievement,
Occupational Ability, Sartorial Ability, Physical Attractiveness.

[The individual coefficients are illegible in this copy of the table.]

* Significant at or beyond .05 level (one tail).
** Significant at or beyond .01 level (one tail).


from such massive evidence to the contrary that the espousers of the completion
theory have not adequately understood intersexual attraction dynamics.
Affianced partners of both sexes in this sample have clearly chosen mates
whom they perceive, not as different from themselves in abilities and socially
valued attributes, but as similar.
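
The lopsidedness of the count reported above, 39 positive coefficients out of 42, can be gauged with an exact sign test. This is a supplementary sketch, not an analysis reported in the paper; the function name is hypothetical, and the test treats the 42 coefficients as independent, which is only approximately true since all of them come from the same 50 couples.

```python
from math import comb

def sign_test_upper_tail(k, n):
    """Exact probability of k or more positive signs among n under a fair coin."""
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n

# 39 of the 42 correlations are in the positive direction.
p_value = sign_test_upper_tail(39, 42)
```

Under the stated independence assumption the resulting probability is far below any conventional significance level, consistent with the conclusion drawn in the text.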

Correlations, being net resultants, can sometimes, however, obscure particular
minor tendencies in samples of responses such as these. It is possible
that persons who perceive themselves as below average, or “low,” in given
abilities and attributes do tend to pick partners who are higher in such characteristics
than those chosen by persons who are themselves above average,
or “high,” in them, but that this tendency is obscured by the major trend indexed
by the single correlation coefficient. To examine the data for such a possibility,
and also to test the alternate hypotheses in a different way, subjects
were segregated into high and low categories in terms of whether their self-ratings
on a given attribute were above or below the mean of self-ratings of
their sex group for the attribute. Then, the mean ratings assigned to the
partners of persons in each category were computed, the differences between
means ascertained, and finally, tested for significance. Table 3 summarizes the
findings.
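
The high-low comparison described above can be sketched in a few lines. Several points here are assumptions rather than details given in the paper: the text does not name the significance test used, so Welch's unequal-variance t statistic is substituted; self-ratings exactly equal to the group mean are simply dropped; the function names are hypothetical; and the ratings in the example are invented for illustration.

```python
import math
from statistics import mean, variance

def split_partner_ratings(self_ratings, partner_ratings):
    """Group partner ratings by whether the rater's self-rating lies above
    or below the mean self-rating of the rater's own sex group."""
    cut = mean(self_ratings)
    pairs = list(zip(self_ratings, partner_ratings))
    high = [p for s, p in pairs if s > cut]
    low = [p for s, p in pairs if s < cut]  # ties with the mean are dropped (assumption)
    return high, low

def welch_t(a, b):
    """Welch's t statistic for the difference between two independent means."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se

# Invented seven-point ratings for one attribute within one sex group.
self_ratings = [2, 3, 3, 4, 5, 6, 6, 7]
partner_ratings = [3, 3, 4, 4, 5, 6, 7, 6]

high, low = split_partner_ratings(self_ratings, partner_ratings)
t_stat = welch_t(high, low)
```

A positive statistic means the High self-raters rated their partners above the level the Lows assigned to theirs, the direction the instrumental theory predicts; a negative one would favor the completion hypothesis.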

With the response data arranged for comparison in this fashion, if the
completion hypothesis were valid (“every person tends to seek in a partner
much the same set of desirables, good looks, intelligence, emotional stability,
etc., but more so to the extent that he or she lacks them”), then it should be
found in examining the differences in ratings between the High and Low
self-raters in Table 3, that the Lows would have chosen partners whom they
would rate high. Further, the values of the ratings of their partners assigned
them by the Lows should exceed those given to their own partners by the
Highs.

On the other hand, if the prediction of similarity, stemming from the
instrumental formulation, is well grounded, the High self-raters should be
found to have rated their partners high, and the Low self-raters would be
found to have rated their partners low. The magnitudes of the differences
between the respective sets of ratings (Highs vs. Lows) should, of course, be
significantly greater than zero, or whatever value might be anticipated as a
result of chance.

Inspection of the patterns of choice revealed in Table 3 will show that it
is the instrumental theory that is supported by them. Among males, in all
21 cases the female chosen is perceived by the High males as higher on the
attribute or ability than the female selected by the Lows. Seventeen of the
differences between the 21 mean ratings are at or above the .05 level of significance.
Among females less consistency obtains, but in only three instances
do the Highs choose male partners who are rated by them as lower in an attribute
than the mean rating given to their partners by female Lows, and in no case
is the difference in the direction favoring the completion theory a significant
one. Of the 18 differences whose direction favors the instrumental theory, 11
are significant.


D. Discussion 


It is indicated in the statements of the completion hypothesis that have been 
cited earlier that a person perceiving himself as having a low degree of a 
socially desirable attribute will choose a mate who ranks high to an opposite 
degree. Cattell’s phrase “. . . but more so to the extent that he or she lacks 
them” and his example of the socially awkward person seeking one who is 
socially adroit and poised, both indicate that the partner sought would be one 
who possessed the attribute to a high degree, not in just a relative sense, but 
in an absolute one. The data presented thus far lend little credence to this 
view. The present research has indicated that most persons have selected 
mates, or at least have become engaged to persons, who are similar to them- 
selves in Social Ability and Entertaining Ability (good indices to social adroit- 
ness and poise, it is thought). The correlations between males’ self-ratings 
and their ratings of their fiancées are .52 and .50, respectively, for these 
abilities. The corresponding ones for females’ self-ratings and their ratings of
their fiancés are .36 and .37. All of these are significant at or above the .01
confidence level. 

Despite all of the foregoing there is everyday confirmation of the point
that individuals seek the socially desirable traits and characteristics in a mate
that Freud, Reik, Cattell, and countless other observers have noted and emphasized.
The error of their completion theory is that they have been overimpressed
by such seeking, and undiscerning of the restrictions and limitations
of the realities in which selection in terms of the social desirability of attributes
[occurs. The exchange value of the low ranking] individual, being low, prevents him
or her from establishing a relationship with a person whose resources are
greatly different from his own, while the high ranking individual with more
to offer is the one who is more likely to carry off the prize. The demand for
persons possessing the desirable attributes to a high degree is probably much
greater than the supply also, which even more restricts the individual’s likelihood
of “success beyond his means” in this exchange market.


E. An Alternate Hypothesis


Yet, if the “socially awkward” cannot often aspire to winning the “socially 
adroit and poised” mates they might like, one may hypothesize that the mates 
they do select are ones perceived by them to be at least more adequate than 
the selectors perceive themselves to be. It is reasonable to suppose that their 
very lack makes them more highly motivated than the better endowed to seek 
a partner whose rank in given attributes is seen by them as higher than their 
own. Moreover, provided they do not aspire too much beyond their means, 
or do not take too demanding a desserts stance, these low ranking individuals 
have a much larger field of choice than those whose rank is high, because 
these latter must compete for the always fewer higher ranking individuals. 

That persons will have established relationships with those they perceive 
to be merely better endowed in socially valued traits than they themselves 
are is, of course, not what is stated in the completion theory. The quite ex- 
plicit phrase, “but more so to the extent that he or she lacks them,” plainly 
indicates that the lower one ranks in a given ability or attribute the higher 
the rank of the mate he seeks should be. Even given the likelihood that the 
low rating individual will tend to distortion in his appraisal of his partner 
in the direction of overevaluation of him or her (1, 2, 4, 6, 10, 17, 18, 22), 
it has been seen that the partner is not rated as high on these abilities and 
attributes as the partners chosen by persons who do not have his “lack.” 

That individuals who rate their own abilities and attributes below average
will tend to select mates whose abilities and attributes are perceived by them
as at least higher than their own is an alternate formulation which, in place
of the completion hypothesis, can be tested in a very simple way with the data
at hand. However, the patterning of intermotivational dynamics underlying
intersexual attraction is an intricate one, containing both attraction producing
and repulsion creating forces. These cross-out and compromise each other in
many ways. One of these motivational forces is seen as operating strongly
here. Because of this it is necessary to qualify the foregoing hypothesis by
adding an important phrase: “An individual who rates his own abilities and
attributes below average will tend to select a mate whose abilities and attributes
are perceived by him as at least higher than his own, but only to a
degree commensurate with his need to gratify his sexual identity and sex role
maintenance and enhancement drive.”

In the as yet unpublished work previously referred to, such a drive was 
posited as one of the several most important forces underlying intersexual 
attraction and repulsion phenomena, and proved itself (at least to the writer)
an important key to understanding them. Space limitations make it inconvenient
to discuss it further here, except to say that it motivates persons of
the respective sexes each to select an opposite sex love partner whose be- 
havioral and other attributes, by conforming to the culturally and socially 
defined sex typing obtaining in the milieu, enable him or her to do likewise. 

It was explicitly noted earlier that some of the abilities and attributes 
under consideration are consensually more highly valued when found in a 
male, whereas others are thought to be more desirable for a female (review 
Table 1). The sexual identity drive is expected to manifest its operation by 
having patterned the data to be next considered in such a way as to reveal the 
low self-rating male more often than not to have become engaged to a female 
whom he does not rate above himself in the masculine abilities, but only in the 
feminine ones. Precisely the same sort of behavior should be found charac- 
teristic of the low self-rating female, but, of course, reversed with respect to 
the genderic congruency of the trait. To be somewhat more specific, in the 
interest of clarity, Leadership is an attribute associated much more strongly 
with the male identity and role than with the female identity and role, and, 
hence, most males would not be expected to seek and form relationships with 
females they perceived as possessing greater leadership ability than themselves. 
Females, on the other hand, would be attracted to and form relationships with
males with more leadership qualities than they themselves possessed, for this 
would facilitate sexual identity and role maintenance for both of them. 

A reflection of all these forces can be seen in the data and comparisons 
presented in Table 4. There, the ratings of partners chosen by persons of both 
sexes who rate themselves low and high respectively are shown.⁸ In the relative
terms of these comparisons the differences in rating patterns of low and
high self-raters are striking. Among Low males, if the ability or attribute
is a feminine one, such as Social Ability, Domestic Ability, Art-Creative
Ability, Sartorial Ability, or Physical Attractiveness, the partner is assigned
a higher rating than the self with very high incidence. Where the ability or
attribute is less congruent with the feminine gender and more congruent with
the masculine identity and role, such as in the cases of Athletic Ability, 
Leadership, Scientific Understanding, and Theoretical Ability, the percentages 


⁸ In the high self-rating category, since those who gave themselves the highest
rating, seven, could not rate their partners higher than themselves, a correction
was made for these relatively few cases by assuming that the same proportion of them
would have rated the partner higher, had it been possible, as did those who scored
lower (in the high category). This is regarded as effecting a more equitable comparison
than could otherwise be made, but the overall results are essentially
unchanged by the operation.


TABLE 4
PERCENTAGES OF PERSONS OF BOTH SEXES IN LOW AND HIGH SELF-RATING CATEGORIESᵃ
WHO PERCEIVE THEIR FIANCÉS TO BE HIGHER THAN THEMSELVES
IN VARIOUS ABILITIES AND ATTRIBUTES

                                           Percentages rating their fiancés higher
                                                Males               Females
                                 Genderic    Low      High       Low      High
Ability or attribute             emphasis    self-    self-      self-    self-
                                             raters   raters     raters   raters

Theoretical Ability                 M         12        0         83       69
Leadership                          M         13       11         60       40
Athletic Ability                    M         14        5         75       63
Scientific Ability                  M         21        9         80*      47
Common Sense                        M         30        3         63*      12
Achievement                         M         32*       0         58*      27
Entertaining Ability                M         36*       5         89*      19
Observational Ability               M         45*       0         78*      34
Intellectual Ability                M         45*      14         [?]      26
Economic Ability                    M         50*       0         83*      25
Occupational Ability                M         53*       6         82*      29
Erotic Ability                      F         41*       0         58*      [?]
Interpersonal Understanding         F         45*       7         [?]      [?]
Art Appreciation                    F         52*       8         32        6
Moral-Spiritual Understanding       F         59*      18         27       18
Affectional Ability                 F         60*      32         40       29
Physical Attractiveness             F         69*      33         50*       8
Domestic Ability                    F         70       60         [?]       3
Social Ability                      F         76*      19         [?]       5
Art-Creative Ability                F         79       59         42*      [?]
Sartorial Ability                   F         82*      45         43        1

[?] Entry illegible in this copy of the table.
ᵃ Ns are approximately 25 for each subgroup, varying slightly from one distribution
to the next (e.g., 26 and 24, 27 and 23, etc.) according to the magnitude of the
mean.
* Indicates a percentage that differs significantly (.05 level) from the corresponding
percentage of the “high” group.

of ratings higher than those’ of the self are comparatively small. The males 
who are high on self-rating, despite the fact that much smaller percentages 
of them rate their female partners higher, nevertheless display the same prin- 
ciple of genderic preference in their pattern of ratings. Finally, the same 
comparison between high and low self-rating females shows the same influence 
of the sexual identity dynamic. But here it apparently operates against a 
tendency, pronounced in the low self-raters, for females to rate the partner 
higher than the self more commonly than males do this. For the majority of 
abilities and attributes linked more strongly to femininity, the percentage of 
females rating the male partner higher is again, however, appropriately smaller 
than the proportion who rate him higher than the self on the more masculine
characteristics.
The above findings are not evidence favoring the completion hypothesis in
the terms in which it was originally formulated. Rather, the data simply
reflect the motivations of individuals, especially those who rate themselves 
below average in resources, to make compensatory selections of mates in terms 
of characteristics considered socially desirable as long as the characteristics are 
those that conduce to sexual identity and sex role maintenance. That low self- 
rating individuals do this much more consistently and emphatically than do 
high self-rating persons is, of course, a finding that may quite plausibly be 
interpreted to be the resultant of the same sort of motivational forces that the 
completion theorists assume to be operating. To that extent, at least, the data 
derived from the present investigation are in accord with the theory under- 
lying the completion dynamic. But such a dynamic, now that the limitations of 
its effectiveness are more or less clarified, had best be reconceptualized and 
differently named. It is proposed to term it the compensatory dynamic in in-
terpersonal attraction and love. The following is an attempt to state it in a
more formal and general manner. 

Individuals who perceive themselves to rate below the average person in 
respect to a given ability or attribute consensually considered socially desir-
able will, in their choices of friends, lovers, and spouses, be instigated more
strongly than those of average or better self-ratings to compensate for their
lack of excellence by establishing relations with those they perceive to be
better endowed in such a respect than they themselves are.

Such a formulation goes beyond, of course, the data adduced in support of 
the compensatory dynamic in the present investigation, for it has been gen- 
eralized to include same sex friends as well as those of opposite sex. Yet a 
fairly large amount of data regarding same sex friendship choices indicates 
that this is quite an appropriate step (3, 11, 14, 15, 19, 24).

In the most recent of such studies, that of Beier, Rossi, and Garfield (3), 
subjects were required to respond to the items of the Minnesota Multiphasic 
Personality Inventory in three different ways. First, they took the test them-
selves as subjects normally would, then responded to the items as they thought
their best friend would, and finally, as they thought their most disliked ac-
quaintance would respond. There were systematic differences in the compari-
sons of their own responses and those attributed to friends which are quite
compatible with the hypothesis of the operation of a compensatory dynamic.
More specifically, subjects tended to perceive that their friends were psy-
chologically stronger and better able to cope with the challenges of life than
they themselves were. The friend as compared with the subject was seen by
him as more sociable, less moody, less concerned with bodily functioning, less
incapacitated or handicapped by feelings of inadequacy, and more active and
more realistic than himself. Here, as in other studies on the same point, what 
is documented is only that the friend is seen as better endowed than the self, 
not that he is qualitatively in a different category, as the completion hypothesis 
asserts. 


E. Summary 


Because the completion theory of intersexual attraction maintains that in 
mate selection individuals seek to establish relationships with partners whose 
abilities and attributes are opposite to rather than similar to their own, as is 
implied in the writer’s instrumental theory, a critical comparison of the dis- 
crepant predictions of each theory was carried out with a sample of 50 engaged 
couples in order to determine which theory is more accurate. The engaged 
individuals’ ratings (on a seven-point scale) of both themselves and their 
fiancés on each of 21 socially desirable abilities and attributes were obtained 
for the purpose. 

Comparisons both in terms of correlations of self and partner ratings and 
in terms of high self-raters vs. low self-raters revealed the predictions derived 
from the instrumental theory to be on the whole confirmed, whereas those 
derived from the completion theory were not. 

A further hypothesis, to the effect that individuals do tend to be attracted 
to, fall in love with, and become engaged to opposite sex individuals whom 
they perceive to be at least better endowed than themselves was proposed, 
tested, and substantially confirmed. A consideration of the findings of research 
on same sex friendships found those also to be essentially congruent with the 
hypothesis, which, to distinguish it from the somewhat similar completion 
postulate of Freud, Reik, and Cattell, was designated the compensatory dy- 
namic in interpersonal attraction. 
PERSONALITY AND NONVERBAL RIGIDITY:
SOME PRELIMINARY FINDINGS* 1


Southampton College, St. Francis College, and Merrick, New York


WILLIAM E. SIMON, LOUIS H. PRIMAVERA, BERNARD KLEIN,
AND ROBERT M. CRISTAL


A. INTRODUCTION 


The personality construct of “rigidity” is one that has been extensively 
discussed in the psychological literature (e.g., 8, 13, 14, 15, 17, 19, 28). On 
an empirical level a number of different tests have been developed to measure 
rigidity (e.g., 1, 10, 16, 21, 22). Breskin (2), pointing out that most of 
these tests of rigidity are either verbal or problem-solving in nature, has re- 
cently developed a “non-verbal test of rigidity employing visual stimuli sug- 
gested by the laws of Praegnanz” (p. 1203). This test, the Breskin Rigidity 
Test (BRT), is predicated upon the assumption that “a more rigid person,
given the opportunity to express a preference between pairs of figures dif- 
fering only in ‘goodness of fit,’ will tend to select the ‘better’ fit” (p. 1204). 
The BRT consists of 15 such pairs of figures. For each pair the examinee 
is required to indicate which figure he prefers. Each of the 15 responses thus 
obtained is scored either 1, indicating a preference for the figure that is a 
good fit, or zero, indicating a preference for the figure that is not a good 
fit. These 15 scores are simply summed to yield an overall rigidity score. 
Consequently, the minimum possible rigidity score is zero, while the maximum 
possible rigidity score is 15. One advantage of the BRT is that it usually 
takes less than two minutes to complete. It is also a measure of rigidity that 
would appear to be largely uncontaminated by intelligence (4).
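
The scoring rule described above can be sketched in a few lines. This is only an illustration of the stated procedure (15 items, each scored 1 or zero, then summed); the function name and the example response pattern are hypothetical and do not come from the BRT materials.

```python
from typing import Sequence

N_ITEMS = 15  # the BRT consists of 15 pairs of figures


def brt_score(responses: Sequence[int]) -> int:
    """Sum 15 item scores (1 = 'good fit' figure preferred, 0 = other figure)."""
    if len(responses) != N_ITEMS:
        raise ValueError(f"expected {N_ITEMS} responses, got {len(responses)}")
    if any(r not in (0, 1) for r in responses):
        raise ValueError("each response must be scored 0 or 1")
    return sum(responses)


# Hypothetical response pattern for one examinee (not real data).
example = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0, 0, 1, 0, 1, 0]
print(brt_score(example))
```

The bounds follow directly from the rule: all zeros gives the minimum score of zero, and all ones gives the maximum of 15.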

In attempting to assess the theoretical meaning of nonverbal rigidity, as 
measured by the BRT, recent studies have investigated its relationship to 
a number of other variables. Breskin and Gorman (3) found that under- 
graduate females who were high in nonverbal rigidity were more field de- 
pendent, as measured by a group form of the Embedded Figures Test (12), 
than females who were low in nonverbal rigidity. Field dependence-indepen-
dence, in turn, is a dimension that has consistently been found to be related
to a number of personality characteristics (29, 30). It should be pointed out 
that the association between nonverbal rigidity and field dependence did not 
hold for a group of male undergraduates. In another study, Gorman and 
Breskin (9) found that flexible undergraduates were significantly superior 
to rigid undergraduates on tests of semantic redefinition [the Re-2 Object 
Synthesis Test (11)], inductive reasoning [the I-1 Letter Sets Test (7)], 
and drawing completion. The two groups did not differ on a test of associa- 
tional fluency [the FA-3 Associations IV Test (11)]. Breskin, Gorman, and 
Hochman (5) found significant negative relationships between performance 
on the BRT and performance on both the Stroop Color-Word test (27) and 
Schaie’s Test of Behavioral Rigidity (21, 22). More recently, nonverbal 
rigidity has been found to be unrelated to either intelligence in elementary 
school children (4) or academic achievement in high school and college (18). 
These latter two findings support the theoretical contention (3) that the 
BRT is “a test in which no mathematical or intellectual skills are necessary 
to receive either a high or low rigid score” (p. 541). 

The purpose of the present study was to elucidate more fully the nomo-
logical network surrounding nonverbal rigidity by exploring the relationship
between BRT scores and (a) ordinal position of birth, (b) psychological 
need structure, and (c) self-evaluation. 


B. METHOD


Subjects were 176 undergraduates attending a Catholic college for women 
in the Northeast. The mean of their age distribution was 19.8 years, while 
the standard deviation was 1.41. All Ss were requested to provide information 
regarding their ordinal position of birth. Ss were also requested to complete
the BRT, the Edwards Personal Preference Schedule—EPPS (6), and 20 
21-point self-rating adjective scales. 

The EPPS is one of the most frequently used instruments for measuring
psychological needs. It measures, on an ipsative basis, the strength of the fol- 
lowing 15 needs: Achievement, Deference, Order, Exhibition, Autonomy, 
Affiliation, Intraception, Succorance, Dominance, Abasement, Nurturance, 
Change, Endurance, Heterosexuality, and Aggression. 

The 20 21-point adjective scales on which Ss were asked to rate themselves
each consisted of an attractive or socially desirable characteristic and its 
unattractive or socially undesirable antonym. These 20 scales were as follows: 
unpopular-popular, careful-careless, dishonest-honest, independent-dependent,
unhappy-happy, sincere-insincere, insecure-secure, strong-weak, uninteresting-
interesting, polite-impolite, immature-mature, reliable-unreliable, optimistic- 
pessimistic, unambitious-ambitious, unintelligent-intelligent, unanxious-anxious, 
unsociable-sociable, confident-unconfident, uncreative-creative, and productive- 
unproductive. The left-right position of the two characteristics composing each 
of the 20 scales was randomly determined with the one stipulation that the 
attractive characteristics be on the left half of the time, and on the right the 
other half of the time. The order of presentation of the scales was also 
randomly determined. 


C. RESULTS AND DISCUSSION


Scores on the BRT ranged from zero to 14 with the mean of the distribu-
tion equal to 6.64 and the standard deviation equal to 3.59. These values
appear to be generally consistent with the values obtained in other studies
using the BRT with female undergraduates. For example, Breskin (2) re-
ports a mean BRT score of 6.7 and a standard deviation of 3.0 for a sample
of 64 female undergraduate psychology students. 

Attention was first turned to the relationship between nonverbal rigidity 
and ordinal position of birth. Since four Ss failed to complete the information 
regarding birth order, there were 172 Ss available for this analysis. These 
172 Ss were divided into the following four groups: firstborns (n = 58), 
only children (n = 16), last borns (n = 31), and middle borns (n = 67).
A middle born was defined as any S who did not fall into any of the other
three categories. Distinctions were not made among second borns, third borns,
etc. The mean BRT score for each of the four groups was as follows: first-
borns, X̄ = 6.59 (s = 3.83); only children, X̄ = 7.19 (s = 3.61); last
borns, X̄ = 7.13 (s = 3.76); and middle borns, X̄ = 6.45 (s = 3.40). A
one-way analysis of variance indicated that the four means did not differ
significantly (F < 1).
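
As a rough check on the reported F < 1, the F ratio of a one-way analysis of variance can be recomputed from the group sizes, means, and standard deviations given above. The sketch below assumes that the reported s values are sample standard deviations (n − 1 denominator), which the text does not state; the function and variable names are illustrative only.

```python
# (n, mean, s) for each birth-order group, as reported in the text.
groups = {
    "firstborns": (58, 6.59, 3.83),
    "only children": (16, 7.19, 3.61),
    "last borns": (31, 7.13, 3.76),
    "middle borns": (67, 6.45, 3.40),
}


def anova_from_summary(summary):
    """Return (F, df_between, df_within) from per-group (n, mean, s)."""
    stats = list(summary.values())
    n_total = sum(n for n, _, _ in stats)
    grand_mean = sum(n * m for n, m, _ in stats) / n_total
    # Between-groups sum of squares: weighted squared deviations of group means.
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m, _ in stats)
    # Within-groups sum of squares, assuming s uses an n - 1 denominator.
    ss_within = sum((n - 1) * s ** 2 for n, _, s in stats)
    df_between = len(stats) - 1
    df_within = n_total - len(stats)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


f_ratio, df_b, df_w = anova_from_summary(groups)
print(f"F({df_b}, {df_w}) = {f_ratio:.2f}")
```

Under that assumption the ratio comes out well below 1, in line with the reported result.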

These preliminary results thus indicate that nonverbal rigidity, as measured 
by the BRT, is not related to ordinal position of birth. This finding does 
not eliminate the possibility that future research, using either (a) males or 
(b) different female populations from the one used in the present study, might 
find significant relationships between nonverbal rigidity and birth order. 
With regard to the first possibility, it is well to point out that interaction 
effects between sex and birth order have been found in a number of studies 
(e.g., 20, 24, 26). With regard to the second possibility, it is well to point
out that results based upon females attending a noncoed Catholic college 
might not be generalizable to other groups of females. It may be the case 
that females attending noncoed Catholic colleges differ systematically on im-
portant personality dimensions from females attending other types of colleges.
The Ss in this study, for example, were found (25) to differ significantly
(p < .01) from Edwards’ normative sample on 10 of the 15 EPPS needs. 
More specifically, they scored significantly higher on Autonomy, Succorance, 
Nurturance, Heterosexuality, and Aggression, and significantly lower on 
Achievement, Deference, Order, Dominance, and Endurance. 

Pearson product-moment correlation coefficients were computed between 
nonverbal rigidity and each of the 15 psychological needs measured by the 
EPPS. Twenty-four Ss, who either failed to complete the EPPS or scored 
lower than 10 on the consistency scale of the EPPS, were omitted from this 
analysis. The 15 correlation coefficients are consequently based upon a sample 
of 152 subjects. Nonverbal rigidity, as measured by the BRT, was found to 
correlate significantly with eight of the 15 EPPS needs. It was positively 
related to the needs for Deference (r = .448, p < .01), Order (r = .292,
p < .01), Abasement (r = .207, p < .05), and Endurance (r = .191,
p < .05), while it was negatively related to the needs for Autonomy (r =
−.232, p < .01), Change (r = −.263, p < .01), Heterosexuality (r =
−.162, p < .05), and Aggression (r = −.256, p < .01). The correlation
coefficients between nonverbal rigidity and the needs for Achievement (r =
−.052), Exhibition (r = −.097), Affiliation (r = .059), Intraception (r
= −.073), Succorance (r = .067), Dominance (r = .002), and Nurturance
(r = −.001) all failed to reach statistical significance (p > .05 in all
cases).
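
The significance levels attached to these coefficients can be checked with the usual t test for a Pearson correlation, t = r√(n − 2)/√(1 − r²), with n = 152. The sketch below is illustrative only: the critical values are approximate two-tailed entries for 150 degrees of freedom from a standard t table, an assumption of this example rather than figures reported in the study, and the function names are hypothetical.

```python
import math

# Approximate two-tailed critical t values for df = 150 (standard t table);
# these constants are assumptions of this sketch, not values from the study.
T_CRIT_05 = 1.976
T_CRIT_01 = 2.609


def t_from_r(r: float, n: int) -> float:
    """t statistic for testing a Pearson r against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


def significance_label(r: float, n: int = 152) -> str:
    """Classify r as 'p < .01', 'p < .05', or 'n.s.' (two-tailed)."""
    t = abs(t_from_r(r, n))
    if t >= T_CRIT_01:
        return "p < .01"
    if t >= T_CRIT_05:
        return "p < .05"
    return "n.s."


for need, r in [("Deference", 0.448), ("Endurance", 0.191),
                ("Heterosexuality", -0.162), ("Exhibition", -0.097)]:
    print(f"{need}: r = {r:+.3f}, t = {t_from_r(r, 152):+.2f}, "
          f"{significance_label(r)}")
```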

This pattern of correlations suggests that the person who was perceptually
rigid was also more likely to be psychologically rigid. The finding that non- 
verbal rigidity was positively related to the need for Order and negatively 
related to the need for Change seems to be consistent with Breskin, Gorman, 
and Hochman’s proposal (5) that the BRT may measure what has been 
termed “obsessive-compulsive intellectual rigidity” (23). An active search
for order and an active avoidance of change would appear to be characteristic 
of the behavior of many obsessive-compulsive personality types. The negative 
relationships between nonverbal rigidity and the needs for Heterosexuality 
and Aggression are of considerable interest. There would appear to be at least 
two theoretical explanations for this finding. It might be that people who are 
high in nonverbal rigidity, as compared to people who are low in nonverbal 
rigidity, actually do have heterosexual and aggressive needs that are less 
intense. An alternate interpretation, however, is that people who are high in
nonverbal rigidity have heterosexual and/or aggressive needs that are as
strong or stronger, but the behavioral manifestations of these needs are either
consciously suppressed or unconsciously repressed. In the present study there 
is obviously no basis for choosing between these two possible theoretical ex- 
planations. As one means of investigating the tenability of the latter explana- 
tion, it would be interesting to measure the needs for aggression and hetero- 
sexuality through projective techniques, and to compare their relationships to 
nonverbal rigidity with the negative relationships found in the present study. 

The Pearson product-moment correlation coefficients between nonverbal 
rigidity and the 20 self-rating adjective scales enumerated above all failed to 
reach statistical significance (p > .05). Thus, Ss high and low in nonverbal 
rigidity did not differ in the extent to which they rated themselves as popular, 
careful, honest, independent, happy, sincere, secure, strong, interesting, polite, 
mature, reliable, optimistic, ambitious, intelligent, unanxious, sociable, con- 
fident, creative, and productive. 


D. Summary 


This study investigated the relationship between nonverbal rigidity, as 
measured by the Breskin Rigidity Test (BRT), and (a) ordinal position of 
birth, (b) psychological need structure, and (c) self-evaluation on adjective 
rating scales. With regard to ordinal position of birth and self-evaluation
on adjective rating scales, no significant relationships were found (p > .05). 
With regard to psychological need structure, as measured by the Edwards 
Personal Preference Schedule (EPPS), a number of interesting relationships 
were found. Nonverbal rigidity was found to be positively related to the 
needs for Deference (p < .01), Order (p < .01), Abasement (p < .05),
and Endurance (p < .05), and negatively related to the needs for Autonomy
(p < .01), Change (p < .01), Heterosexuality (p < .05), and Aggression
(p < .01). Some theoretical speculations concerning these findings were 
advanced and discussed. 
A. INTRODUCTION


The recent literature contains studies showing the existence of a psycho- 
physical scale for velocity determination (5, 10, 11, 12, 13). In these studies 
of velocity scaling where ratings are used, the exponent in a Stevens-type 
power function (14) is generally about 1.0. Further, normal visual velocity 
judgments may be highly accurate (e.g., for baseball, tennis, automobile 
driving), but these judgments are not verbalized as specific magnitudes. The 
present study was designed to answer three questions: (a) With what preci- 
sion can subjects estimate real movement in degrees per second instead of by 
a rating? (b) Does the use of degrees per second alter the exponent? (c) 
What are the effects of practice? 
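
For readers unfamiliar with the exponent in question: a Stevens-type power function has the form ψ = kφⁿ, so the exponent n is the slope of log ψ on log φ. The following sketch estimates it by least squares; the velocities and judgments are hypothetical values chosen for illustration, not data from this study, and the function name is an assumption.

```python
import math


def fit_power_exponent(physical, judged):
    """Fit judged = k * physical**n by least squares on the logs; return (n, k)."""
    xs = [math.log(v) for v in physical]
    ys = [math.log(v) for v in judged]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx  # slope of the log-log line
    k = math.exp(mean_y - exponent * mean_x)  # intercept back-transformed
    return exponent, k


# Hypothetical real velocities (deg/sec) and judgments (not study data).
physical = [0.76, 2.0, 4.0, 8.0, 11.1]
judged = [1.0, 2.5, 4.5, 8.5, 11.0]
exponent, k = fit_power_exponent(physical, judged)
print(f"exponent = {exponent:.2f}, k = {k:.2f}")
```

An exponent near 1.0 corresponds to judged velocity growing in direct proportion to real velocity.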


B. AppaRATUS AND PROCEDURE 


Four subjects comprised the experimental group, and all had normal vision.
Two of these (MY and RK) participated in all the different stimulus con- 
ditions (21 sessions) and alternated as experimenters. A third (RB) received 
the “easiest” and the “hardest” session, and the fourth (EG) received only 
the “easiest.” MY, RK, and EG had participated in psychophysical experi-
ments on occasion, but were otherwise unpracticed in velocity estimates. RB
participated in psychophysical studies almost daily, but was unpracticed in
velocity estimations.

The range of velocities (.76 deg/sec to 11.1 deg/sec) probably included 
most of the effective continuum for our type of stimulus. Evidence for this is 
provided by the fact that higher speeds than those used in this study resulted 
in fusion and that the lowest speed and zero were occasionally confused with
each other.

The velocity scale used by the subjects was the actual speed of the stimulus
instead of retinal velocity. This allowed the subject to respond in whole 
numbers (zero to 85 deg/sec) and avoided the use of decimals. Throughout 
the experiment, subject-viewing distance was constant, and a linear trans- 
formation of the responses was performed in order to report the data in terms 
of retinal velocity. 

The subject’s display (visual angle subtense = 12°) was composed of an 
opaque strip of film with small perforations one inch apart and was lighted 
from behind. When the film moved in the dark, 11 to 12 dim dots were 
visible, but a wedge effect allowed the leading and trailing dots to appear 
and disappear gradually. This facilitated central fixation if the exposure was 
brief (<1.5 sec), but longer exposures resulted in pursuit eye movements 
(optokinetic nystagmus), particularly in the midrange of the stimuli em- 
ployed. 

Each of the two principal subjects (MY and RK) was tested on 21 occa- 
sions (sessions) ; all sessions took place in the morning, with only one session 
per subject on any given day. Since these two subjects served as experimenters 
for each other, they were tested each day in random order. Most sessions took 
place on alternate or consecutive days, but longer delays did occur. 

For each session the subject was seated comfortably; a chin rest positioned 
his head in space, 5.5 feet from the display. About one determination per 
minute was obtained, and the order of administration of the velocity levels 
was random. Broad-band noise (peaking at lower frequencies) was used to 
mask possible auditory cues that the equipment might provide. 

In a pretest session, the lowest and highest levels were used, and the sub-
ject was told these values in degrees per second. These served as moduli, and 
no other knowledge of results was received until Session 8, when that knowl- 
edge was provided after each determination. For a typical determination, the 
subject was seated with the room dimly lit; the room lights were then ex- 
tinguished, and a fixation point was seen in the dark. After that, the moving 
target was presented 1-1/2 degrees above the fixation point. Whether the 
fixation point remained on when the target was presented or not was de- 
termined by the stimulus condition (see Table 1). The subject therefore 
never spent more than 15 seconds in the dark (and usually much less). This 
was done because of the possible influence of dark adaptation on the corneo- 
retinal potential (1, 2, 4, 7, 8) which, while presumed not to be a factor in 
the present investigation, would be one on a related study of illusory move- 
ment (in preparation). 

Pearson product-moment correlation coefficients (r) and standard errors

TABLE 1
STIMULUS PARAMETERS FOR 21 VELOCITY ESTIMATION SESSIONS
(CURVES) FOR TWO SUBJECTS

           Knowledge                 Number of      Viewing       Number of
           of           Fixation     velocities     time          determina-
Session    results      point        used           (seconds)     tions

    1      Absent       Present         12             15            60
    2        ″            ″              ″              ″             ″
    3        ″            ″             24              ″            48
    4        ″            ″              ″              ″             ″
    5        ″            ″              ″              ″             ″
    6        ″            ″              ″              ″             ″
    7        ″            ″              ″              ″             ″
    8      Present        ″              ″              ″             ″
    9        ″            ″              ″              ″             ″
   10        ″            ″              ″              ″             ″
   11        ″            ″              ″              ″             ″
   12        ″            ″              ″              ″             ″
   13        ″          Absent           ″              ″             ″
   14        ″            ″             48              ″             ″
   15        ″            ″              ″              6             ″
   16        ″            ″              ″             1.5            ″
   17        ″            ″              ″              ″             ″
   18        ″            ″              ″              ″             ″
   19        ″            ″              ″              ″             ″
   20        ″            ″              ″              ″             ″
   21      Absent         ″              ″              ″             ″


of estimate (SE = SD√(1 − r²)) were used to describe the results. Since a
particular aim of this study was to improve the precision of velocity estima-
tions as much as possible, the task for the main subjects began with simple
determinations: (a) 12 velocity levels (.8 deg/sec apart); (b) a fixation
(reference) point; and (c) 15 seconds for a determination. Subsequently,
they were refined (see Table 1). In Sessions 3 to 7, 24 velocity levels
(.4 deg/sec apart) were used, with knowledge of the real velocity after each
determination added at Session 8. The latter was continued in subsequent
sessions up to the last one. In Session 13, the fixation point was removed. In
Session 14, velocity levels were increased to 48,² and in Session 15, viewing
time was reduced from 15 seconds to 6 seconds. Viewing time was reduced
further to 1.5 seconds in Session 16 and, as mentioned, knowledge of results
was removed in Session 21. It may be helpful to consider the study as a
whole as falling into six distinct experimental conditions: (a) Sessions 1 and
2 constituted a pilot study where only 12 velocity levels were used. (b) In
Sessions 3 to 7 there were 24 velocity levels. (c) In Sessions 8 to 12 knowl-
edge of results was added. (d) Sessions 13 to 15 served as a transition period
where the reference point was removed and viewing time reduced. (e) The
number of velocity levels was doubled at Session 14, and viewing time was
reduced to 1.5 seconds for Sessions 16 to 21. (f) Knowledge of results was
removed for the last session.

² Forty-eight velocity levels across this range were equivalent to steps approxi-
mately .2 deg/sec apart and probably represented the limits of the apparatus.


C. RESULTS AND DISCUSSION


Figure 1A shows the correlation coefficients between real and perceived 
movement for 21 sessions for the two principal subjects and for the two 
subjects who received an abbreviated schedule; Figure 1B contains the stan- 
dard errors. A general comment may be made that all correlation coefficients 
were high and did not appear to change systematically over time. Further, 
the day-to-day variability of the standard errors appeared to get smaller, but 
otherwise no systematic change was apparent. The data for subjects RB and 
EG are included in the figure to compare the performance of unpracticed 
subjects for the most difficult level (but with knowledge of results) used in 
this experiment. Subject RB was a highly trained observer in psychophysical 
determinations, and EG was not. The latter resembles the principal subjects 
in the experiment (male, graduate students in psychology), and the former's 
performance probably represents the best that can be expected from a highly 
sophisticated observer, but one not practiced in velocity determination per se. 
The second testing of RB showed performance equal to the highly practiced 
subjects (MY and RK) and suggested that practice effects, if they occurred, 
did so very rapidly. It is a shortcoming of this investigation that EG did not 
receive a second session, but his data were not analyzed before the equipment 
was disassembled. A point-by-point inspection of RB’s data showed asymptotic 
performance after the first 10 determinations in (his) Session 1. The same 
analysis on the first session for MY and RK showed a similar but less clear-
cut result.
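
The two statistics plotted for each session, the Pearson r between real and judged velocity and the standard error of estimate SD√(1 − r²), can be computed as in the sketch below. It assumes that SD is the standard deviation of the judged velocities computed with an n denominator, which the text does not specify; the sample values and function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def standard_error_of_estimate(x, y):
    """SE = SD_y * sqrt(1 - r^2), with SD_y using an n denominator (assumed)."""
    n = len(y)
    mean_y = sum(y) / n
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    r = pearson_r(x, y)
    # Guard against a tiny negative value from floating-point rounding.
    return sd_y * math.sqrt(max(0.0, 1.0 - r * r))


# Hypothetical real and judged velocities in deg/sec (not study data).
real = [1.0, 2.0, 3.0, 4.0]
judged = [1.0, 2.0, 3.0, 5.0]
print(round(pearson_r(real, judged), 3),
      round(standard_error_of_estimate(real, judged), 3))
```

With this definition, SE is expressed in the same units as the judgments (deg/sec), which is why it complements r as a measure of precision.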

Sessions 3 to 7 were preceded by two “warmup” and “probe” sessions which 
the experimenters had decided would dictate the next experimental condi- 
tions: (a) If the correlation (i.e., correspondence between real and apparent 
movement) was high and standard error low, the task would be made more 
difficult. (b) If the correlation was low or standard error high, additional
training and/or knowledge of results would be offered. In the first two
sessions, 12 equal interval levels were employed across a range of .76 deg/sec
to 11.1 deg/sec. Five determinations were made at each level. A reference 
point was used, and the subject viewed the moving stimulus until he was 
satisfied with his determination. This was always accomplished within 10 to 
15 seconds. 
FIGURE 1
A—CORRELATION COEFFICIENTS BETWEEN REAL AND PERCEIVED MOVEMENT IN
SUBJECTS MY AND RK FOR 21 SESSIONS, IN SUBJECTS RB AND EG ON
ABBREVIATED TESTING SCHEDULE; B—STANDARD ERRORS FOR
EXPERIMENTAL CONDITIONS AS IN A

Because the data for the first two sessions for both MY and RK showed
unexpectedly high correlations and low standard errors, the task was made
more difficult by doubling the velocity levels employed (i.e., halving the in-
terval). Two determinations per subject were then obtained in Sessions 3 to
7 at each of 24 velocity levels; viewing time and reference point were as
before. It may be seen from Table 2 that the correlation coefficients for MY
were high, and while RK showed more variability, no trends were apparent. 
There was also variability in the standard-error data, but an absence of any
trend (e.g., learning). The last three of these five sessions in this part of the 
experiment may be considered suggestive of improvement in subject RK, but 
it was a small change. 

The next group of five sessions followed the first group, but knowledge 
of results was provided after each determination. No change was anticipated 
in the magnitude of the correlation coefficients, but it was hoped the standard 
errors could be reduced. A reduction of the standard error was more evident 
for MY than RK, although some improvement was seen in both. The cor- 
relation coefficients remained about the same as those obtained from these 
subjects when knowledge of results was not provided. The results in these 
sessions (8 to 12) did suggest that the subjects had greater precision than
the task required; therefore the task was made more difficult. 

In three separate transition sessions the following changes were made: (a) 
Fixation (or reference) point was removed. (b) Number of levels was
doubled to 48. (c) The subject no longer had as much time as necessary for
his determinations (10 to 15 seconds), but was permitted to view the display 
for only six seconds. The first two of these transition conditions appeared to 
produce some reduction in the correlation coefficients and increased the stan- 
dard errors from previous levels (particularly the first). Any effect, if present, 
had disappeared by the third (Session 15) of these sessions, which also had 
the additional difficulty of reducing display time for each determination. The
data suggest that shortening the length of time the subject had to view the 
filmstrip may in fact have improved his performance. There are other parallels 
to improved performance with rapid judgments (for example, the proverbial 
first guess on a multiple-choice examination has a greater probability of being 
correct). However, the experimenters had thought that reduction in the 
amount of viewing time would make an already difficult determination more
difficult, and thus more variable. In velocity matching, for example, Gibson 
(6) allowed four minutes. 

At this juncture there were only two avenues available for making the
task more difficult: (a) to remove the knowledge of results (which might
introduce unnecessary contaminants, since the subjects could have developed
some kind of set or attentional shift as a function of this inclusion), and (b)
to reduce exposure time. No others were available because of apparatus limita-
tion.

Sessions 16 to 20 were conducted with 48 levels, no fixation point, and
1.5 seconds (instead of 6) for a determination. Except in the cases of Session
16 for subject MY, and Session 17 for RK, the correlation and standard 
errors remained fairly constant. The poor showing by MY on the first of 
these sessions may have been due to the increased difficulty of the task because 
of the 1.5-second determination. The smaller difference for subject RK in 
his second session is unexplained, but might be day-to-day variability. Indeed, 
in the absence of pronounced trends for all 21 sessions, the most important 
variable in this study may have been the state of the subject on a given day.
The terminal session was used to determine whether these subjects (MY
and RK) who had been trained with knowledge of results (Sessions 8 to
20) would be able to duplicate their reliable performance when this informa- 
tion was denied, or whether, as was expected, a dependence on knowledge 
of results had occurred. Table 2 shows that correlation and standard errors 
were at least as good under this condition, and perhaps better. It is not known 
whether special significance should be attached to this improved performance. 
If the difference is real, it might be due to the fact that knowledge of results 


TABLE 2
CORRELATION COEFFICIENTS (r) AND STANDARD ERRORS (SE) FOR THE TWO PRINCIPAL
SUBJECTS GROUPED BY EXPERIMENTAL CONDITION

              Subject MY              Subject RK
Session      r    SE (deg/sec)       r    SE (deg/sec)

   1        .96      .884           .95      1.005
   2        .95     1.197           .99       .460
   3        .95     1.161           .95      1.078

   4        .97      .857           .94      1.156
   5        .97      .823           .93      1.180
   6        .96     1.076           .95      1.066
   7        .97      .988           .97       .853

   8         ?       .844           .95      1.109

   9        .?3      .923           .92      1.256
  10        .96      .843           .98       .635
  11        .98      .647           .97       .805
  12        .?7      .691           .98       .673
  13        .93     1.112           .93      1.072
  14        .94     1.016           .96       .921
  15        .98      .685           .98       .669
  16        .91     1.224           .95       .910
  17        .96      .780           .92      1.052
  18        .96      .778           .95       .926
  19        .96      .917           .95       .946
  20        .95     1.010           .95      1.017
  21        .97      .776           .97       .748

served as a distraction. Evidence for this view comes from the fact that the
benefits of knowledge of results when introduced (Session 8) were of doubt- 
ful significance. On the other hand, performances on the terminal session may 
have profited by the increased motivation and attention coincident with its 
being the last one, and it is possibly analogous to the well-known end spurt 
in vigilance experiments. What seems to be important in this result, however, 
is that performance is not degraded by removal of knowledge of results. This 
indicates that knowledge of results is not necessary for the fine discriminations 
of velocity of which these subjects were capable. 

It should be realized that a primary purpose of this experiment was to 
obtain, by practice, maximum precision in a subjective scale of real visual 
velocity, in absolute angular units. With this knowledge it should be pos-
sible to study better certain forms of apparent movement (e.g., autokinesis,
oculogyral illusion, spiral aftereffect) and the differences between the two 
(cf. 9). Viewed in this way, the entire series of sessions in the present study 
may then be considered as a training program. To this end, therefore, both 
subjects were trained to make increasingly more difficult and more precise 
estimations of velocity and, at the same time, maintain a high level of reli- 
ability and validity in their estimations. The end result was two subjects who 
could carefully estimate real visual velocity and make confident and valid 
discriminations. Whether the error and just-noticeable-difference could be
reduced further awaits future study with more precise apparatus. The pre-
diction that humans can make precise estimations of velocity was mentioned
in the first section of this paper. That they could structure and verbalize this
ability so well was unexpected. The practical significance of the size of the 
standard errors and correlation coefficients and the knowledge of whether 
the ability to estimate real movement can be transferred to apparent move- 
ment are under study (15). 

It should be mentioned that while no major change in the subjects’ performances
occurred when the fixation point was removed (Sessions 13 to 21),
this should not be considered as lack of support for the Aubert-Fleischl para- 
dox (3). Although eye-movement records showed small pursuit movements 
(optokinetic nystagmus), these were of small magnitude and occurred only 
when, in the absence of a fixation point, the viewing time was long. Addi- 
tionally, the visual display in the present study was quite different from what 
would be used to test the differences between phenomenal perceptions of 
velocities with pursuit eye movements versus fixation: it was very small (12° 
of retinal angle) and dim, and there was a continuous series of spots that
increased in luminance to the center of the display and then decreased as they
left the center. The impression the subjects had was of “. . . spots of light
coming and going in space” and was phenomenally similar to the oculogyral
illusion; i.e., velocity without displacement.

Exponent values were not calculated for the 45 functions obtained in this 
experiment, since a 45° angle approximation of the curves shown in Figures 
2, 3, and 4 appears to fit all the data. Two facts are important in this regard: 
(a) All correlations were of a very high order and most (80%) were .95 
or higher, and (b) the range of values used by the subjects was equal to the
range of the real velocities presented. Correlations this high, where the range 
of values is equivalent, indicate that the exponent in a power function could 
not be much different from 1.0. This finding is concordant with other psy-
chophysical studies of velocity estimation where rating scales were used (G;
10, 11, 12, 13) and indicates that judgments and true velocities (cf. Figures
2, 3, and 4) are about equal.

FIGURE 2
REPRESENTATIVE CURVE FOR SUBJECTS MY AND RK (SESSION 5)

FIGURE 3
REPRESENTATIVE CURVE FOR SUBJECT RK (SESSION 18)
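
The reasoning about the exponent can be illustrated numerically. The following Python sketch is not part of the original analysis: it assumes a power function of the form judged = k * actual^n and estimates n as the least-squares slope of log(judged) on log(actual). The function name and the sample velocities are hypothetical and are not data from this study.

```python
import math


def power_exponent(actual, judged):
    """Estimate n in judged = k * actual**n as the slope of a log-log regression."""
    xs = [math.log(a) for a in actual]
    ys = [math.log(j) for j in judged]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Ordinary least-squares slope: covariance of (x, y) over variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Hypothetical velocities in deg/sec (illustration only).
actual = [1.0, 2.0, 4.0, 8.0]
judged_near_identity = [1.1, 1.9, 4.2, 7.8]      # judgments span the same range
judged_compressive = [a ** 0.5 for a in actual]   # judgments span a narrower range

n_identity = power_exponent(actual, judged_near_identity)
n_compressive = power_exponent(actual, judged_compressive)
```

When the judgments cover the same range as the presented velocities and track them closely, the estimated slope stays near 1.0; a compressed range of judgments yields a clearly smaller exponent.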


D. Summary 


This study was designed to measure how accurately subjects can estimate 
real movement in degrees per second (deg/sec) and to determine whether the 
precision of the estimates can be improved with practice. The display apparatus 
was composed of an opaque strip of film with small perforations and was 
lighted from behind. Two subjects were tested 21 times over a broad range 
of velocities, and two others received an abbreviated schedule of testing. No
systematic practice effects were noted across the 21 sessions, but some improve-
ment was observed within the first session. No consistent bias of over- or
underestimation was observed; the relationship between real and apparent
movement was linear. The subjects made discriminations among 48 velocity
levels, spaced .2 deg/sec apart; standard errors were approximately 1.0 deg/sec.

FIGURE 4
REPRESENTATIVE CURVE FOR SUBJECT MY (SESSION 18)


A FACTOR-ANALYTIC SYNTHESIS OF PERSONALITY
THEORIES: BASIC DIMENSIONS RECONSIDERED* 


Geneva College 


JAMES D. EVANS AND H. WALLACE SMITH


A. INTRODUCTION


In an attempt to identify basic dimensions for the comparison of major per- 
sonality theories, Schuh (4) performed a cluster analysis on the summary 
table of variables and theorists in Hall and Lindzey’s (1, p. 548) classic text- 
book. Schuh found that most of the variables used to describe personality 
theories could be accounted for by four dimensions, and that, likewise, the 
theorists themselves were reducible to four fundamental issues. Since Schuh’s 
analysis, Hall and Lindzey (2) have revised their volume, added four more 
variables to their summary table (p. 592), and replaced two of the original 
theorists with new ones deemed more central for the psychology of per-
sonality. This article reports a factor analysis of Hall and Lindzey’s revised
comparison of personality theories. Consideration is given to continuities as
well as changes in factorial composition since Schuh’s analysis of the first
table.


B. PROCEDURE


1. In their comparison table, Hall and Lindzey (2) rated each theory 
high, medium, or low on each of 22 important, substantive variables (V), 
depending on whether the theorist (O) stressed, was neutral towards, or de- 
emphasized the variable. In the current analysis, high, medium, and low 
ratings were assigned values of three, two, and one, respectively.

2. A principal-axes factor analysis was performed on the V matrix. Then 
the matrix was transposed, and an O factor analysis was carried out. The
principal-axes method of matrix condensation was used because, unlike Schuh’s
cluster analysis, it provides a unique matrix solution (3). The orthogonal
dimensions resulting from the V and O analyses, and having eigenvalues 
greater than unity, were rotated orthogonally to facilitate interpretation. All 
computations were performed by an electronic computer (IBM 1130). 

3. Each theorist was assigned a T score on each of the V dimensions in the
following manner: The theorist’s z scores on variables loading on a given
factor were weighted by their respective loadings and summed; this weighted
sum z score was converted to a T score with a mean of 50 and an SD of 10.
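
The scoring rule in step 3 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' IBM 1130 program: the function names and the sample z scores and loadings are hypothetical, and the population standard deviation is assumed for the rescaling because the text does not say which form was used.

```python
from statistics import mean, pstdev


def weighted_sums(z_rows, loadings):
    """For each theorist, sum the z scores weighted by the factor loadings."""
    return [sum(z * w for z, w in zip(row, loadings)) for row in z_rows]


def to_t_scores(raw):
    """Rescale raw scores to a mean of 50 and an SD of 10."""
    m = mean(raw)
    sd = pstdev(raw)  # assumption: population SD across the theorists
    return [50 + 10 * (x - m) / sd for x in raw]


# Hypothetical z scores of three theorists on two variables loading on one factor.
z_rows = [[1.0, -1.0], [0.0, 2.0], [-1.0, 0.5]]
loadings = [0.5, -0.5]  # hypothetical loadings, not values from the study

raw_scores = weighted_sums(z_rows, loadings)
t_scores = to_t_scores(raw_scores)
```

Rescaling after the weighted sum is what guarantees the stated mean of 50 and SD of 10, since a weighted sum of z scores is not itself standardized.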


C. RESULTS


1. The seven factors resulting from the V-analysis accounted for 87% of
the total variance. The first factor was composed of the following variables
(factor loadings are given in parentheses): learning process (.43), formal
analysis (.91), and heredity (−.41). This dimension represents the level of
abstractness entertained by the theorist. The second factor contained the
items: heredity (.69), organismic emphasis (.65), group membership deter-
minants (−.75), biology (.86), and social science (−.44). This dimension
closely matches Schuh’s “amount of change possible” cluster and is so labeled.
The Pearson r between this factor and Schuh’s counterpart is −.88.¹ The
third factor consisted of the variables: unconscious determinants (.79), re-
ward (.47), early developmental experience (.86), continuity of development
(.74), group membership determinants (.54), and social science (.82). This
dimension resembles Schuh’s “importance of self-concept” cluster and, thus,
is given the same name. In fact, it correlates −.88 with Schuh’s dimension.
The fourth factor to emerge was defined by the items: personality structure
(.74), multiplicity of motives (.88), and multiple mechanisms (.81). This
dimension pertains to the number of structures employed by the theorist. The
fifth factor loaded highly on the attributes: reward (.59), learning process
(.51), field emphasis (−.82), and uniqueness (−.76). This dimension re-
flects the importance the theorist places on learning. The sixth factor was
composed of the items: purpose (−.93), continuity of development (.44),
organismic emphasis (−.47), molar units (−.81), homeostatic mechanisms
(−.72), psychological environment (−.73), and self-concept (−.82). This
dimension corresponds roughly to Schuh’s “usefulness of self-concept” cluster
and correlates with it to the extent of .74. The seventh and last factor yielded
by the V-analysis contained the variables: contiguity (.91), learning process
(.44), and homeostatic mechanisms (−.59). This dimension refers to the
importance of contiguity in learning.

The results of a factor analysis are likely to change as new variables or
cases are added and when different techniques of matrix condensation are
used. In light of the possible changes, then, three of Schuh’s V dimensions
seem quite robust and probably represent the most basic themes
of personality theory. This continuity across factoring techniques, in spite of
the addition of four new variables and a few altered ratings, is especially
significant in view of the fact that such “off-center” theorists as Skinner and
Binswanger and Boss have been substituted in the revised table.


¹ The dimensional T scores of the 15 theorists common to Schuh’s analysis
and the present study served as the correlation data.

2. In addition to replicating three of Schuh’s four V dimensions (his 
“complexity of the process of change” cluster did not appear), the current 
analysis unearthed four “new” dimensions. The theorists’ T scores on all
seven dimensions are presented in Table 1 for purposes of comparison and 
review. 

3. The O-analysis produced five factors accounting for 78% of the total
variance. The first factor to emerge was composed of the following theorists
(factor loadings are given in parentheses): Binswanger and Boss (.74),
Miller and Dollard (−.49), Cattell (−.53), Rogers (.72), Goldstein (.92),
Angyal (.81), and Allport (.52). This dimension is named Self Emphasis,
as the unifying trend among the positively loaded theorists is their stress
upon the importance of the self or self-processes. Interestingly, the matter of
emphasizing vs. de-emphasizing the self and related constructs accounts for
over one-fifth of the variation among personality theories. The second factor
consisted of Fromm (−.88), Horney (−.88), Adler (−.61), and Freud
(−.51), and is labeled Psychoanalytic Emphasis. The third factor was de-
fined by Sheldon (.83), Murray (.48), Lewin (−.60), and Freud (.48),
and reflects a continuum of Biological Emphasis. The fourth factor was com-
posed of Binswanger and Boss (.48), Skinner (−.88), Miller and Dollard
(−.76), Murray (.50), and Allport (.56). This dimension polarizes theo-
rists on a continuum of Learning Emphasis. The fifth dimension was com-
posed of Sullivan (.79), Jung (−.77), and Freud (−.55). This final factor
is not readily interpretable and, thus, is not named.

Because of the multifaceted and complex nature of most personality theo- 
ries, the theorist factors are not easily interpreted. A given theorist may be 
characterized by his stance on several different issues. In the absence of a 
systematic, a priori rationale, the labeling of a theorist factor can generally 
proceed from any one of diverse angles. Therefore, the O dimensions probably 
are more ambiguous and of less descriptive value than the V dimensions, 
which can be defined more succinctly. In any case, the names applied to the 
theorist factors are only meant to be suggestive.

The results of the present O-analysis show that most of the variation 
among major personality theories boils down to five issues or philosophies. 
Three of these dimensions (Self Emphasis, Psychoanalytic Emphasis, and
Biological Emphasis) parallel Schuh’s Self Emphasis, Social Emphasis, and
third (unnamed) clusters, respectively. The emergence of a new Learning
Emphasis dimension comes as no surprise in view of Hall and Lindzey’s
addition of Skinner to their table. The fifth (unnamed) theorist factor re-
vealed by this analysis bears a slight resemblance to Schuh’s fourth (unnamed)
theorist cluster. It is apparent, in both Schuh’s study and the current investiga-
tion, that the Self Emphasis factor is the most important (in terms of variance
explained) and, perhaps, the most clearly identifiable theorist dimension.


D. SUMMARY AND CONCLUSION


This brief report presents the results of a factor-analytic synthesis of the 
variables and theorists in Hall and Lindzey’s (2) revised comparison table 
of major personality theories. The V and O dimensions identified in this 
study were compared with the dimensions found by Schuh (4) via a cluster 
analysis of Hall and Lindzey’s (1) earlier table. The O factors yielded by 
the present analysis were deemed to be of limited descriptive value, mainly 
as a result of the diversity of issues relevant to each theoretical position. The 
V dimensions, on the other hand, are more clearly interpretable than the O 
factors and seem to offer more utility to the student in search of theoretical 
synthesis. Moreover, the theorists’ T scores on these factors generally make
sense to the knowledgeable student of personality. The results of the V- 
analysis suggest that in addition to Schuh’s self-concept and amount-of-change 
clusters, the number of structures employed, the importance placed on learn- 
ing, and the level of abstractness entertained by the theorist describe the sub-
stance of major personality theories in fundamental ways. Overall, the V-
analysis indicated that the current superstructure of personality theory is more
complex than Schuh’s analysis would suggest.


SOCIAL STEREOTYPING OF MALE AND FEMALE BODY
TYPES WITH DIFFERING FACIAL EXPRESSIONS BY 
ELEMENTARY SCHOOL AGE BOYS AND GIRLS* 


Purdue University 
DONALD W. FELKER


A. INTRODUCTION


The presence of social stereotypes associated with body build has been dem- 
onstrated in varying populations. With boys from six to 10 years of age, 
Staffieri (10) found that common behavior traits were associated with vari- 
ous body types and that the mesomorph somatotype was associated with so-
cially positive behavioral descriptions. Lerner (7) found similar results in a
sample of males ranging in age from 10 to 20 years, and in another study 
(8) found that kindergarten males and females held a consistent aversion to 
“chubbiness.” Caskey and Felker (2) found that girls in grades one to five 
differed from boys in that the ectomorph form received favorable assignments, 
but did not differ in that the endomorph of both sexes received unfavorable 
assignments.

The well-established presence of these stereotypes has led to an investiga-
tion of variables that might moderate or interact with the stereotyping by
somatotype. Both Staffieri (10) and Caskey and Felker (2) found that the
body build of the subject did not influence the stereotyping. Johnson and 
Staffieri (6) investigated the potential stereotypic property of common, as
opposed to uncommon, names in a population of eight- to 12-year-old males. 
They found that the body build stereotype was more dominant than the 
stereotype related to boys’ names. 

Bradshaw and McKenzie (1) found that various facial features, such as 
mouth width and nose length, were differentially associated with stereotypic 
attribution in samples varying in age from five-year-olds to adults.

The present study investigated the stereotypic property of facial expression 
in combination with body builds in both boys and girls in same-sex and cross-
sex stereotyping situations. Facial expression was chosen for two reasons. Fa-
cial expression and attention to facial expression and characteristics have re-
ceived some attention in the literature. It has been consistently shown that
infants show a preference for human faces over nonsocial, geometrical pat-
terns (3, 5, 9). Facial expression also is a characteristic that should have
social connotations and perhaps is even more evident than body build because 
of its constant visual presence. While body build can be somewhat hidden 
by clothing, etc., faces are usually visible, and expressions unhidden. It would
be expected that children’s expectations of behavioral characteristics of
relatively unknown individuals would be strongly influenced by expression.


B. PROBLEM


Two questions were investigated in the present study: (a) Do children 
engage in social stereotyping in a cross-sex situation, and are the cross-sex 
stereotypes the same as the same-sex stereotypes? (b) Do facial expressions 
of frowning and smiling influence the stereotyping commonly associated with 
body type? 

C. METHOD


1. Subjects 


All children in the first, third, and fifth grades of an elementary school in 
a rural area of Indiana were used for subjects. Subjects who moved during 
the testing or children transferring into the school during the study were not 
included in the sample. The total sample included 27 grade one boys, 23 
grade three boys, 25 grade five boys; 17 grade one girls, 21 grade three girls, 
and 24 grade five girls.


2. Procedures


All children were weighed and measured for height by the experimenters
in one day at the school. A ponderal index (height in inches divided by cube 
root of weight in pounds) was used as the measure of body type. Each child 
was tested individually to determine stereotyping behavior. Two sets of body 
forms, one male and one female, were used. The body forms were 12-inch 
high, self-standing, front-view black silhouettes with a white background. 
Each sex contained two each of the ectomorph, mesomorph, and endomorph 
body types. One of each body type had a “smiling face” plastic overlay, and 
the other of each body type had a “frowning face” overlay.
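
The ponderal index defined above (height in inches divided by the cube root of weight in pounds) can be computed as in the following Python sketch. The function name and the sample height and weight are hypothetical; the text gives no cutoff values for assigning body types, so none are assumed here.

```python
def ponderal_index(height_in, weight_lb):
    """Height in inches divided by the cube root of weight in pounds."""
    if height_in <= 0 or weight_lb <= 0:
        raise ValueError("height and weight must be positive")
    return height_in / weight_lb ** (1.0 / 3.0)


# Hypothetical child: 48 inches tall, 64 pounds (cube root of 64 is 4).
example_index = ponderal_index(48.0, 64.0)
```

A taller, lighter child yields a larger index, so higher values correspond to more linear builds and lower values to heavier builds.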

The child was brought to the testing area and was told that the experi- 
menter was going to ask his opinion about some imaginary people and that 
the six silhouettes were going to be the imaginary people. Each child was 
asked to tell how the forms differed. The experimenter pointed out the differ- 
ences in body and face if the child did not point out these two characteristics. 
The child was then asked a series of questions containing adjectives; e.g.,
“Which one is strong?” The child indicated his choice by pointing to a
silhouette. Forty adjectives which were previously used by Staffieri (10),
Felker (4), and Caskey and Felker (2) were used. The children were 
randomly assigned by sex to sex of silhouettes and sex of experimenter. 


3. Analyses 


Previous studies have used chi square analysis to determine the differential 
assignment to body types. Since this study dealt with a number of variables
in which interactions might appear, it was determined to use continuous 
rather than discrete scores. Each positive adjective was assigned a value of 
+1 and each negative adjective a value of —1. A score for each silhouette 
was computed for each subject by totaling these values and adding 24 to all 
resultant scores in order to eliminate negative values. Thus a higher score 
indicated a higher ratio of positive adjectives assigned. A series of analyses 
of variance were conducted to test the differences in positiveness of adjective
assignment to body types.
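
The scoring scheme described above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure actually used: only “strong” is taken from the text, while the other adjectives, their valences, and the child's choices are hypothetical.

```python
OFFSET = 24  # constant added to every total, as described in the text


def silhouette_scores(choices, valence, silhouettes):
    """Total +1/-1 adjective values per silhouette, then add the offset."""
    scores = {s: OFFSET for s in silhouettes}
    for adjective, chosen in choices.items():
        scores[chosen] += valence[adjective]
    return scores


# The six silhouettes of one sex: three body types by two facial expressions.
silhouettes = [
    (body, face)
    for body in ("ectomorph", "mesomorph", "endomorph")
    for face in ("smiling", "frowning")
]

# Hypothetical valences and one hypothetical child's choices.
valence = {"strong": 1, "kind": 1, "lazy": -1}
choices = {
    "strong": ("mesomorph", "smiling"),
    "kind": ("mesomorph", "smiling"),
    "lazy": ("endomorph", "frowning"),
}

example_scores = silhouette_scores(choices, valence, silhouettes)
```

Because each silhouette receives its own continuous score, body type and facial expression can both be entered as factors in an analysis of variance.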


D. RESULTS


1. In all comparisons, both when subjects were responding to like-sex 
silhouettes and when subjects were stereotyping opposite-sex silhouettes, there 
were significant differences between the positiveness of assignments to body 
types. For both boy and girl subjects of all ages and male and female silhou- 
ettes the differences were significant at the .01 level. In all comparisons the 
differences were due to the greater number of negative assignments to the 
endomorph than to the mesomorph and ectomorph. For first-grade girls there 
was a significant interaction (F = 3.10, df = 2, 90, p < .05) of silhouette 
body type by sex of silhouette. This interaction was due to the higher rating of 
the female linear silhouette and would be consistent with Caskey and Felker’s 
(2) finding that the linear body type is preferred by females. A highly similar 
interaction was found for fifth-grade boys (F = 7.50, df = 2, 138, p < .001) 
and could reflect the later social development of boys.

2. In all comparisons between the smiling and frowning silhouettes there 
were large differences (p < .001), with the smiling silhouettes receiving
more favorable adjective assignments. Two interactions involving the smiling- 
frowning attribute reached significance. Female smiling silhouettes received 
more favorable adjective ratings by grade three girls than did male smiling 
silhouettes, and male frowning silhouettes received more favorable ratings 
than the female frowning silhouettes (F = 6.59, df = 1, 114, p < .01). In
the fifth-grade female sample there was a significant silhouette body type by
smiling-frowning interaction (F = 9.25, df = 2, 132, p < .001) because
the smiling linear silhouette received higher ratings than the other smiling
silhouettes, and the frowning balanced silhouette received higher ratings than
the other frowning silhouettes.


E. Summary 


In a sample of first-, third-, and fifth-grade boys and girls it was found 
that there was consistent stereotypic assigning of adjectives related to both 
body type and facial characteristic. In all comparisons, for both male and 
female subjects at all grade levels, the endomorph body type received the 
least favorable adjectives. First-grade girls rated the linear female form more 
favorably than the other female forms. A similar effect with boy subjects did 
not appear until grade five, perhaps as a result of boys’ later development of 
sex specific judgments. 

It was also found that there were large differences in rating associated with 
facial characteristic, with the smiling forms receiving the more positive ratings. 
The interactions between body type and facial characteristic for the female 
samples once again indicated that female subjects tended to make more sex- 
specific ratings than their male counterparts. Although there were some 
differences, the most striking finding was the high consistency of stereotypic 
behavior across grade, sex of subject, and sex of body form.


SOME PSYCHOLOGICAL DETERMINANTS OF ALPHA
ATTENUATION AFTER PATTERNED FLASHES* ? 


Laboratory of Clinical Psychology, Hokkaido University, Japan 


MINAMI KANON AND SHOJI KITAJIMA


A. INTRODUCTION


Since the beginning of electroencephalography, there have been numerous 
experiments made with the object of discovering the relationships between 
alpha attenuation and (a) the attentive state, (b) vigilance, or (c) cerebral
arousal supported through the subcortical system under sensory stimulation. 

In recent years, several works have described the temporal and spatial 
distribution and correlation of amplitude and frequency in the alpha rhythm 
pattern, while some studies have analyzed stimulus change (13), stimulus 
characteristics (1), and tasks and attitudes as effective factors in alpha attenua- 
tion and even in enhancement (10). There still remains some doubt as to the 
validity of trying to establish a necessary connection between alpha attenuation 
and external stimuli; there seems to be general agreement that alpha attenuation 
is not directly dependent upon the stimulus parameters, but upon an arousal 
reaction of a nonspecific nature. The heart of the matter is to specify psycho-
logical and physiological conditions that determine unspecific arousal activities.

One physiological approach to this problem is to trace the interrelationships 
between the neuronal discharges in postsynaptic activities of visual cortex 
neurons, in visual evoked potentials, and in background activities. Another 
psychological approach may be to analyze the temporal changes in alpha 
attenuation affected by simply constructed stimuli of limited parameters, 
while other stimuli are held stationary. One of the attempts to do this is 
found in “the stabilized-image situation” (7). 

Under normal visual conditions, we inquired into the relationship between 
pattern cognition and alpha attenuation, by means of presentation of visual 
stimuli of shorter duration than the latency of neuronal discharges, while the 
other environmental stimuli were held as invariable as possible. This experi-
mentation was designed to examine those problems quantitatively; that is,
(a) comparison of the characteristics of the alpha rhythm pattern following
patterned flashes of varied discriminability with those that followed common 
diffused flashes; (b) the relationships between the alpha attenuation response, 
the visual percept inferred from subjective reports, and the inferred structure 
of stimulation; and (c) the visual conditions in which the attained habituation 
to one stimulus can be transferred to another following stimulus. 


B. MATERIALS AND METHOD
1. Subjects 


The subjects were 32 graduate and undergraduate volunteers, male and 
female, ranging in age from 18 to 28 years, selected from among 40 candidates. 
(Eight were discarded because their alpha amplitude recorded in the resting 
period was lower than 20 µV, or because of technical failures.) All subjects
had normal or corrected vision and little anxiety towards the experimental 
situations. Each subject was alone in the half underground, semianechoic room 
which was electrically shielded and air-conditioned to about 23°C. Noises 
generated from experimental operations were perfectly cut off inside the 
room. The subject’s movements were monitored by means of electromyogram, 
electro-oculogram, galvanic skin response on the left palm, minor tremor on 
muscularis orbicularis oculi, and microphones placed near him; noctovision 
was applicable if necessary. 


2. Stimuli 


Environmental stimuli were reduced to a minimum, raising as abundant an 
alpha rhythm of the subject with his eyes open as with his eyes closed. Experi-
mental stimuli as single or paired flashes were created by two-channel slide
projectors outside the room. The light sources were delivered independently
through two Nihon Kohden photic stimulators. Stimulus-to-stimulus and 
trial-to-trial intervals were operated by a preset Sanei time-controller and 
monitored with a cathode ray oscilloscope and pen recorders. 

The xenon lights were 50-60 µsec in duration; the luminances were almost
equal, and the recovery ratios during repetition were 90-95 percent at 50
msec intervals and almost fully over 500 msec. Flashes were projected on a
front-flattened translucent screen shaped like a dome covering the visual 
field of the subject. The screen was made of a frosted Servic resin and inclined 
to 75° of the same gradient with the subject’s body on a reclining seat. 

Stimuli appeared against the center of his natural vision at 84 cm distance 
from his eyes. The luminance was .01 to .1 mL, and the screen was not
perfectly dark after the flashes. The patterns were white-on-black (or colored
in a few cases) with letters and geometrical and pictorial figures made by a 
single flash or by paired flashes of the size 1°41’ to 7°30’, mostly 4°30’ of the 
visual angle, except in the cases of the diffuse masking light, and a random 
pattern consisting of random arrays of small geometrical figures, which ex- 
panded over the screen and faded out at its margin. 


3. Apparatus 


A Nihon Kohden Model ME-131-A, 13 channels electroencephalograph, 
an alpha wave analyzer, and an averaged transient analyzing computer ATAC 
201 were used. The EEG recordings were induced from vertical, temporal, 
and mainly occipital areas, and linked earlobes as a common reference; that is, 
O,-Ay, Oz-Ag, O1-Oz, Cz-Ai, C5-A1, or Ce-Az in the 10-20 system. 

Each silver disk electrode was attached by means of bentonite paste and 
plasters; each resistance was 5 kΩ or less and was constant throughout the
sessions. 

While observing each subject’s alpha dominancy, one of Oz-Ay or Oy-Ai 
and one of Cg-Ay or Cs-Az were analyzed through the alpha wave analyzer 
which consisted of band-pass filters, a preset threshold slice circuit of the filter 
outputs, and an analogue-digital converter. The number of alpha waves in 
various amplitudes was automatically coded into countable pulses: e.g., four 
amplitude levels were discriminated (zero to 5 µV, zero to 10 µV, over 10 µV,
and over 20 µV), and the waves over 10 µV level were scored here.


4. Scoring


The pulse number of the alpha waves after flashes, denoted as “alpha score,”
was counted in two seconds, starting from the initial alpha attenuation. In
this report, median, mean, and standard scores (Z scores) of the raw alpha
scores were used. Though it was not at all evident that the alpha scores of
each stimulus sequence in each subject should distribute in a Gaussian form, 
the alpha scores were transformed into the Z scores because it was necessary 
to compare the data of each subject on a common base. The parametric methods 
on the Z score and the nonparametric Wilcoxon matched-ranks test (15) 
were applied to the data for statistical tests of hypotheses. 
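
The Z-score transformation mentioned above can be sketched in Python as follows. This is a minimal illustration, not the authors' procedure: it assumes that each subject's raw alpha scores are standardized against that subject's own mean and sample standard deviation, and the function name and pulse counts are hypothetical.

```python
from statistics import mean, stdev


def z_scores(alpha_scores):
    """Standardize one subject's raw alpha scores to mean 0 and SD 1."""
    m = mean(alpha_scores)
    sd = stdev(alpha_scores)  # assumption: sample SD within the subject
    return [(x - m) / sd for x in alpha_scores]


# Hypothetical pulse counts for two subjects with different baseline levels.
subject_a = [10, 12, 14]
subject_b = [30, 36, 42]

z_a = z_scores(subject_a)
z_b = z_scores(subject_b)
```

The two hypothetical subjects differ threefold in raw counts yet receive identical Z scores, which is the sense in which the transformation places subjects on a common base.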


5. Procedure 


Experimentation was divided into four series. The experiments were com-
posed of five or six categories of stimulus-sequences. In each series, subjects
and stimuli were changed in order to reject the effect of individual and
stimulus specificities. The subject was instructed to hold his gaze naturally
with his eyes open. He was asked what he had seen after each sequence.
Stimuli were exposed at random or in a regulated manner. This paper is 
mainly concerned with the data from the regular repetitions of 30 trials at 2.5 
sec intertrial intervals because the same tendency had been found in the ran- 
dom one. 

a. Preliminary series (N = 13 out of 21). Patterns of variable complexity 
involved in reflected figures and the actual objects were tested under supra- 
or subliminal light intensities. After several presentations of partially visible 
patterns containing subliminal contents, they were lighted with 5 to 30 c/sec 
flickers for 2.5 sec, then again returned to the former state. During flickering, 
the substructures became visible; thereafter they were again invisible as before. 
The same design was performed with blank light as a reference session. 

b. Main series (I, N = 7; II, N = 9 out of 10; and III, N = 3). Session 
1: Each pattern and blank light were repeated independently. Session 2: In 
the simple sequences, paired stimuli consisting of the primary pattern and the 
following diffused light were presented with a variable interstimulus interval 
(ISI). At 50 msec ISI, the pattern was masked, but at an ISI over 200 
msec, it became visible. In compound sequences, an additional pattern, the same 
as or dissimilar to the primary one, followed the simple sequence. Session 
3: At first the large random pattern was repeated. Then paired stimuli were 
presented; the random pattern followed the small primary pattern at a variable 
ISI. At 50 msec, the primary pattern was masked, but at 200 msec, it remained 
visible. Session 4: Paired patterns in which a pattern combined with 
another pattern were presented. At 70 msec ISI, they appeared stable as a 
unitary pattern, but at 200 msec they seemed to be separate or a moving 
pattern. 


C. Results 


Except in a few cases, the percepts the subjects reported were coincident 
with those inferred from the experimental procedures. The data dealt with here 
are from the occasions where the two were compatible. In every stimulus-sequence, 
the alpha rhythm, including a zero to 5 µV amplitude rhythm, 
attenuated radically in the initial several trials; then it developed fluctuatingly. 
In the initial 10 trials, the differences among the mean alpha scores for different 
stimulus groups were statistically significant, F (31, 256) = 4.21, p < 
.001. In the intermediate 10 trials, the differences were less significant, F (31, 
256) = 1.64, p < .05, and during the last 10 trials every sequence elicited 
similar and high alpha scores in which alpha attenuation tended to be minimum 
(Table 1). 

In general, the mean duration in alpha attenuation for the paired flashes 
to appear as a single pattern was longer than that for them to be diffused. 
The same tendency was confirmed in the mean alpha scores (Figure 1, A and 
B). These tendencies might be concordant with the data concerning alpha 
attenuation with eyes closed, as reported in Wilson and Wilson (16) with 
single flash, and in Morrell and Morrell (11) with double flashes. 

The following concerns the data of the initial 10 trials. Each difference 
between two relative sequences was significant in the Wilcoxon test for the 
medians (p < .02 or less). 

1. There was a significant correlation between the discrimination of 
stimuli and the degree of alpha attenuation. The higher the complexity of 
the pattern perceived by the subject, the lower the alpha score was. In paired 
flashes, the primary pattern was masked with the diffused flash at 50 msec 
ISI (A group). The pattern was visible in reduced contrast at 200 msec ISI 
(B group), and it combined with another pattern at 70 msec ISI (C group). 
The alpha scores were highest in A and lowest in C. Differences among those 
were statistically significant, F (2, 24) = 40.50, p < .001. 

2. As the random pattern was repeated in 60 to 90 trials, the alpha score 
for it, even if the pattern was very complex, increased as much as that for the 
diffused light. After the random pattern had been associated with the primary 
pattern at 200 msec ISI and seen as background for it, the alpha score for 
the next random pattern rapidly increased. When the habituated random pattern 
followed its antagonistical visible pattern at 200 msec ISI, a radical 
alpha attenuation occurred, but when the antagonistical pattern was not 
visible at 50 msec ISI, the alpha score was equivalent to that for the repeated random 
pattern. The difference between them was statistically significant, t = 4.53, 
p < .001. 

3. Habituation to the pattern in a sequence could be transferred to the 
same single pattern in the succeeding sequence so far as the primary pattern 
was perceived (mean of the alpha score = 45.3, SD = 27.96). If the primary 
pattern was masked, alpha attenuation in the succeeding same pattern was 
noticeable (mean = 19.0, SD = 19.38). The difference between them was 
significant, t = 3.38, p < .005. 


D. Discussion 


The other data obtained through presentation of similar light flashes in 
our laboratory indicated that the average critical periods required to recognize 
paired flashes as a single flash with a probability of .5 were 70 msec for the 
diffused light and 150 msec for the patterned light. The former was stable 
until 50 msec, then radically lost its apparent unity; the latter was stable 
until 150 msec, and the stability decreased slowly. 

[Figure 1. Panel titles: mean duration of alpha attenuation (A); mean alpha score.] 

The activities maintaining these images to be invariable for a time may 
have the property of “temporal quantization” as described in the work of 
Boynton (3). In the perception of simple flashes, the main packag[ing] 
workers have been found in a photochemical process of the retina, known as 
Bloch’s law, and in the neural summation of the peripheral and central nervous 
system. Reciprocities in the duration and the intensity of light may be the cause 
of the phenomenon in which a pattern was masked by a diffused light following, 


it seems to be difficult to explain the data in this paper by his theory, where 
masking occurred at the appropriate ISI and, on the contrary, each stimulus 
can be identified at simultaneous exposures. 

It is noticeable, further, that pattern recognition is less veridical to the 
fluctuation of the sensory input than is photic perception. The fact that 
masking occurs between two patterns in dichoptic viewing indicates that 
it mainly depends upon some temporal quantization of central processes. 

Paired stimuli given at 50 msec ISI might have interfered with each other, 
and then reading out of the primary pattern might be stopped. Consequently 
it would cut off the subjective experience of seeing the pattern and arrest the 
alpha attenuation. Donchin and Lindsley (4) reported that the averaged 


in luminance (9000 mL) than the masked one (.9, 9.0, and 90 mL), and 
masking occurred when 


limen of the primary stimulus was elevated by the luminance summation evoked 
by the brighter stimulus that followed. In perception, however, summation 
should not be limited to the peripheral events; rather it is explained by the 
temporal interaction processes of the neural networks. It can be also due to the 

reflected with a xenon light, recording the AEP. In general, the more “contrast 
rich” the pattern-composition was, the higher was the amplitude of the 
“late wave.” 

This information on the AEP is equivalent to the “classical” findings by 
Adrian on the diffused vision in which occurrence of alpha attenuation decreased. 

Resemblances between the (a) late component of the time-locked compound 
evoked potentials and the (b) alpha attenuation in relation to stimulus 
properties, subjective states, and their temporal changes, suggest that 
cortico-subcortical reverberating circuits with the peripheral feedback system are 
autochthonously in action, though the actual relationships among them are 
still not understood. Bourne et al. (2) recorded the topography of the visual 
evoked responses on the occipital scalp. The report indicates that the temporal 
variation of the spatial potential gradient has a dynamic rotational tendency. 
It is toward anterior areas from the inion in the initial 120 msec after flashes, and 
it reaches a major peak just above the inion through a quick rotation in 
190 or 310 msec after flashes. These are in agreement with the data and the 
model proposed by Rémond (12) for the alpha rhythm pattern. 

Long and complicated neuronal constellations elicited by pattern stimulus 
do not always depend upon physical intensity of the stimulus. We propose here 
some speculative model “schematization” which patternizes the 
disturbances. It may develop with stimulation, connect with other cortical 
functions, and finally generate the percept for a time (that is, the subjective 
experience), and during this period it may keep the visual image as invariable 
against the fluctuation of sensory input, and compensate, as a conceptualization, 
a loss of sensory input resulting from diminution of evoked activities by 
repetitive stimulation. As MacKay (9) so well pointed out, highly redundant 
sensory inputs may be necessary for the maintenance of perception as stable 
because the redundancy would make the adequate selection among sensory 
inputs, and thereafter the formation of invariant representation of the stimulus 
would be possible. There seems to be need of some incubation period for the 
schematization. The subliminal pattern within the visible contour became 
visible if it was presented 6 to 30 times per second. It suggests that even a 
faint stimulus, which evokes too few neuronal discharges to elicit the percept 
and alpha attenuation, would be given several times in its utilization period. 
At the end the input could actualize the whole schematic mechanism through 
summation and integration. 


Libet et al. (8) reported that threshold conscious experience of sensation 
was elicited by 30 to 60 c/sec electrical stimulation of the postcentral gyrus 
in the lower or liminal threshold intensities. To elicit the sensation, it needed 
a .5 sec or more utilization period, and the same subjective intensity continued 
during the prolonged stimulation of .5 sec or more after the period. 

The mechanism may consist of multifold activities. In its simple reactive 
stage, the responses seem to be more veridical to the physical parameters of 
the stimulus; e.g., the simple reaction time for the masked primary stimulus is 
similar to that for the visible one and is shorter than the latencies of the alpha 
attenuation. In its later stage, some still unknown processes, in which the 
participation of the dominant hemisphere should take place (and masking 
interference with percept-relevant processes could do also), should modify 
these simple responses and elaborate the perceptual experiences. This stage 
may consist of a releasing mechanism of the adapting behavior of the organism 
and have a close relationship to the alpha rhythm as its homeostatic ready 
state. The actual and specific nature of this stage is, of course, still to be 
developed. 

During alpha attenuation, schematization may have two stages: a formative 
enrichment and a stabile regulative process. Alpha attenuation may reflect 
the former and its restoration, alpha habituation, the latter. The higher the 
requirement of the stimulus, the longer the time course necessary for the 
enrichment. 

Even if the issues on alpha attenuation that have been raised by Berger and 
Adrian have remained unsolved, the occipital alpha attenuation might be con- 
sidered to be fairly functional to the subjective state related to the operational 
visual stimulation. 


E. Summary 


Under the condition of reduced external stimuli, subjects were exposed to 
a sequence (30 trials and 2.5 sec intertrial intervals) of the pattern consisting 
of two xenon flashes. Corresponding with the subjects’ reports, occipital alpha 
rhythm was measured with variable steps of amplitude and frequency for 2 
sec after the presentation of the stimuli. Alpha wave numbers over 10 µV 
from each subject after the presentation of the paired stimuli were compared 
with one another on the basis of its medians, means, and standard scores. 

1. After the presentation of the pattern, alpha attenuation was higher 
than after the presentation of the diffused flashes. It was noticeably high in 
the initial 10 trials, and alpha wave scores obtained from various patterns were 
significantly different. After 20 to 30 trials or more, the differences among 
the scores decreased and finally reached a score similar to that obtained when 
the diffused flash was used as a common stimulus. This diminution was remarkable 
after the pattern had been seen as the background of another. 

2. Alpha attenuation was affected by the pattern-complexity of various 
pairs of flashes, and alpha habituation was transferred to the same pattern 
that was presented in a succeeding sequence, provided that the primary pattern 
was perceived. 

3. With regard to the data of masking time, critical fusion frequency, and 
evoked potentials to the pattern, some suggestions were made concerning a 
temporal mechanism “schematization.” 


AN INVESTIGATION OF THE ATHLETIC PERSONALITY 
TYPE HYPOTHESIS* * 


University of Santa Clara 


Roland Lowe and Ron Sani 


A. Introduction 


A recent article by Fletcher and Dowell (2) reported results showing that 
high school athletes revealed significantly more aggression and dominance than 
nonathletes. The Edwards Personal Preference Schedule was the psychological 
instrument used in the study. The subjects in the study represented many 
sports. The work and the articles the authors discuss apparently assume that 
an athletic personality exists which distinguishes athletes from nonathletes. 
The authors cite several articles supporting this assumption. For example, research 
by Behrman (1) using swimmers and nonswimmers found the swimmers 
to be more sociable and outgoing in comparison to the somewhat shy and retiring 
nonswimmers. Seymour (5) found that little league baseball players 
develop a higher level of social acceptance than nonparticipants. 

The present study is an investigation of aggression and achievement 
tendencies of athletes, and reports results conflicting with the first studies 
mentioned, particularly the study of Fletcher and Dowell (2). Also, these conflicting 
results are discussed in terms of a theoretical rationale other than that 
of the athletic personality. 


B. Method 


Data for this study were obtained from 30 football players on the varsity 
team at the University of Santa Clara, divided equally between offensive and 
defensive players. The players were selected by their coaches as the 30 best 
players on the varsity squads. The Adjective Check List (3) was the 
psychological instrument used in this study. 


C. Results 


The Ss’ average T scores on the aggression scale and the achievement scale 
were 52.2 and 50.9, respectively. As scores should be outside the 40-60 T score 
range to reflect the variable being either very low or very high, the results 
indicate that aggression and achievement orientation are not distinctive aspects 
of these football players. As a further test, the players were ranked by their 
coaches from 1 to 15 according to their overall ability and performance on 
either offense or defense. Spearman rank correlations between coaches’ rating 
and both aggression and achievement were calculated. For the 15 offensive 
players, aggression and achievement correlations with the coaches’ ranking 
were .19 and −.25, respectively, while for the defensive players the same 
correlations were .15 and −.08. Thus even the outstanding football players 
were not characterized by greater aggression or achievement as measured by 
this test. 
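
As an illustration of the rank correlations reported above, the following minimal sketch computes a Spearman coefficient with the classical formula 1 − 6Σd² / (n(n² − 1)), which is valid only when there are no tied values. The function names and the five-player data set are hypothetical; the study's own rankings and T scores are not reproduced here.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman rank correlation for untied data: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length of at least 2")
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical example: coaches' ranking (1 = best) and aggression T scores
# for five players. These numbers are invented for illustration only.
coach_rank = [1, 2, 3, 4, 5]
aggression_t = [55, 48, 60, 52, 47]

rho = spearman_rho(coach_rank, aggression_t)
```

Note that the sign of the coefficient depends on how the ranking is coded: if rank 1 denotes the best player, a negative coefficient means that better-ranked players tend to have higher scale scores.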


D. Discussion 


With reference to football players, these results are not consistent with 
the general hypothesis that athletes differ from nonathletes in such personality 
characteristics as aggression and achievement. The clear differences with the 
findings of Fletcher and Dowell (2) may be a function of several factors. 
First, their subjects were high school students; second, their subjects came 
from a variety of sports; and third, they used a psychological test quite differ- 
ent in both method of construction and theoretical rationale. However, regard- 
less of these important differences, the present results are inconsistent with the 
assumption of general personality differences between athletes and nonathletes, 
particularly with respect to aggressive and achievement tendencies. 

Rather than being a reflection of an aggressive personality, the football 
player’s aggressive behavior may be more a function of the situational deter- 
minants common to football games, wherein aggressive and achieving behavior 
are highly rewarded. In their analysis of the dynamics of aggression, Megargee 
and Hokanson (4) assert that the situational factors producing aggression have 
been studied the least in comparison with the attention other factors have 
received. In the same work they also offer the following view of a football 
game: “No one who has attended a college football game needs to be con- 
vinced of how the reactions of the crowd can facilitate the socialized aggression 
of the players” (p. 160). 

The present study restricted its focus to one sport, and thus generalizations 
across sports are not indicated. However, the failure to find distinctive per- 
sonality traits characterizing football players in itself suggests limitations of 
the athletic personality hypothesis. The situational determinants hypothesis 
is one useful alternative approach to shed light on the athlete’s behavior both 
during and after the game. 


E. Summary 


Athletes and nonathletes were not found to differ on aggression and achieve- 
ment tendencies as measured by the Adjective Check List. The results were 
discussed in terms of the situational determinants of the athletes’ behavior as 
compared with the hypothesis of an athletic personality type. 


DOGMATISM AND PREFERENCE IN 1970 
TENNESSEE SENATE CAMPAIGN* 


Memphis State University 


C. V. Rosen and Charles T. Kenny 


A. Introduction 


Since the development of the California F Scale (1) investigators have 
frequently examined relationships between the authoritarianism (measured 
by the F Scale) of the voter and his preference for political candidates and 
philosophies. Evidence in the literature has thus far been inconclusive. Some 
researchers have offered supporting data (6, 10, 12), while others (7, 11) found 
no such support. 

Rokeach (9) advocated the concept of dogmatism in order to provide a 
generalized approach to authoritarianism without regard to specific political 
content. Various investigations (4, 5, 8) support Rokeach’s contention that 
the Dogmatism Scale is a measure of authoritarianism independent of any 
particular political orientation. Rokeach, however, has implied that the con- 
servative ideology is perhaps more compatible with authoritarianism than the 
liberal ideology. Various studies (3, 4) have also supported the belief that 
conservatives are in fact more dogmatic than liberals. DiRenzo (2) pro- 
posed that one factor in the controversial findings between political choice 
and authoritarianism was the failure of most political issues and candidates 
to offer “. . . sufficiently specific and differentiated stimuli” (p. 1197). DiRenzo 
found a significant relationship between dogmatism and preference for Gold- 
water in the 1964 presidential election. 

Following DiRenzo and Hanson, the present study examined the hypothesis 
that persons preferring William Brock in the 1970 Tennessee senatorial 
campaign would score higher on Rokeach’s Dogmatism Scale than those 
preferring Senator Albert Gore. It was hoped that the Gore-Brock campaign 
would overcome the frequently observed failure of political candidates to 
present sufficiently distinct images and allow voters to make their choices on 
the basis of such variables as dogmatism. Senator Gore was long noted as a 
liberal Senator. Particularly in the several years prior to the 1970 election, 
he was increasingly identified with a number of positions characteristic of the 
“liberal” stance. For example, he was a prominent member of the anti-Vietnam 
War movement, an advocate of decreased emphasis on military spending, a 
supporter of various civil rights measures, and was recorded as voting against 
the conservative nominee to the Supreme Court. Gore’s followers and the 
general media viewed him as a liberal of long standing. The Republican 
nominee, William Brock, possessed reasonable conservative credentials. He 
was noted for voting with the Nixon administration on many issues having 
the characteristic conservative-liberal schism. His campaign emphasized the 
conservative aspects of his career and voting record. Most importantly, perhaps, 
he was pictured by the media as being the conservative candidate. 


B. Procedure 


A political questionnaire and the Dogmatism Scale were administered to 
277 students in an introductory psychology course on the morning of the 
election. The questionnaire included birthdate, college major, sex, number of 
siblings, college class, political preference, voter registration, intentions of 
voting, preference in 1968 presidential election, preference in 1970 senatorial 
election, and whether this preference was based on liking the candidate or dis- 
liking the other more. Rating scales were included to determine (a) degree 
of political liberalism or conservatism; (b) degree of preference for Gore vs. 
Brock; (c) degree of political orientation (Hawk vs. Dove) ; and (d) interest 
in politics. One question asked subjects to rank seven issues in order of impor- 
tance to the individual subject. 


C. Results 


The hypothesis was not supported, although the difference was in the 
expected direction (p < .10). In addition, Dogmatism scores were significantly 
lower (p < .05) for the 154 subjects who had preferred Nixon, a 
moderate to conservative candidate, in the 1968 election compared to the 39 
who had preferred Wallace, viewed as an extremely conservative candidate 
(Ms = 114.0 vs. 122.8). Although the 60 supporters of Humphrey, the most 
liberal of the three major candidates, had lower scores than the Wallaceites 
(Ms = 112.9 vs. 122.8), because of a smaller N than in the Nixon group 
and because of greater variability, the difference was not significant. Subjects 
listing an independent political position were significantly lower (p < .05) 
in Dogmatism (M = 109.7) than those listing Republican or Democratic 
affiliation (Ms = 117.7 vs. 117.5). 


D. Discussion 


Three possible reasons for the failure to support the hypothesis were as 
follows: (a) Since only 13.3% of the subjects were of voting age, a large 
proportion may have had little invested in the campaign. (b) Brock and 
Gore may not have been noticeably different in the dogmatism with which they 
presented their views. (c) Brock’s supporters may have been sensitive to various 
accusations of intolerance and closed-mindedness by supporters of the liberal 
candidate, and possibly were motivated to answer the scale in a less dogmatic 
manner. Although the failure to find significant differences between Humphrey 
and Wallace supporters is intriguing, conclusions should be tentative until 
further research is done on pre-election attitudes. On the other hand, it would 
appear that the means for the Nixon and Humphrey supporters are so close 
that some confidence may be placed in the failure to find significant differences 
between them. 


E. Summary 


The hypothesis that persons preferring a conservative senatorial candidate 
(William Brock) would score higher on Rokeach’s Dogmatism Scale than 
those preferring a liberal candidate (Albert Gore) was studied by administering 
the Dogmatism Scale and a political questionnaire to 277 college students 
on the morning of the election. The hypothesis was not supported. Dogmatism 
scores were significantly lower for subjects (a) preferring Nixon to Wallace 
in the 1968 presidential election; (b) listing an independent political position 
compared to those listing either Democratic or Republican affiliation. Reasons 
underlying the lack of support for the hypothesis were discussed. 


REFERENCES 


1. Adorno, T. W., Frenkel-Brunswik, E., Levinson, D. J., & Sanford, R. N. The 
Authoritarian Personality. New York: Harper, 1950. 
2. DiRenzo, G. J. Dogmatism and presidential preferences in the 1964 elections. 
Psychol. Rep., 1968, 22, 1197-1202. 
3. Hanson, D. J. Dogmatism and authoritarianism. J. Soc. Psychol., 1968, 76, 89-95. 
4. Hanson, D. J. Validity test of the Dogmatism Scale. Psychol. Rep., 1970, 26, 585-586. 
5. Kerlinger, F., & Rokeach, M. The factorial nature of the F and D scales. J. 
Personal. & Soc. Psychol., 1966, 4, 391-399. 
6. Milton, O. Presidential choice and performance on a scale of authoritarianism. 
Amer. Psychol., 1952, 7, 597-598. 
7. Paul, I. H. Impressions of personality, authority, and the fait accompli effect. J. 
Abn. & Soc. Psychol., 1956, 63, 338-344. 
8. Plant, W. T. Rokeach’s Dogmatism Scale as a measure of general authoritarianism. 
Psychol. Rep., 1960, 6, 164. 
9. Rokeach, M. The Open and Closed Mind. New York: Basic Books, 1960. 
10. Sanford, F. H. Public orientation to Roosevelt. Public Opin. Quart., 1951, 15, 189-216. 
11. Williams, P. J., & Wright, C. R. Opinion organization in a heterogeneous adult 
population. J. Abn. & Soc. Psychol., 1955, 51, 559-564. 
12. Wrightsman, L. S., Radloff, R. W., Horton, D. L., & Mecherikoff, M. Authoritarian 
attitudes and presidential voting preferences. Psychol. Rep., 1961, 8, 43-46. 

Department of Psychology 
Memphis State University 
Memphis, Tennessee 38111 




Published as a separate and in The Journal of Psychology, 1972, 82, 175-186. 


A. INTRODUCTION 


A century ago, Galton’s publication of Hereditary Genius traced the family 
histories of prominent Englishmen from the Norman Conquest to the Vic- 
torian period. As early as 1876 he suggested that the history of twins might 
provide clues to the relative influence of heredity and environment. Stimu- 
lated by Binet’s work, differential psychologists during the early decades of 
this century explored the relationships between the intelligence test scores of 
parents and their children. After reviewing these studies, Tyler (4) reported 
that intrafamily correlations in intelligence test scores tend to be of about the 
same magnitude as those for structural physical characteristics; namely, about 
.50. 

About each of 506 Yale fathers and their Yale sons comparable data about 
socioeconomic factors, their participation in various extracurricular college 
activities, their scholastic aptitude, and their academic achievement had been 
accumulated just before and during their undergraduate careers. The scores 
each had achieved on the College Entrance Examination Board’s Scholastic 
Aptitude Test (SAT) were included. Strong motivation may be assumed because 
the test was part of the college application procedure. 

Altogether there were 32 items of information used, and these were classified 
into three categories, as follows. 


1. Academic Ability and Achievement 


Included here were the son’s and his father’s (a) SAT-V (verbal) score; 
(b) predicted freshman year average at Yale; (c) actual freshman year aver- 
age; and (d) graduation status. Predicted freshman year average had been 
computed before the student's matriculation by weighting secondary school 
records and College Entrance Examination Board scores. For this study their 
graduation status was scaled to reveal nonrecipience of the Yale baccalaure- 
ate, and recipience with or without dean’s list standing in any year, cum 
laude, magna cum laude, and summa cum laude. 


2. Socioeconomic Information 


This information included his father’s and paternal grandfather's occupa- 
tion and the occupational level at which each was employed; his mother’s and 
both of his paternal grandparents’ attendance at Yale or another college or 
university; his and his father's church affiliation; the amount of financial aid 
awarded to father and son during the maximum year such support was pro- 
vided by Yale; their college majors; the secondary school each had attended, 
whether public or private and if the latter, whether it was the alma mater of 
both; his mother’s occupational level; his subsequent occupation. 


3. Participation in College Activities 


This category included sons’ and their fathers’ participation in (a) university 
athletics; (b) drama, music, and publications groups; (c) fraternities, 
societies, and clubs; (d) son’s participation in the athletic activities of his 
residential college. 

The academic data were already in quantitative form; records of participa- 
tion in college activities were scaled by categorizing according to number of 
years of participation. Much of the socioeconomic information was arbitrarily 
translated from categories (e.g., affiliation with a religious denomination, 
such as Catholic or Protestant; an occupational classification, such as medi- 
cine or business; an area of study, such as the humanities or the sciences, etc.) 
to numerical scale values 1, 2, 3, . . . . Hence, averages calculated therefrom 
have no content meaning; standard deviations, however, indicate relative 
degrees of dispersion among the categories. 

In the classes of 1931 through 1935, 506 fathers had sons in the classes of 
1955 through 1970. Of these 506 fathers, 71 also had second sons in the 
classes of 1957 through 1970. Thus, our study population comprised 506 
pairs of father-son records, and included were 71 triads of father-two-son 
records. All were white. 


B. METHOD 


Certain characteristics of the data influenced the analysis. For the sake of 
consistency we included only fathers and sons who had completed the freshman 
year at Yale and whose cumulative records were complete. Because of 
the massiveness of the data, coding scales were used; in some of these the 
nature of the information resulted in coarse intervals of limited number. For 
example, for certain academic records, socioeconomic status materials, and 
college activities data, the maximum numbers of intervals were six, nine, and 
10, respectively. 

Some of the distributions, particularly those relating to extracurricular 
participation, were markedly skewed; more than half of the fathers and two-thirds 
of the sons participated in no university-wide athletics; nearly two-thirds 
of the fathers were not members of a drama, music, or publication 
group. Thirty-three per cent of grandfathers, 82% of the grandmothers, and 
36% of the mothers had not attended college. Approximately half of the 
fathers but less than one-fifth of the sons had received financial aid during 
their undergraduate years. About 85% of the mothers were not employed 
except as full-time homemakers. The coarse groupings tend to mask small 
differences in the original data. Thus, it seems unlikely that other researchers 
would find lower correlation coefficients than ours. . 

Multiple regression and stepwise multiple regression procedures were used 
in the analyses. Table 1 presents father-son correlations, means, and standard 
deviations. Table 2 summarizes the correlations between the many indepen- 
dent and the two dependent variables, the son’s SAT score, and his Yale 
graduation status. Data in the tables are arranged according to (a) the 506 
pairs of fathers and only or first sons and (b) the 71 triads of fathers and 
two sons. 

Because of their close correspondence to the 506 pairs, the means and standard
deviations for the 71 triads are not detailed. Fisher's discriminant function
analysis (2) of the 435 pairs (i.e., 506 − 71) versus the 71 triads revealed
the close comparability of the two groups (F = .98, p > .20). On
only one of the 32 scales was the difference between the mean scores of the 
pairs and triads statistically significant at the .05 level: fathers of two sons 
had been more involved in Yale athletics than the fathers of one son. 

That many of the correlations seem low results partly from the restricting 
influences in our data as discussed above. Garrett (3) provides a formula 
that can be used in certain instances to correct for this situation. When adjusted
for restriction in range of son’s SAT scores (i.e., SD of 7.4 vs. 10.0
for the college applicant population), the low correlation of .24 between the
scores of fathers and sons was markedly increased to an estimated coefficient
of .70.

Certain of the father-son correlations based on socioeconomic scales are 
TABLE 1
FATHER/SON CORRELATIONS, MEANS, AND STANDARD DEVIATIONS 
506 F/S pairs 71 F/S triads 
Fathers Sons Correlations
Variables r M SD M SD F/S1 F/S2 S1/S2
Academic scales 
Graduation Status AS80270 1A 24 11 24 —12 07 
Freshman Year Average 22.753 64 771 5.5 22 26 © 32 
Prediction 14 743 66 75.9 5.7 18 23 14 
SAT-V 24 52.9 85 61.0 7.4 2 27. al 
Socioeconomic scalesᶜ
Church Affiliation 4 27 14° «27 (14 44 54 = 473 
Yale Major Area 10°28 409 24 O09 «38 21.20 
Secondary School Attended 46 #24 O8 2.9 0.7 46 40 66 
Financial Aid 22 lt 14 (04 (09 22 2365 
Occupation 09, $0 21 46.19 as 
College activities scales 
Athletics 21 13 2.0 07° #13 —.01 06 43 


Drama, Music, Publications 13 12 22 14 2.0 —13 —11 32 
Fraternities, Societies, Clubs 30 2.9 25 22 21 35 2 1S 
Residential College Athleticsᵈ — — — 23 28 - —


ᵃ Assuming random sampling, p = .01 when r > .12.

ᵇ Assuming random sampling, p = .01 when r > .30.

ᶜ Ranges of means and SDs on other scales for parents and grandparents, respectively,
were as follows: College Attended: means were .2 and 1.0, SDs were .4 and
.8; Occupation (grandfather): mean = 5.2, SD = 2.2; Occupational Level: means
were .4 and 3.6, SDs were .9 and 1.5.

ᵈ These data were not available for fathers, since the residential colleges had not
been established when many were Yale undergraduates.


not surprising. The preponderance (some 80%) of fathers and their sons
who had attended private secondary schools lends credence to the .46 coefficient;
a higher correlation than .44 in church affiliation might have been
anticipated. Coefficients of .66 and .73 were found between comparable data 
for the first and second sons in the triad group. 

Table 2 presents the correlations between the independent and the depen- 
dent variables. Those for the academic scales are detailed; for the socioeco- 
nomic and college activities scales only maximum and minimum coefficients 
are indicated. The massiveness of the data necessitated the presentation of 
only those findings most important to an understanding of the outcomes of 
the study. 
C. RESULTS
1. Multiple Correlations 


Multiple correlations between the antecedent variables and the son’s SAT 
scores and his graduation status are presented in Table 3, the reading of which 
may be made clear through the following example. The father’s combined 
academic scales, consisting of his own graduation status, his freshman year 
average, his prediction, and his SAT score, correlated .25 with his son’s SAT 
score and .19 with his son’s graduation status: both of these coefficients were 
derived from the 506 pairs. The parents’ socioeconomic scales correlated .26 
and .25, respectively, with sons’ SAT and graduation status. 

These multiple correlations reveal unexpectedly strong relationships be- 
tween the son’s two academic criteria (his SAT score and his graduation 
status) and his own socioeconomic status and that of his parents. The ex- 
tent to which family characteristics relate to the son’s academic performance 
is the focus of the next section. 


2. Stepwise Regressions 


Multiple correlation procedures permit the determination of (a) which of 
several independent variables contributes most to the dependent variables, 
and subsequently (b) the order in which the other variables enter to produce
the maximum multiple. Draper and Smith’s computer programming tech- 
nique (1) provides a cumulative record of the contribution of each variable 
as it is entered into the series. Table 4 presents a summary of the cumulative 
multiple correlations based on the 506 pairs. Columns one and two present 
the data in conventional descending order; columns three and four present 
them in reverse order. Thus, in column one the son’s SAT score correlated
.20 with his own socioeconomic data; the addition of his father’s academic
scales and his parents’ socioeconomic scales raised the correlation to .31 and
.36, respectively; and his father’s college activities scales and his grandparents’
socioeconomic scales increased it to .37. Thus, combined background factors
correlated .37 with the SAT scores earned by the sons. 

When this series of multiple correlations was calculated in reverse order, 
as in column three, starting with the grandparents’ socioeconomic scales, the 
initial correlation was .09; the addition of all three of the parents’ scales increased
it to .35. Thus, data about his parents and grandparents provided a
base coefficient of .35, which his own socioeconomic data increased to .37.
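
The forward and reverse entry orders described above can be illustrated with a minimal sketch. It uses synthetic data rather than the Yale records, and the block names (`son_ses`, `father_academic`, `parents_ses`), the helper functions, and the effect sizes are all hypothetical. It is plain least squares with blocks of predictors entered cumulatively, not a reproduction of the Draper and Smith program.

```python
import math
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def pearson(u, v):
    """Pearson correlation between two equal-length lists."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    suv = sum((p - mu) * (q - mv) for p, q in zip(u, v))
    suu = sum((p - mu) ** 2 for p in u)
    svv = sum((q - mv) ** 2 for q in v)
    return suv / math.sqrt(suu * svv)


def multiple_r(columns, y):
    """Multiple correlation: correlation of y with its least-squares fit."""
    design = [[1.0] * len(y)] + columns  # intercept column first
    xtx = [[sum(p * q for p, q in zip(u, v)) for v in design] for u in design]
    xty = [sum(p * q for p, q in zip(u, y)) for u in design]
    beta = solve(xtx, xty)
    fitted = [sum(b * col[i] for b, col in zip(beta, design))
              for i in range(len(y))]
    return pearson(fitted, y)


def cumulative_r(blocks, y, order):
    """Enter blocks of predictors in the given order; record R after each."""
    entered, steps = [], []
    for name in order:
        entered.extend(blocks[name])
        steps.append((name, multiple_r(entered, y)))
    return steps


random.seed(0)
n = 506  # same number of cases as the pairs; the values are synthetic


def noise():
    return [random.gauss(0, 1) for _ in range(n)]


blocks = {
    "son_ses": [noise(), noise()],
    "father_academic": [noise(), noise()],
    "parents_ses": [noise()],
}
# Hypothetical criterion that depends weakly on two of the blocks.
y = [0.3 * a + 0.3 * b + e
     for a, b, e in zip(blocks["son_ses"][0],
                        blocks["father_academic"][0], noise())]

forward = cumulative_r(blocks, y, ["son_ses", "father_academic", "parents_ses"])
reverse = cumulative_r(blocks, y, ["parents_ses", "father_academic", "son_ses"])
for label, steps in (("forward", forward), ("reverse", reverse)):
    print(label, [(name, round(r, 2)) for name, r in steps])
```

In either order the cumulative multiple can only stay level or rise as blocks are added, and both orders end at the same final value, because the full set of predictors is identical. Only the increment credited to each block depends on when it enters.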

Columns two and four of Table 4 reveal that the best combination of the 
son’s and the parents’ scales resulted in a multiple correlation of .53 with
TABLE 4
SUMMARY OF THE CUMULATIVE CORRELATIONS FOR THE 506 PAIRS

                                              Dependent variables
                                          Order              Reverse order
                                       1         2            3         4
                                     Son’s     Son’s        Son’s     Son’s
                                     SAT-V   Graduation     SAT-V   Graduation
Groups of independent variables                Status                 Status

S1 academic scales                             .49ᵃ                   .53ᵃ
S1 socioeconomic scales               .20      .49           .37      .33
F academic scales                     .31      .51           .35      .29
Parents’ socioeconomic scales         .36      .53           .27      .26
F college activities scales       [illegible]  .53           .18   [illegible]
Grandparents’ socioeconomic scales    .37      .53           .09   [illegible]

ᵃ Academic scales represented were SAT-V and Prediction.




the son’s graduation status. In column two the son’s academic scales bear the
major burden of the multiple correlation; his own socioeconomic scales add
nothing, and data about his parents and grandparents added only a small
increment. Yet, column four indicates that family background factors accounted
for a multiple correlation of .29, which increased to .33 with the addition
of the son’s own socioeconomic data. The subsequent addition of his own
academic scales raised this multiple to .53, an increase suggesting that the
initial multiple of .49 was a reflection of more than the son’s own academic
potential as estimated at matriculation.


3. Cumulative Multiple Correlation Data for the Triads 


In Table 5 the cumulative multiple correlations for the triads, ranging
from .62 to .78, seem surprisingly high. These are much higher than those of
.37 and .53 for the pairs (Table 4). Differences between the pairs and triads
are further underscored in Table 3 by comparing column one with two, and
column four with five. Since, as we reported above, the pairs and triads are
not statistically different with respect to means and standard deviations, the
fundamental reason for the differences in correlations is not clear.

Column one, Table 4 indicates that for the 506 pairs the cumulative multiple
correlation started with .20 and reached a maximum of only .37. The
comparable regression for the triads (Table 5, column one) started with a
higher “lift-off” coefficient of .41 and terminated at .62. Thus this seemingly
high multiple of .62 results largely from the initial .41 correlation. With
the S2 data shown in Table 5, column two, the older brothers’ record augmented
creditably the initial correlation from .19 to .52; sequentially, the data about
the parents and the grandparents increased it to .75 and .78, respectively.

These comparisons and others involving the graduation status regressions 
support the following conclusions: (a) the addition of family data, such as 
those of an older brother, parents, and even grandparents, contributes substantially
to the prediction of the SAT score and graduation status; and (b)
the intercorrelations among the scales reflect factors other than one specific 
characteristic in each scale. 

In Table 3, greatest stability is found in the correlations between the aca- 
demic scales and graduation status, where coefficients for the three popula- 
tions are .49, .52, and .33, respectively. Thus the combination of SAT and 
prediction is the most dependable index in our data. Table 5 indicates the 
consistency with which antecedent data increased the predictive power of the 
academic scales; the reverse order regressions reveal that the sons’ academic 
scales added little to the information already available. In considering these 
findings one should be aware of the culturally homogeneous nature of the 
sample which was composed of white males, of whom about 80% were gradu- 
ates of independent secondary schools. 


D. CONCLUDING STATEMENT

1. Stepwise multiple regression analyses of 32 items of information about 
506 pairs of fathers and sons who had attended Yale as undergraduates indi- 
cate that the sons’ SAT scores, taken before their college matriculation, might 
have been estimated from family background data to the extent implied by a
multiple correlation of .37. These background factors included the father’s
academic record, which contributed about the same correlation as the parents’
socioeconomic scales. To the extent represented by a correlation of .29, graduation
status might have been estimated from the parents’ and grandparents’
data combined, with the correlation increased to .53 by the addition of the 
student’s own socioeconomic and academic data accumulated prior to matricu- 
lation. 

Since an SAT score can be predicted more dependably from the Preliminary
Scholastic Aptitude Test, and since the student’s own prematriculation
academic data correlated .49 with his later graduation status, these findings
have more theoretical than practical significance. Socioeconomic factors and
parental participation in college activities correlated highly enough with subsequent
academic criteria to suggest that our culture is reflected in our
measures of scholastic aptitude and college achievement. The history of
attempts to develop a culture-free intelligence test is replete with failure. And
even if one were to be developed, it would be useful in higher education in 
this country only if it could be combined with other culture-laden measures. 

2. Our adjusted coefficient of .70 (father-son SAT score correlation) is
higher than the .50 generally cited as representing the parent-child relation
in intelligence test scores. This suggests that under controlled conditions
including (a) a reliable test appropriate to the culture and to the age group,
(b) subjects of relatively constant chronological age, and (c) testees motivated
to achieve optimal scores, a high parent-child coefficient might be expected.

3. Since our research design did not provide for cross-validation or replica- 
tion, other researchers should be encouraged to pursue similar longitudinal 
studies. Because multiple correlations involving many variables tend to reflect 
cumulative errors, the resulting coefficients must be interpreted cautiously. 
Furthermore, our findings are “culture-bound” since, as was pointed out
earlier, our unique body of data was derived from a group relatively highly 
selected in terms of academic and socioeconomic characteristics. Much longi- 
tudinal research on groups of greater diversity of background will be 
necessary before our findings can be broadly generalized. 


E. Summary 


Matriculation and in-college data about 506 Yale fathers were compared 
with the analogous records of their Yale sons. Thirty-two variables included 
data about academic ability and performance, socioeconomic status, and
participation in extracurricular activities. Stepwise multiple regression analyses
yielded a multiple correlation of .37 between family background factors and 
sons’ SAT scores. Antecedent data correlated .53 with sons’ college gradua- 
tion status. Thus cultural factors are reflected in our measures of scholastic 
aptitude and college achievement. Correction for restriction in range of sons’ 
scores resulted in an estimated correlation of .70 between SAT scores of 
fathers and their sons. This finding suggests that when age is held relatively 
constant, and motivation is high, a significantly higher coefficient may be expected
than those of about .50 hitherto reported as representative of the parent-child
relation in intelligence test scores.
RISK-TAKING EFFECTS ON GROUP DECISION MAKING*


University of British Columbia and Simon Fraser University, Canada 


MARINO A. MIDDLETON AND LAWRENCE WARREN


A. INTRODUCTION 


Wallach, Kogan, and Bem (8), investigating factors determining a group’s 
choice of a leader, found a significant correlation between individual group 
members’ assessments of the relative influentiality of other members of the 
group and the risk those persons had displayed in initial decisions made in 
the presence of the group. Marquis (5) also asked his Ss about the influence 
exerted by other group members and also found that those thought to have 
been more influential had taken riskier positions in outset decisions. Pincus 
(6) found that, in fact, the people who were identified as most influential in 
discussions of particular situations tended to have held the socially desirable 
position for that situation at the outset of the discussion. In other words, they 
were initially most risky in risky situations and initially most conservative in 
conservative situations. His findings were interpreted as follows: A person 
becomes influential when he holds the socially desirable position of the group 
and is best able to articulate that position. The above studies were all con- 
ducted on North American Ss. 

The present study concerns itself with the relationship between individual 
risk-taking behavior and perceived influence in group decision making in 
sequential search tasks. Is it the risky position taken by an individual or is it 
the charisma of that individual that causes the group to adopt a position 
riskier than its central tendency? 

The sequential search decision making task discussed follows this outline: 
In a series of observations, a decision maker is instructed to select the best 
option. On each observation, he may opt to reject the current candidate and 
continue the search, or he may choose to stop and accept that option. He may 
not subsequently choose an option he has previously rejected. Problems of this 
nature are stopping rule problems (2) or optimal stopping problems (3). 
They are similar to real life problems of the order of selling stocks, accepting 
a job, or buying a used car. 

The present study is concerned with an optimal stopping problem with a 


* Received in the Editorial Office on July 3, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press.




fixed number of observations involving no profit or cost to the S making a 
decision on an observation. Two rules govern the task: (a) In choosing the 
highest number, only the current observation may be chosen. (b) Ss may 
keep a record of numbers as they pass. 

The game constitutes a finite Markov chain. The state $S_n$ is defined as the
maximum of a sequence of $n$ numbers, so that $S_n = \max(X_1, X_2, \ldots, X_n)$,
where $X_n$ is the number heard on a tape recording in the $n$th position,
$n = 1, 2, 3, \ldots, N$. We then have

$$P(S_{n+1} = j \mid S_n = i) = \begin{cases} 0 & \text{for } j < i, \\ P(X_{n+1}) & \text{for } j > i, \end{cases}$$

where $j = S_{n+1}$, $i = S_n$, and $P$ is the probability of the event.

A solution to the finite optimal stopping problem for large N was given by 
Ash and Jones (1): Sample until $Z_0 = N - k_0$ have been heard as candidate
numbers and then stop after hearing the first number that is the largest of
those heard so far. Declare this number to be the largest of all those available.
The optimal number of remaining numbers, $k_0$, is the smallest root of


[Equation illegible in the source scan.]

This is given to good approximation for large $N$ by $k_0 = N(1 - 1/e)$. Then
$Z_0 \approx N/e$. $Z_0$ represents the optimal stopping number, $k_0$ is the optimal
number of remaining numbers, and $e$ is a constant of 2.56. Gilbert and Mosteller
(3) provide exact values of $Z_0$ for different values of $N$.

The optimal stopping rule number was used as a baseline; the baseline for 
distributions of 64 numbers each is the 25th number into the distribution. 
Conservative behavior in the stopping rule problem involving a distribution 
of 64 numbers is arbitrarily defined as observing or hearing 25 numbers and 
then selecting the first number that is largest of any that have gone before. 
Risky behavior is arbitrarily defined as hearing 10 numbers in a distribution
of 64 and then selecting the first largest candidate of any that have gone 
before. 
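
The two strategies defined above can be compared with a short calculation. The sketch below is an illustration under stated assumptions, not part of the original analysis: it assumes the 64 numbers arrive in a uniformly random order with no ties, and the function name `success_probability` is hypothetical. Under those assumptions, the chance of ending on the overall maximum after passively hearing s numbers and then taking the first number larger than all before it is (s/N) multiplied by the sum of 1/k for k from s to N − 1.

```python
from fractions import Fraction


def success_probability(n_total, n_skip):
    """Chance of selecting the overall maximum when the first n_skip numbers
    are only observed and the first later number that exceeds everything
    heard so far is chosen. Assumes a random order with no ties."""
    if n_skip == 0:
        # With nothing observed first, the very first number is taken.
        return Fraction(1, n_total)
    harmonic_tail = sum(Fraction(1, k) for k in range(n_skip, n_total))
    return Fraction(n_skip, n_total) * harmonic_tail


N = 64
p_risky = success_probability(N, 10)          # "risky": hear 10, then choose
p_conservative = success_probability(N, 25)   # "conservative": hear 25 first

print(f"risky:        {float(p_risky):.3f}")
print(f"conservative: {float(p_conservative):.3f}")
```

Exact fractions are used so that the comparison between the two strategies is not affected by floating-point rounding. The skewed distributions used in the experiment may contain repeated values, in which case the no-ties assumption holds only approximately.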

In addition to examining the relationship between risk and perceived in- 
fluence as outlined previously, the decision maker’s sensitivity to various dis- 
tributions of numbers is also tested. Whether this sensitivity varied under 
individual or group administration is also analyzed. 

It is hypothesized as follows: (a) Individuals tested alone will reach an
earlier decision, or stop, significantly sooner than will members of groups.
(b) Individuals may be sensitive to variations in the distributions when selecting
their candidate as being the highest number of a particular distribution.
(c) Group size is a variable that significantly affects the influentiality of 
any member. (d) If an individual member of a group assumes a risky position 
on a task of this nature, any significant influence that an individual exerts 
on the group is a result of the risky position itself and is not due to charisma 
or any particular personality variable. 

In an experiment using the stopping rule type of problem administered to 
individuals alone and in groups, Kahan, Rapoport, and Jones (4) found that 
individuals tested alone stopped sooner; and when testing groups on their 
variation in performance, by exposing them to differently skewed distribu- 
tions, they found that performance scores were not significantly different for 
the distributions. Individual performance scores were not tested, by them, in 
this regard. It may be that when an individual is in a group, this sensitivity 
is lost, as a result of conformity behavior or other variables. Ash and Jones 
(1) and Rapoport and Tversky (7) found that the cost of the observations 
is likely to be more salient for Ss tested alone. 

Our third hypothesis projects that it is easier to persuade two people to 
shift their positions than it is to influence five people to do so. 


B. METHOD


The design was a 3 × 3 × 2 factorial in randomized groups. The Ss were
81 undergraduates from the University of British Columbia and from Simon 
Fraser University, who volunteered to participate in a decision making ex- 
periment. Males and females were randomly assigned to groups. Scores were 
recorded for 70 of the 81 Ss. Scores for five Ss were discounted for the con- 
trol condition for groups of three, and scores of two Ss were discounted for 
the control condition for groups of six. Four Ss were E confederates whose 
scores were discounted. Thirty Ss were tested in groups of three, and 30 Ss 
were tested in groups of six. Ten Ss were tested individually. Each group 
was randomly presented with three experimental conditions—a shuffled dis- 
tribution of numbers from three differently skewed distributions: (a) one 
positively skewed, (b) one negatively skewed, and (c) one normal distribu- 
tion. Each distribution was adjusted to have the same mean, 41. 

All numbers were presented to Ss by means of taped numbers with two- 
second intervals. Ss were advised that they might keep a written record of 
the numbers as they were presented or could rely on their memories, as they 
preferred.

The group of 30 Ss in groups of three each were divided into 10 Ss in 
groups of three in which one of the members of the group was a confederate 
with the E. This confederate would assume a “risky” position in the group.
The confederate would arbitrarily proceed 10 numbers into the distribution 
of 64 numbers and would then select the first subsequent highest number as 
his candidate. E recorded the distance into the distribution that each of the 
other members of the group would proceed before announcing their candi- 
dates. Fifteen Ss were placed in groups of three, and the measurements of two 
of the three, in each group, were taken as the control group measurements. 
Ten Ss, of 15 Ss tested in groups of three were measured, two from each 
group, while the third S, the E confederate, voted for the first subsequent 
highest number offered after the 25th candidate, which was defined as the 
conservative position. A similar arrangement was instituted for groups of six 
wherein the sixth S was the E confederate, taking either the risky or the conservative
position. Dependent measures were on two groups of five in each
manipulation. Finally, 10 Ss were tested, individually, with the three ran- 
domly presented distributions. 

Each S was instructed that three sequences of 64 numbers each would be
presented and that from each group he was to decide which number would
be the highest number. The Ss were instructed that as each number passed, it
would be rejected as a possible candidate, so that they would be required to
make their choice at the moment of presentation and had to inform the E
immediately of that choice by responding to the option with the word “Stop.”
At this moment the E would temporarily halt the flow of numbers emanating
from the tape recorder and record the distance that particular S had progressed
into the distribution. When the last S called his choice, the E would
advance the tape to the next sequence in order to avoid, as much as possible,
any conclusive impartation of knowledge about the sequences to the Ss. The
E would also stop the flow of numbers and advance to the next sequence if
an S passed the actual highest number in any distribution.


C. RESULTS


Let Z, Z = 1, 2, ..., 64, denote the number of choices heard by S before
stopping the search. Measurements were made from a baseline of 25 numbers
into the distribution (25 is the optimal stopping number). All scores assigned
to Ss were deviations from this baseline. The E confederate scores were not
counted in the analysis; neither were excess scores in the control group conditions.

The data were analyzed with use of a one-way analysis of variance technique.
For the ANOVA, scores used were absolute deviations from 25. A
Dunnett’s test for comparison of treatment means and control groups was
used to evaluate significance among the risky and conservative treatments. A
comparison of the means of scores of individually tested Ss and group tested
Ss in the control groups of different sizes was also undertaken.

The ANOVA in Table 1 reveals that all treatments except the risk factor
reached significance for p < .05.


TABLE 1
ANALYSIS OF VARIANCE OF TREATMENT VARIABLES AND INTERACTIONS

Source                                       SS     df      MS       F

Group size                                  454      1    454.0    13.00
Distribution                                783      2    391.5    11.12
Risk factor                                  69      2     34.5      .98
Group × Distribution                       1434      2    717.0    20.53
Group × Risk factor                         321      2    160.5     5.45
Distribution × Risk factor                 1813      4    453.2    13.00
Distribution × Risk factor × Group size    3003      4    750.7    21.15
Within treatments                          5738    164     34.9
Total                                     13615    179
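
As a reading aid for Table 1, the sketch below recomputes each mean square as SS/df and each F ratio as the effect mean square divided by the within-treatments mean square. The figures are copied from the table; the variable names are hypothetical. Because the printed values are rounded and come from a scanned page, the recomputed ratios need not agree with the printed F column in every row.

```python
# (sum of squares, degrees of freedom) for each effect, copied from Table 1.
effects = {
    "Group size": (454, 1),
    "Distribution": (783, 2),
    "Risk factor": (69, 2),
    "Group x Distribution": (1434, 2),
    "Group x Risk factor": (321, 2),
    "Distribution x Risk factor": (1813, 4),
    "Distribution x Risk factor x Group size": (3003, 4),
}
ss_within, df_within = 5738, 164

# The error term shared by every F ratio.
ms_within = ss_within / df_within

# The component sums of squares should add up to the printed total.
total_ss = sum(ss for ss, _ in effects.values()) + ss_within

f_ratios = {}
for name, (ss, df) in effects.items():
    mean_square = ss / df
    f_ratios[name] = mean_square / ms_within
    print(f"{name:<42} MS = {mean_square:7.1f}  F = {f_ratios[name]:6.2f}")

print(f"Within treatments MS = {ms_within:.1f}; total SS = {total_ss}")
```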


The comparison of risk and conservative treatments with controls of groups
of three Ss and groups of six Ss, using a two-tailed Dunnett’s test (Table 2),
revealed that the risky and conservative treatment means were all significant
when compared to controls for groups of three Ss. These treatments with
groups of six Ss reached significance only in the positive distribution; the
critical difference between means was 6.76, for p < .05, for 1 and 27 df.


The comparison of means for the individually tested and control group tested
Ss was undertaken by six separate t tests of the null hypothesis; for 1 and
18 df, t = 1.73, p < .05. Results are presented in Table


TABLE 2
RESULTS OF DUNNETT’S TEST FOR RISK MANIPULATION COMPARISONS
WITH A CONTROL

                              Group size
Distributions        3 Ss per group    6 Ss per group

Normal
  Risky                  − 9.7              4.6
  Conservative            12.0             − .6
Positive
  Risky                  −13.5             10.9
  Conservative             7.0              7.9
Negative
  Risky                   19.1              2.2
  Conservative           −12.2              4.2
3. It may be seen that in the normal distribution for group size of three Ss,
the differences between the means did not attain significance. Significant
differences between the means were attained in groups of three Ss exposed to
the negative distribution and groups of six Ss exposed to the positive distribution.


TABLE 3
INDIVIDUAL TREATMENT VS. GROUP CONTROL MEANS

                         3 Ss per group                  6 Ss per group
Distributions      Treatment   Control     t       Treatment   Control     t

Normal               −2.4       + 1.7     1.02       −2.40      + 6.1     1.61
Positive             −1.0       +10.7     2.18       −1.0       +10.7     2.11
Negative             −7.3       +18.9     5.03       −7.30      + 7.0     2.17


D. Discussion 


The results indicate that there was a significant difference in performance 
between group sizes in their deviation scores from an optimal stopping point. 
It was hypothesized that group size is a variable that significantly affects 
influentiality of group members, so that it is easier to influence members of 
smaller groups as opposed to larger groups. The results confirmed this. Significance
was attained by Ss in groups of three for all distributions; but significance
was reached only for the positive distribution with Ss in groups of six.

Individuals stopped earlier in all the treatments when compared to controls
for each of the group sizes and distributions. Significance, for Ss tested individually,
however, was only attained in the positive and negative distributions.
Kahan, Rapoport, and Jones (4) found that individuals stopped sooner
than groups. The present experiment suggests that distributions may affect
this stopping behavior.

The results indicate that individuals, as well as groups of the sizes tested, 
respond differently to the various distributions presented. It may be that the 
patterns of numbers heard suggested possible different stopping behavior. 
These patterns, then, would depend on the different distributions. Some Ss 
tested exhibited an amazing sensitivity to these patterns and would consis- 
tently select either the highest candidate in the distribution or a candidate in 
close proximity to the value of the highest candidate. More experimentation 
to assess the variables responsible for this type of search behavior might be 
profitable. 


The hypothesis that any significant influence an individual might exert on 
the behavior of a group is a result of the risky position itself, and not of a
personality variable, was not substantiated by the results. If the risky position
were the influential factor, the E confederate would not have been able to
shift the group to a conservative position, as well as to a risky one. The results
indicate that the E confederate was able to influence the responses of Ss
so that they stopped significantly sooner than their respective controls. This 
behavior was most pronounced in the groups of three. The E confederate was 
able also to persuade the Ss to move to a more conservative position. Again, 
this behavior was most marked in groups of three. 

The risk factor manipulation did not attain significance in the ANOVA. 
Possibly this was because absolute deviation scores from the optimal stopping 
number baseline were used. However, the Dunnett’s test revealed the compar- 
isons of risk treatment variables with respective controls. Significant interac- 
tion effects indicate the relatedness of the main effects. 

While the results do not support the hypothesis concerning the nature of 
the risky position, they indicate that some norm-setting behavior is being 
established: the E confederate is, in effect, “breaking the ice” in the risky
situation and thereby sets the norm for the group. He apparently is able to
influence others when assuming a conservative position because he has left
the group “leaderless.” This two-way influence could possibly be due to
charisma or personality variables. Further, to Pincus (6) it appears that the 
socially desirable position for a given situation may be communicated and 
defined by a member of a group. His conformity explanation is also sub- 
stantiated. 


E. Summary 


This study concerns itself with the relationship between individual risk-taking
behavior and the influence this behavior exerts on group decision making
in sequential search tasks. Thirty Ss tested in groups of three, 30 Ss tested
in groups of six, and 10 Ss tested individually showed significant differences 
in their relative performance, when instructed to select the largest of a set 
of 64 numbers, heard one at a time (when only the number currently ob- 
served could be declared the largest). Numbers were sampled from three 
different distributions: one normal, one skewed positively, and one skewed 
negatively. 

It was found that Ss, individually and in groups, showed the effects of the 
distributions upon performance. Group size was a significant factor in the 
performance of the Ss. An E confederate in the groups was able to affect the 
performance of the group significantly, by taking either a “risky” or a “conservative”
position. The results indicated that individuals tested alone stopped
sooner than those tested in groups and that, in groups, an S is able to help 
establish the norm. 
SELECTIVE EXPOSURE AND THE
“ACTIVE GROUP EFFECT”* 


Memphis State University 
CHARLES T. KENNY AND LUKE ELLIOTT, JR.


A. Introduction 


Reviews of the literature on the selective exposure hypothesis by Freedman 
and Sears (2) and McGuire (4) reveal that there is contradictory evidence 
with as much data to support it as there is to reject it. The purpose of the 
present study was to provide an experimental test of the hypothesis. The 
choice of the attitude to be investigated was inspired by McGinnies and 
Rosenbaum (3), who found that college females (but not males) who were 
exposed to President Johnson’s April 7, 1965 speech on Vietnam were more 
likely to support U. S. involvement in Vietnam than were those females that 
did not attend to the speech. The present study utilized the November 3, 
1969 speech by President Nixon, a speech that was preceded by some specula- 
tion that the President might alter his position in a dovish direction. The 
design of the study also permitted a comparison with the overall means re- 
ported by McGinnies and Rosenbaum in order to evaluate possible changes 
over time in students’ attitude toward the war. 


B. Method 


During the week prior to the speech, over 300 Introductory Psychology 
students at Memphis State University were administered a pretest, which 
consisted of 19 seven-point Likert-type items. They included the eight items 
used by McGinnies and Rosenbaum,¹ plus 11 additional items. The new 
items measured attitudes toward the war, interest in politics, and frequency 
of exposure to the media. During the week following the speech the posttest 
was administered. It consisted of the same 19 items in addition to three questions 
asking (a) whether or not the subjects saw or heard the speech, (b) 
whether they heard about the speech from any source, and (c) whether they 
neither saw nor heard the speech nor learned anything about it. These three 
questions divided the subjects into three groups: (a) direct exposure, (b) 
indirect exposure, and (c) no exposure. One hundred forty-nine subjects 
with both pre- and posttests were available. 


* Received in the Editorial Office on July 3, 1972, and published immediately at 


C. Results² 
1. Selective Exposure 


An overall test for both males and females indicated that those in the 
direct exposure group—that is, those who saw or heard the speech (N = 82; 
M = 4.15)—were significantly (p < .05) less favorable (i.e., less hawkish) 
toward the administration’s position than those who neither saw nor heard 
the speech (N = 31; M = 4.55). The difference for the females was 
slightly smaller, and hence only reached the .10 level. There were too few 
males (N = 31 in the direct exposure group and N = 10 in the no exposure 
group) and too much variability to produce a significant difference. The 
means for the indirect exposure group were closer to the direct exposure 
group, but were not significantly different from the other means. 


2. Sex Differences 


Proportionally more males than females saw, heard, or read about the 
speech (χ² = 9.14; df = 1; p < .01). This finding agrees with McGinnies 
and Rosenbaum. However, they had also found that females were more 
dovish than males, a finding that was not substantiated in the present study. 
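
The probability attached to the reported chi-square can be checked with a short sketch. It assumes only the values reported above (χ² = 9.14, df = 1); the function name is introduced here for illustration.

```python
import math

def chi2_sf_df1(x):
    """Upper-tail probability of a chi-square variate with 1 degree of freedom."""
    # With df = 1, chi-square is the square of a standard normal deviate, so
    # P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(x / 2))

p = chi2_sf_df1(9.14)
print(f"chi-square = 9.14, df = 1, p = {p:.4f}")
```

The resulting probability falls below .01, in line with the significance level reported in the text.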


3. Attitude Change 


Although the subjects did not change from the pretest to the posttest in this 
study, there was a significant decline in hawkishness when these subjects 
(males = 34.9, females = 36.9) were compared to McGinnies and Rosenbaum’s 
(males = 49.3, females = 45.4). 


D. Discussion 


Although the results provided support for a selective exposure hypothesis, 
they were in the direction opposite from what would have been predicted by 
Festinger (1), the originator of this hypothesis. They also contradicted the 
findings reported by McGinnies and Rosenbaum. 

A possible explanation for these results is advanced by postulating the 
“active group effect,” whereby those who are most active and interested in 
an issue will be most likely to expose themselves to information concerning it. 
In 1965 the active group was the hawks, whereas in 1969 it was the doves. 
The shift over the four-year period in average attitude on the McGinnies and 
Rosenbaum scale supports the claim that the doves were more active in 1969. 
Less formal observations from the media and personal experience would also 
substantiate this shift. One further factor may have been operating. President 
Nixon had been expected by some to shift his own position, whereas in 1965 
no one had expected President Johnson to shift. It may be, then, that if a 
group is becoming active, and if there is some uncertainty, however small, 
about the nature of the message, that group will be more likely to expose 
itself to the message. According to this interpretation, it is not surprising that 
the literature on selective exposure is contradictory, with some studies finding 
support for the hypothesis and some studies finding results in the opposite 
direction. When no differences are obtained, it may be that measurements were 
taken in a transitional period when one grouping is losing its active group 
status and another is acquiring it. 


2 Special thanks to Mrs. Sarah Miravalle of the M


E. Summary 


As a test of the selective exposure hypothesis, college students’ attention to 
President Nixon’s November 3, 1969 speech on Vietnam was utilized. Atti- 
tude toward the war was measured from 149 introductory psychology stu- 
dents during the week prior to and the week following the speech. The results 
indicated that those students who were directly exposed to the President’s 
speech had more dovish attitudes than those who neither saw nor heard the 
speech, a finding at odds with the selective exposure hypothesis as usually 
formulated. To explain the results, the active group effect was postulated. 
When combined with some uncertainty about the nature of the message, it 
can account for much of the contradictory evidence in the selective exposure 
literature. 


AN INTERPERSONAL SIMULATION OF THE 
JECKER CONFLICT EXPERIMENT: A TEST 
OF BEM’S THEORY* 


Department of Psychology, University of Florida 
Howard D. Woodyard 


A. Introduction 


Cognitive dissonance theory predicts that in a “free-choice” experiment after 
a person has chosen between two equally attractive objects, he will rate the 
chosen object as being significantly more desirable than the unchosen object—a 
“spreading apart effect.” Experiments generally yield the predicted results 
(3). 

Bem (1) has offered an alternative explanation of such cognitive dissonance 
results. He argues that there is essentially no difference between the perception 
of oneself and the perception of others. Thus, according to Bem’s self-percep- 
tion theory, an individual makes judgments about himself on the same basis 
as he makes judgments about another person. Both such judgments are made 
on the basis of observed behavior and the context within which the behavior 
takes place. Bem assumes that premanipulation beliefs are not salient to the 
outcome measures in cognitive dissonance experiments. 

Bem (1) developed the “interpersonal simulation” experiment to demon- 
strate that his self-perception hypothesis was a viable alternative to cognitive 
dissonance theory in explaining the results of “free-choice” experiments. 
According to Bem, the S sees himself select one alternative over another 
under conditions of free choice and thus concludes that he must find the 
chosen alternative significantly more desirable than the rejected alternative. 
The interpersonal simulation involves observer-subjects (O-Ss) who are 
given descriptions of a dissonance experiment and then are asked to estimate 
the participating subjects’ (P-Ss’) ratings. As Bem predicted, the O-Ss were 
able to replicate the results of the original dissonance experiment. Bem concludes 
that these interpersonal simulations demonstrate that internal constructs, 
such as cognitive dissonance, are not necessary to explain the results 
of dissonance experiments. 

Jones et al. (5) criticized Bem’s interpersonal simulation on the grounds 
that he did not give the O-Ss any information regarding the P-Ss’ premanipulation 
attitudes. Jones et al. performed a series of interpersonal simulations 
and found that O-Ss did not replicate the original dissonance results 
when they were given clues as to the P-Ss’ premanipulation attitudes. Bem 
(2) replied to these critics by arguing that the interpersonal simulation is 
like a computer simulation in that if the correct input statements are used, the 
correct output statements will be obtained; but if there is incorrect input, the 
output will also be incorrect. He suggests that the Jones et al. results indicate 
they plugged in the wrong input statement—namely, information about the 
P-Ss’ premanipulation attitudes. 

The main thesis of this paper is that the self-perception hypothesis is not 
adequate to explain all cognitive dissonance results. In the experiment to be 
reported below, Bem’s assumption of the nonsalience of initial beliefs is 
adopted, and such information is not included in the interpersonal simulation 
descriptions of the Jecker conflict-experiment. 

In an experiment to rule out an alternative explanation of dissonance 
results derived from conflict theory, Jecker (4, pp. 21-30) performed a 
“free-choice” experiment in which he varied conflict and dissonance independently 
of each other. Thus in one condition P-Ss were told that they had to choose 
one of two records, but that there was a high probability (19 in 20) that they 
would receive both records (low conflict). In another condition P-Ss were 
informed that they had to choose between two records, but that there was a 
very small possibility (one in 20) that they would receive both (high con- 
flict). It was reasoned that P-Ss who were likely to receive only the record 
they chose would experience more predecision conflict than those P-Ss who 
were likely to receive both regardless of their choice. Cutting across these 
conditions was a “dissonance” condition in which P-Ss received only one 
record and the “no dissonance” condition in which P-Ss received both records. 
The P-Ss were then asked to rerate all the records after the consequences were 
known. The results showed a significant dissonance effect, but the conflict 
effect was not significant. 

In order to rule out the possibility that the conflict resolution process 
reverses itself after the consequences of the decision are certain, Jecker added 
some new conditions to his experimental design. Two new high conflict (one 
in 20) and low conflict (19 in 20) conditions were added, but they differed 
from the others in that P-Ss were asked to rerate the records after they had 
made a decision but before they knew whether they would receive one or both 
of the records. It was reasoned that if there was a conflict effect, it should be 
obtained from these P-Ss, since a reversal should not take place until after 
the consequences of the decision are certain. In an “ordinary decision” condition 
each P-S knew he would receive only one record, and no mention was 
ever made of the possibility of receiving both records. The results showed that 
only in the “ordinary decision” condition did the P-S re-evaluate the alterna- 
tives in a dissonance-reducing direction. There was no conflict effect. 

The results to be emphasized for the purpose of this paper are that within 
the dissonance conditions both high and low conflict conditions produced the 
dissonance effect only when the consequences of the decision were known 
prior to the second rating. In the conditions in which the consequences of the 
decision had not been settled, the dissonance effect was absent. It is maintained 
here that according to Bem’s self-perception theory the dissonance effect should 
be obtained even in the conditions in which the consequences are unknown. In 
these conditions the P-Ss engaged in choice behavior that was observable. 
This behavior was engaged in under conditions free of threat or other induce- 
ments that might have produced a judgment of self-noncredibility (manding). 
If this reasoning is correct, then Jecker’s results suggest that the self-perception 
hypothesis cannot adequately account for the results of this cognitive disson- 
ance experiment. If, on the other hand, O-Ss in an “interpersonal simulation” 
of those conditions of Jecker’s experiment are able to reproduce the results of 
this experiment, it would suggest the possibility that the reasoning herein 
about Bem’s hypothesis is incorrect. Therefore an “interpersonal simulation” 
of the Jecker experiment is described below. According to the derivation from 
the self-perception hypothesis that O-Ss should be able to reproduce the results 
of experiments in which self-descriptive statements are the dependent variable, 
it is predicted that in an “interpersonal simulation” of the Jecker experiment, 
the “spreading apart” effect will be greater in conditions in which the con- 
sequence of the decision is known prior to the second rating (before condition) 
than in the conditions in which such knowledge is acquired after the second 
rating (after condition). 


B. Method 
1. Experimental Design 


A 2 × 2 factorial design was employed testing the effects of the conflict 
conditions (high conflict = one in 20 vs. low conflict = 19 in 20) and the 
knowledge of consequences condition (knowledge before rerating vs. knowledge 
after rerating) on the dependent variable, the difference scores. The 
difference scores were computed by subtracting the rating of record C (un- 
selected) from the rating of record F (selected). Then a constant of 2 was 
added to all difference scores to eliminate any negative scores. 
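
The scoring rule just described can be written out as a brief sketch. The ratings below are hypothetical values on the zero-to-6 scale, used only to show the arithmetic; the function name is likewise introduced here.

```python
def difference_score(rating_selected, rating_unselected, constant=2):
    """Rating of the selected record minus that of the unselected record, plus a constant."""
    return rating_selected - rating_unselected + constant

# Hypothetical O-S estimates, as (record F, record C) pairs.
estimates = [(5, 3), (4, 4), (3, 4)]
scores = [difference_score(f, c) for f, c in estimates]
print(scores)
```

A constant of 2 removes negative scores only when no raw difference falls below -2, which the description above implies was the case.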


2. Subjects 


The O-Ss for this experiment were 40 undergraduate female students 
attending nursing class and physical therapy class at the University of Florida. 
These subjects were selected from a larger sample of 48 students. In order to 
obtain equal Ns in each condition the extra data were discarded randomly 
(three in the high conflict-after condition, one in the high conflict-before 
condition, four in the low conflict-after condition) until the data from only 
10 O-Ss remained in each condition. 


3. Materials 


Four descriptions of the Jecker experiment, corresponding to the four 
conditions to be replicated, were employed. Two conditions were selected 
from each of the Jecker experiments described above. In one description the 
chance of receiving two records was one in 20, and the P-S had knowledge 
that she received only one record before the second rating (high conflict-before 
condition). In another condition the probability was 19 in 20, and again the 
P-S had knowledge that she was to receive only one record before the second 
rating was made (low conflict-before condition). The remaining two conditions 
corresponded to these conditions except that the P-S had no knowledge of 
whether she received one or two records before the second rating was made 
(high and low conflict-after conditions). The high conflict-before description 
is reproduced below: 


The following is a description of an experiment in which a student 
much like yourself participated. Please read the description very care- 
fully. After you finish reading the description, you will be asked to 
answer some questions about what you have read. It is necessary for you 
to read the description below carefully so you can give accurate reports 
to the questions that will follow the description. 

A student, chosen at random, came to the room used for the experiment. 
A graduate student in the Department of Psychology introduced himself 
and explained that he was doing some market research in order to collect 
information about how people were reacting to the popular phonograph 
records currently on the market. He also told the student that he was 
giving away gift records to each student in exchange for their time spent 
helping in the research. 


8 Reprinted with permission of the publishers from “The cognitive effects of conflict 
and dissonance,” by J. D. Jecker (4). In L. Festinger, et 

The student was then shown 15 records and asked to indicate whether 
or not she was familiar with each one, and which of them, if any, she 
already owned. After this, the subject was asked to rate each of the 15 
records on a seven-point scale according to how much she would like to 
have it. On the scale, zero was labeled “don’t care,” and 6 was described 
as “would rather have this record than any other record I can think of 
right now.” 

When the ratings had been completed, the subject was told by the 
researcher that although he had planned to give each student just one 
record as a gift, he now found that he had extra records. Rather than 
return them to the record company, he would give two records to one of 
every 20 students who participated in the research. Whether she was one 
of the students to obtain one or two, the student was told, would be deter- 
mined by chance. The student would draw a slip of paper out of a box 
on the table to decide. The student was told that the box contained 19 
slips marked “1” and one slip marked “2.” The “1” meant that the student 
would receive one record, and the “2” meant that the student would 
receive two records. 

Before the student drew the slip of paper from the box, the researcher 
showed her two of the records she had previously rated and told her that 
they would be the two she received if she were to get two records, and 
asked her to indicate which of the two she wanted in case she were to 
receive only one of them. Of course, any record the student already owned 
was not used for the choice. Records C and F were presented to the stu- 
dent, and she chose F. 

At this point, a slip of paper was drawn by the subject. It indicated 
that the subject was to receive one record only. The experimenter gave 
the subject record F and asked her to wait a bit while he made some notes. 
After about two minutes, the experimenter told the student that he was 
interested in a person’s second thought about the phonograph records. 
He asked the student to rate each of the 15 records again. The experi- 
menter pointed out that after thinking about them, her ratings might or 
might not have changed. This did not matter. The student was to rate them 
as she felt about them at this point. 

The scales shown below were used to make the ratings. From this 
description, estimate as well as you can the actual ratings of the student. 
Indicate your estimate by drawing a line through the appropriate point 
on the scales. [Two seven-point scales followed, one for the evaluation of 
each of the two relevant records.] 


The descriptions of all of the other conditions are identical except for 
details about the likelihood of obtaining two records and details concerning 
knowledge of the consequence of the decisions. In the 19 in 20 condition, 
the fourth paragraph was revised so that “19” [of every 20 students] was 
substituted for the word “one,” and the numbers “1” and “2” were reversed 
in the next to last sentence. In the after conditions, the second to last paragraph 
of the before conditions was replaced by the paragraph that follows: 

Then the researcher asked the student to rate each of the 15 records 
again. He pointed out that after thinking about them, her ratings might 
or might not have changed. This did not matter. The student was to rate 
them as she felt about them at this point. 


4. Procedure 


Descriptions representing the four conditions of the experiment (one in 
20-before, 19 in 20-before, one in 20-after, 19 in 20-after) were distributed 
randomly during a nursing class and during a physical therapy class. The E 
instructed the O-Ss to mark the scale on the vertical lines and not between 
these lines. 


C. Results 


Table 1 shows the mean for each condition. The before-after effect was 
not significant (F = .18, df = 1/36, p > .05). The conflict effect (one in 
20 vs. 19 in 20) was significant (F = 5.31, df = 1/36, p < .05). 
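
F ratios with df = 1/36 are what a balanced 2 × 2 between-subjects analysis of variance with 10 scores per cell yields. The following is a minimal sketch of that computation, not the author's own analysis; the function name and the data layout are assumptions made for illustration.

```python
from statistics import mean

def anova_2x2(cells):
    """F ratios for a balanced 2 x 2 between-subjects design.

    `cells[(a, b)]` holds the scores for level a of the first factor and
    level b of the second; every cell must contain the same number of scores.
    """
    n = len(cells[(0, 0)])
    if any(len(v) != n for v in cells.values()):
        raise ValueError("balanced design required")
    cell_mean = {k: mean(v) for k, v in cells.items()}
    grand = mean(list(cell_mean.values()))
    a_mean = [mean([cell_mean[(a, 0)], cell_mean[(a, 1)]]) for a in (0, 1)]
    b_mean = [mean([cell_mean[(0, b)], cell_mean[(1, b)]]) for b in (0, 1)]
    # Sums of squares for the two main effects and their interaction.
    ss_a = 2 * n * sum((m - grand) ** 2 for m in a_mean)
    ss_b = 2 * n * sum((m - grand) ** 2 for m in b_mean)
    ss_ab = n * sum(
        (cell_mean[(a, b)] - a_mean[a] - b_mean[b] + grand) ** 2
        for a in (0, 1) for b in (0, 1)
    )
    # Within-cell (error) sum of squares, with 4(n - 1) degrees of freedom.
    ss_error = sum((x - cell_mean[k]) ** 2 for k, v in cells.items() for x in v)
    df_error = 4 * (n - 1)
    ms_error = ss_error / df_error
    return {
        "F_a": ss_a / ms_error,
        "F_b": ss_b / ms_error,
        "F_ab": ss_ab / ms_error,
        "df": (1, df_error),
    }
```

Each effect has one degree of freedom, so each mean square equals its sum of squares; with 10 scores in each of the four cells the error term has 36 degrees of freedom.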


TABLE 1 
RECORD RATING DIFFERENCE SCORE MEANS 

Degree of conflict    High conflict    Low conflict 
Before    4.0 
After    4.4    [illegible] 


D. Discussion 


Since this “simulation” was designed to conform with Bem’s assumption 
about the P-S’s lack of knowledge of his own attitudes, the failure of the 
O-Ss to reproduce the results obtained in the original experimental conditions 
demonstrates that the self-perception hypothesis is at best of limited adequacy 
in explaining the results of dissonance experiments. 

If self-perception theory were correct, the Jecker experiment should not 
have produced differences in the dependent variable between the before and 
after conditions because subjects in both these conditions had observed exactly 
the same behavior under identical conditions and thus should have made the 
same mand-tact discriminations. Since Jecker did obtain a significant differ- 
ence between the before and after conditions, these results create difficulties 
for self-perception theory. Perhaps the self-perception theory advocates might 
argue that this is an incorrect analysis of the theory and that, in fact, self- 
perception theory would have also predicted a significant difference between 
the before and after conditions. However, in order for this argument to be a 
viable one, O-Ss in an interpersonal simulation of the Jecker experiment 
should reproduce the results of the original experiment. The present experi- 
ment was such an interpersonal simulation, but the O-Ss did not reproduce 
the Jecker results. Thus the Jecker experiment poses an inescapable dilemma 
for Bem’s theory. These experiments cast serious doubt on the capacity of self- 
perception theory to explain all self-descriptive behavior. 


E. Summary 


Whereas cognitive dissonance theory explains the results of “free-choice” 
experiments in terms of an aversive drive produced by cognitive inconsistency, 
Bem attempts to explain the results by means of a self-perception hypothesis 
which maintains that persons make judgments about their own beliefs in 
the same manner that they make judgments about another’s beliefs—namely, 
on the basis of observed behavior and its apparent controlling conditions 
(mand-tact discriminations). Bem developed the “interpersonal-simulation” 
experiment to test his hypothesis. The present experiment applied the inter- 
personal simulation technique to another free-choice experiment, the Jecker 
conflict experiment. The results did not support Bem’s hypothesis. Thus the 
results of this experiment and the original Jecker experiment catch self-per- 
ception theory on both horns of a dilemma and cast serious doubt on the 
adequacy of this theory to account for dissonance phenomena. 


VOICE PRODUCTION AS A FUNCTION OF HEAD BALANCE 
IN SINGERS* 1 


Tufts University 


Frank Pierce Jones 


Elsewhere (4) I have described an empirical method for redistributing 
postural tonus by changing the reflex balance of the subject’s head while he 
carries out some everyday movement against gravity. The tonic effect persists 
into subsequent movements and postures and is reported by the subject as a 
decrease in the feeling of weight and an increased feeling of competence and 
ease of movement. The effect has been demonstrated with multiple-image 
photography (3, 6, 8), electromyography (7), and force/time records obtained 
from a strain-gauge force platform (5). 

The procedure for changing the reflex balance of the head during a learned 
activity was developed by the late F. M. Alexander (1, 2) as a method for 
dealing with vocal strain. The pupil is trained to recognize and inhibit an 
increase in neck-muscle tension which tends to fix the position of the head by 
decreasing the distance between occiput and seventh cervical vertebra. When 
he is able to maintain this inhibition during speaking or singing, he can exer- 
cise an indirect control which facilitates voice production. The effect can be 
demonstrated experimentally by counterbalancing habitual head displacement 
by manual pressure about the occiput sufficient to prevent muscles attached to 
the head from shortening. If the pressure is maintained while the subject is 
projecting his voice, the neck can be observed to lengthen and the resonance 
of the voice to increase. A measure of the difference can be obtained by re- 
cording a singer’s voice under the two conditions, habitual and experimental, 
and taking “sound spectrograms” from comparable sections of the record. A 
sound spectrogram or “voiceprint” is the visible record of a sound pattern 
presented in such a way that “the variations of the vocal resonances with time 
are displayed conspicuously” (9, p. 4). 

The method for changing head balance during singing is illustrated in 
Figures 1 and 2. The subject was a music student who had had some previous 
training in kinesthetic perception. In Figure 1 she is singing in her habitual 
way. In Figure 2 she is repeating the song while the experimenter alters the 
balance of her head by the method described above. During the two procedures 
the subject’s voice was recorded with a Craig Cassette Recorder. 


FIGURE 1 
SUBJECT SINGING WITH HABITUAL HEAD BALANCE 

FIGURE 2 
SUBJECT SINGING WITH HEAD BALANCE ALTERED BY THE EXPERIMENTER 


* Received in the Editorial Office on July 10, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 

The subject reported that she sang more easily, with greater resonance, and 
with better breath control under the experimental condition. Other musicians, 
who listened to the tape, confirmed the subject’s judgment of the difference 
in the quality of her singing. 

The tape was sent to the Voiceprint Laboratory in Somerville, N. J., 
where voice prints were made of three-second samples taken from comparable 
sections of the two records. The sections were chosen to include the end of 
one phrase and the beginning of the next, together with the interval during 
which a breath was taken. The two “wide-band” records reproduced in Fig- 
ures 3 and 4 show the increased richness of the overtones and the virtual 
disappearance of breathing sounds under the experimental conditions. 

This method of changing voice production by facilitating a postural reflex 
has been successfully used with speakers as well as singers. It is valuable as 
a teaching device because it provides immediate reinforcement and raises the 
pupil’s standard of performance by means of his own conscious experience. 


Summary 


Sound spectrograms (“voiceprints”) were used to study the effect of 
differences in head balance on vocal resonance in singing. 


ROTATIONS ON WPPSI GEOMETRIC AND BLOCK 
DESIGN SUBTESTS AT FOUR, FIVE, 
AND SIX YEARS*1 


School of Medicine, University of Louisville 


Adam P. Matheny, Jr., Anne M. Brown, and Ronald S. Wilson 


A. Introduction 


The Wechsler Preschool and Primary Scale of Intelligence (WPPSI) 
was introduced (3) as an extension of the WISC, appropriate for children 
between the ages of four and six and one-half years. Although the reliability 
of the subtests and the validity of the WPPSI as a whole have been shown 
to be comparable to the WISC, little other information on specific aspects of 
the WPPSI subtests has been published. 

Of particular interest to the clinician are two subtests—Geometric Design 
and Block Design—which, during reproduction by drawing or construction, 
may elicit rotations. Wechsler (3, p. 69) points out that even though scores 
are not penalized by rotations on the Geometric Design subtest, “. . . all 
rotations of 45° or more should be noted . . . as rotations may be clinically 
interesting.” Rotations on the last six of the 10 Block Designs are scored as 
failures, and therefore do enter into scoring the child’s performance. There 
are, however, no normative data available on age-related errors of orientation 
with which the clinician can compare the results of a child’s performance on 
the WPPSI. 

Aside from the normative aspects of the frequency of rotations found for 
each age, the items that account for the rotations would be interesting in view 
of other research on perceptual orientation. Ghent (1), in a study of chil- 
dren’s preferred orientations of geometric figures, found that some figures 
elicited consistent preferences for particular orientations. For example, among 
children aged four to eight years, all of the children at each age said that an 
inverted T was “upside down.” Ghent hypothesized that children naturally 
scan downward, and the preferred orientations were due to an interaction 
between the scanning direction and the focal point of the figure. When the 
focal point of the figure was at the top, it was congruent with a downward 
scanning pattern and the figure was said to be correctly oriented. When the 
focal point was at the bottom, children started scanning at that point and 
thereby the scanpath was counter to the normal tendency. Consequently, a 
figure in that orientation was called “upside down” by the children. In view 
of Ghent’s research some items from the Geometric Design and Block Design 
subtests should elicit more rotations than others because the orientation of 
the test model is contrary to the preferred orientation. 

The purpose of the present investigation was twofold: (a) to provide base- 
line data on the frequency of rotations found for each item on the Geometric 
Design and Block Design subtests given at ages four, five, and six years; 
and (b) to see if the characteristics of items frequently rotated correspond 
to characteristics outlined by Ghent’s hypothesis. 


B. Method 
1. Subjects 


The Ss were drawn from a population of white twin and singleton children 
participating in the Louisville Twin Study. There were WPPSI tests avail- 
able for 182 twins and 30 singletons at age four years, 227 twins and 47 
singletons at age five years, and 228 twins and 68 singletons at age six years. 
The average age of the children at the time of testing was within one month 
of their birthdays. Females and males were almost equally represented at 
each age. 


2. Analysis of Rotations 


The Geometric Design and Block Design subtests were examined to deter- 
mine the total number of rotations for each item, even if, in the case of some 
Block Designs, the rotation caused the item to be failed. Designs were scored 
as rotated if the reproduction was rotated more than 45° from the center 
axis of the design. In accordance with the test instructions, reversal of colors 
on the Block Design subtest was not counted as rotation. 

Interscorer agreement on rotations was ascertained by randomly selecting 
tests from 30 children and having two scorers rescore the subtests. On the 
Geometric Design subtest, the level of agreement surpassed 99 percent, and 
on the Block Design subtest, there was no disagreement. The slight disagreement 
between scorers for the Geometric Design subtest involved 
decisions on 90° rotations for item 5. 
C. Results


Preliminary analyses revealed no systematic differences between twins and 
singletons or males and females for the frequency of rotations at the three 
ages. Because of the lack of differences the groups were pooled by age. Mean 
scaled scores for Geometric Design, Block Design, and total Performance 
subtests did not differ significantly from those of the standardization population.
Frequencies of rotations could not be compared, as these data have not
been reported for the standardization groups. 

The percentages of rotations on each item of the Geometric Design subtest 
are presented in Table 1. The percentages indicate that some items (for example,
item 2, an inverted T) elicited considerably more rotations than
others, and most rotations seemed to occur at five years. 

The percentages of rotations on each item of the Block Design subtest are
also presented in Table 1. Here again, some items elicited more rotations 
than others, and the five-year-old children generally made more rotations 
than the four- and six-year-olds. 


TABLE 1
Percentage of Rotations on the WPPSI Geometric Design and Block
Design Subtests at Four to Six Years

              Four years            Five years            Six years
Item no.    N    % rotations(a)   N    % rotations(a)   N    % rotations(a)

WPPSI Geometric Design subtest
 1         212      —            274      —            296      —
 2         212     24.5          274     36.1          296     20.9
 3         207     15.9          274     20.4          296     15.9
 4         187      —            273      —            296      —
 5         133      3.8          264      6.8          294      4.4
 6          76     14.5          234     10.7          294      2.7
 7          55      9.1          225     14.7          291      5.8
 8          33      —            191      3.5          284      —
 9          12      —            112   [illegible]     260      —
10           8      —             82      6.1          232       .4

WPPSI Block Design subtest
 1         212     11.3          274      5.1          296      —
 2         212      1.4          274       .4          296      —
 3         198     22.2          274     30.6          296     31.8
 4         189      1.6          274      3.6          296      4.0
 5         117      2.7          237      7.2          286      5.6
 6          80      1.2          214      7.0          283      7.4
 7          40      —            182      2.7          276       .4
 8          15      —            144      2.8          264      2.3
 9          15   [illegible]     136      2.2          259       .8
10           7   [illegible]      99      4.0          234   [illegible]

(a) Includes all rotations regardless of item score.
D. Discussion


The results from this study indicate that two WPPSI subtests which in- 
volve perceptual and visual-motor organization show variations in rotations 
as a function of chronological age and the specific designs being reproduced. 
Rotations from both subtests generally peaked at five years, a finding that is 
puzzling and deserves replication. 


Aside from the normative aspects of the frequency of rotations, the designs 
accounting for the rotations are interesting in view of Ghent’s research on 
orientations. On the Geometric Design subtest, the inverted T accounted for 
most rotations. Ghent (1) found that the inverted T was considered to be 
upside down, and it is possible that the “natural” position of the T compels 
some children to rotate the figure, regardless of task instructions and lack 
of agreement with the model. This is not to say that the children cannot 
correctly perceive the orientation of the model, but that they do not reproduce 
it. As Wohlwill and Wiener (4) have found, preschoolers given a matching-from-sample
task have little difficulty in discriminating figures on the basis of
orientation.


The rotations found on item 7 of the Geometric Design subtest also might 
be consistent with a hypothesis concerning preferred orientation. The design 
consists of a circle and a triangle aligned horizontally, with the triangle tilted 
enough so that a vertex of the triangle touches the circle. When the triangle 
is rotated so that the circle touches a side rather than a vertex of the triangle, 
the design is counted as a failure as well as a rotation. For the three ages, 
this latter type of reproduction accounted for almost 75 percent of the rotations.

On the Block Design subtest, a triangular figure (item 3) accounts for
most rotations. This two-block design, the only one for which the model
has a vertical orientation, has the apex of the triangle to the child’s right.
The rotated reproduction typically consists of the blocks being placed horizontally
with the base of the triangle at the bottom. Since Ghent, as well as


a 
Much has been made of the ability of young children to develop increasingly
veridical perception of forms and their orientations as a correlative
of developmental adequacy in spatial reproductions. For example, the relation 
between children’s competency in reading and their general level of com- 
petency on visual-motor tasks is well known and need not be referenced here. 
Less known are the normative aspects of visual-motor development as found 
on age-specific items from our diagnostic instruments, and against which we 
should compare the individual’s performance before making diagnostic judg- 
ments and therapeutic decisions. Regardless of whether an examiner has a 
predisposition for clinical or statistical approaches to analyzing psychometric 
data, he may feel more secure in using item characteristics when more baseline 
information about those items is available. 


E. Summary 


Percentages of rotations (greater than 45°) for all items of the Geometric Design
and Block Design subtests of the Wechsler Preschool and Primary Scale of
Intelligence (WPPSI) are reported for a sample of 212 four-year-olds, 274
five-year-olds, and 296 six-year-olds. Rotations occurred most frequently for 
Geometric Design item 2 (an inverted T) and Block Design item 3 (a two- 
block vertical arrangement). There were higher percentages of rotation at 
age five than at age four or six. The findings are discussed in terms of (a) 
baseline norms for the clinician’s use of item characteristics in making diagnostic
judgments and therapeutic decisions, and (b) theoretical importance
for research in perceptual orientation, particularly Ghent’s hypothesis on the 
interaction of focal points and scanning direction as a basis for children’s 
preferred orientations of figures. 
FACTORS AFFECTING INDIVIDUAL
RISK TAKING ATTITUDES* 


School of Business and Public Administration, 
University of Missouri-Columbia 


Earl A. Cecil


A. Introduction


During the last decade considerable research has been directed toward a 
comparison of individual vs. group decisions with respect to the level of risk 
individuals and groups will assume. Much of this research was initiated as 
the result of a Master’s thesis by Stoner (2) which reported that groups 
make riskier decisions than individuals. Prior to Stoner’s study, it was gen- 
erally assumed that groups would be more conservative than individuals. 
A majority of the research since Stoner’s study has been directed toward pro- 
viding an explanation of the often reported results that groups make riskier 
decisions than individuals. Far less attention has been directed toward the 
factors that may influence individual risk taking attitudes. If one is to explain 
the relationship between group and individual risk taking, more research is 
needed in the area of individual risk taking. Two studies have been reported
in which the effect of age on individual risk taking was examined. Vroom
and Pahl (3) and Wallach and Kogan (4) both report that younger subjects
were more risky than older subjects.

Since age influences individual risk taking attitudes, it is reasonable to 
assume that other variables may also influence these attitudes. This study 
reports the findings of the significance of selected variables other than age. 


B. Method
1. Subjects 
The subjects were 234 male and female students enrolled in the School 
of Business and Public Administration, University of Missouri-Columbia. 
2. Instruments 
The instrument used for assessing individual levels of risk taking was the
Kogan and Wallach (1) Choice-Dilemmas Questionnaire, which consists of
descriptions of 12 hypothetical situations, each of which involves a choice
of either a rather safe course of action with a certain outcome or a riskier 
course of action with a more desirable outcome. The situations cover a wide 
range of content that an individual is likely to encounter in a real life situa- 
tion. For each of these situations the subject is asked to indicate the lowest 
probability of success he would accept before recommending the more desir- 
able (most risky) alternative. Responses consist of one, three, five, seven, 
and nine chances out of 10 that the more desirable alternative would be 
successful. If a subject would not choose the more desirable alternative re- 
gardless of the probability of success, his response was scored as 10. 
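
The response coding described above can be sketched as follows. This is a minimal illustration, assuming that a subject's overall score is the sum of the 12 item scores; the summation, the use of `None` to represent a refusal at any odds, and the function names are assumptions for illustration and are not stated in the text.

```python
VALID_ODDS = {1, 3, 5, 7, 9}   # chances in 10 that the risky alternative succeeds
REFUSAL_SCORE = 10             # subject would not choose the risky alternative at any odds
N_ITEMS = 12                   # hypothetical situations in the questionnaire


def score_item(response):
    """Score one item: the lowest acceptable odds, or 10 for a refusal (None)."""
    if response is None:
        return REFUSAL_SCORE
    if response not in VALID_ODDS:
        raise ValueError(f"invalid response: {response!r}")
    return response


def total_score(responses):
    """Sum the item scores (assumed aggregation); lower totals mean greater risk taking."""
    if len(responses) != N_ITEMS:
        raise ValueError("expected one response per item")
    return sum(score_item(r) for r in responses)
```

Under this assumed aggregation, totals range from 12 (the riskiest possible pattern) to 120 (refusal on every item).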


3. Procedure 


A transparency of each of the 12 items was projected on a screen during a 
regularly scheduled class period. Subjects were provided with an answer sheet 
on which they were to mark their responses after standard instructions were 
read to them. This procedure was slightly different from the usual procedure 
in which each subject is presented with a copy of the Choice Dilemmas Ques- 
tionnaire, and his response is written on the instrument itself. 

Upon completion of this part of the research, data were collected from 
each subject with respect to seven variables that could influence individual 
risk taking attitudes, as follows: (a) sex, (b) class standing, (c) declared 
major, (d) birth order of subject, (e) occupation of head of house, (f) size 
of home town, (g) family income. These variables were then checked to
determine if any of them was significantly related to differences in risk taking
propensity.


C. Results


Data were analyzed by means of a Z test for significant differences
in mean risk taking scores, with risk taking propensity being the dependent
variable and the above seven factors the independent variables. An analysis 
of variance was also performed. 
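
A comparison of two group means by a Z test can be sketched as below. The text does not give the formula or the group statistics used, so this is a generic large-sample two-group Z statistic with a two-tailed normal p value; the function names and the numbers in the usage note are hypothetical.

```python
import math


def two_sample_z(mean1, sd1, n1, mean2, sd2, n2):
    """Z statistic for the difference between two independent group means."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error


def two_tailed_p(z):
    """Two-tailed p value under the standard normal distribution."""
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))
```

For example, with invented group means of 70 and 64, a standard deviation of 12 in each group, and 36 subjects per group, `two_sample_z(70, 12, 36, 64, 12, 36)` is about 2.12, which would fall beyond the conventional .05 two-tailed criterion of 1.96.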

The results can be summarized as follows: (a) There was no significant
difference in risk taking propensity between male and female subjects. (b)
Sophomores were more risky than juniors (p < .05). (c) Sophomores were
more risky than seniors (p < .01). (d) Sophomores were more risky than
graduate students (p < .05). (e) Marketing majors were more conservative
than students with majors outside the area of business and public administration
(p < .05). (f) There was no significant difference in risk taking
propensity with respect to birth order. (g) There was no significant difference
in risk taking propensity with respect to occupation of the head of
household. (h) Students classified as rural were more conservative than those 
from communities of 5000 to 10,000 population (p < .05). (i) Students 
from families with less than $5000 annual income were more risky than stu- 
dents from families with $5000 to $10,000 annual income (p < .05). (j) 
Students from families with less than $5000 annual income were more risky 
than students from families with $20,000 to $35,000 annual income (p < 
.05). (k) Students from families with over $35,000 annual income were
more risky than students from families with $20,000 to $35,000 annual in- 
come (p < .05). 


D. Summary 


With use of the Choice Dilemmas Questionnaire to measure risk taking 
attitudes, seven variables that could influence individual risk taking were 
related to scores on the risk taking measurement instrument. Class standing, 
major, family income, and rural home environment were found to produce 
significant differences in individual risk taking attitudes. There were no 
significant differences with respect to sex, occupation of head of household, 
and birth order. 

The results of this study combined with the results of the Vroom and 
Pahl (3) study strongly suggest the importance of investigations designed 
to identify biographical variables that significantly influence risk taking at- 
titudes. Further research should be conducted to determine how these vari- 
ables, as well as others, combine to influence risk taking attitudes. 
ANAGRAM DIFFICULTY AS A FUNCTION OF
PRONOUNCEABILITY AND FAMILIARITY* 


Augusta College 


Ed M. Edmonds


A. Introduction


A number of studies (e.g., 1, 2) have indicated that the difficulty of 
solving an anagram increases as the anagram becomes easier to pronounce. 
However, Dominowski (1) has suggested that pronunciation practice may 
affect anagram difficulty by increasing the familiarity of the anagrams instead 
of making them easier to pronounce. The present study used a training procedure
designed to alter anagram familiarity while minimally affecting pronunciation
speed. This training procedure (spelling practice) was then compared
with pronunciation training in an attempt to separate the effects of
these two variables on anagram difficulty.


B. Method
1. Subjects 


The Ss were 40 undergraduates enrolled in psychology courses at Augusta 
College. They were randomly assigned to two groups, which are designated 
Group SP (spelling practice) and Group PRO (pronunciation practice). 



2. Procedure 


Prior to anagram solving, both groups viewed six different random orders 
of the same 15 anagrams. Each of the anagrams had only one solution, and
the anagram letter orders were selected from the easy orders used in a study
by Mayzner and Tresselt (3). Group PRO was told that they would be shown
lists of nonsense syllables and, for each list, they would be asked to pronounce 
aloud each item in the list. Group SP was given the same instructions for 
lists 1 and 6, but was instructed to spell rather than to pronounce each item 
in lists 2 through 5. Both groups were asked to work as rapidly as possible. 

After completing the sixth list, each S received a booklet which contained
one anagram per page and was instructed to arrange the letters on each page
to make one English word. If S failed to solve an anagram within 40 seconds,
E gave S the solution and proceeded to the next page of the booklet.


C. Results and Discussion


On the first anagram list, the mean reading times for Groups SP and 
PRO were 28.1 seconds and 27.9 seconds, respectively. On the sixth anagram 
list, the mean reading times for Groups SP and PRO were 25.2 and 16.4 
seconds (F = 12.3, df = 1, 38, p < .01), respectively. These results indicate
that prior to anagram solving the two groups were equated in terms of 
familiarity with the anagrams, but differed significantly with respect to 
pronunciation speed. To determine which of these variables was most relevant 
to anagram difficulty, the anagram performance of the two groups was com- 
pared. By use of the number of anagrams solved correctly as the dependent 
variable, the mean number of solutions for Group PRO (5.3 solutions) was 
found to be significantly lower (t = 4.3, df = 38, p < .01) than for Group
SP (9.2 solutions).

These findings support the conclusion that the difficulty of solving an 
anagram is influenced primarily by pronunciation ease rather than by fa- 
miliarity. 


D. Summary


To separate the effects of pronounceability and familiarity on anagram 
difficulty, a training procedure (spelling practice) was used which altered 
familiarity while minimally affecting pronunciation speed. This procedure 
was then compared with pronunciation practice. The results indicated that 
anagram difficulty is a function of pronounceability rather than familiarity. 


REFERENCES 


1. Dominowski, R. L. The effect of pronunciation practice on anagram difficulty.
   [citation details illegible]

2. Hebert, J. A., & Rogers, C. A., Jr. Anagram solution as a function of pronounceability
   and difficulty. Psychon. Sci., [year, volume, and pages illegible]

3. Mayzner, M. S., & Tresselt, M. E. Anagram solution times: A function of letter
   order and word frequencies. J. Exper. Psychol., 1958, 56, [pages illegible]


Psychology Department 
Augusta College 
Augusta, Georgia 30904 


Published as a separate and in The Journal of Psychology, 1972, 82, 229-231. 


STAFF AND RESIDENT ATTITUDES IN TWO CHILD CARE 
INSTITUTIONS OBSERVED, RESPECTIVELY, AS 
DEFENSIVE AND PARTICIPATIVE IN THEIR 
MANAGEMENT PRACTICES*! 


Memphis State University 


Richard C. Maddock and Charles T. Kenny


A. Introduction

Writing about practices in industrial organizations, Gibb (1) has pre- 
sented a theory that distinguishes between defensive and participative ap- 
proaches to management. The defensive approach engages in facade formation 
and imposes heavy controls on those who are managed, whereas the participative
approach is more democratic, less controlling, and less rigid. The purpose
of the present study was to determine if Gibb’s theory could be extended
from industrial organizations to child care institutions. Two such institutions
were selected which seemed from preliminary observations to typify Gibb’s
two approaches to management. Interviews with staff and residents failed to
uncover obvious differences that would support these initial observations. Hence,
it was decided to employ standard attitude measurement techniques. It was 
hypothesized that these techniques would reveal differences between the two 
institutions in terms of the degree of defensiveness of staff and residents’ 
attitudes toward one another and toward the institution. 


B. Procedure
1. Subjects 


Two female child care institutions were investigated. One, fictitiously 
named Harwood, housed approximately 30 dependent and neglected female 
residents and appeared rather defensive in its approach to management. The 
other, Thrasher, housed 51 dependent and neglected female residents. 
Thrasher appeared much less defensive than Harwood. Only the adolescent 
residents were included in this study: 13 from Thrasher and 11 from Harwood.
Five staff members from Harwood and seven from Thrasher were
included.
2. Staff Attitudes


To study staff attitudes at each institution, two scales were adapted and 
reworked from those suggested by Fox et al. (2).

a. Behavior Change Checklist. Staff indicated their attitudes about the 
desirability and frequencies of various behavioral traits of residents on this 
instrument, which consisted of five response alternatives for each of a series 
of 12 items. 

b. Defensive-Participative Management Scale. A five point Likert format 
was employed to measure extent of agreement with statements structured 
around the basic aspects of Gibb’s theory. 


3. Residents’ Attitudes 


Two scales were used to study attitudes of residents in both institutions, as 
follows: 

a. Behavior Change Checklist. Residents completed a modified version of
this instrument, indicating the staff behavior that they preferred. 

b. Sentence Completion Form. Residents also completed this form, con- 
sisting of 26 items. Scores of 1 to 5 were given, depending upon the degree 
of defensiveness characterizing each completion. Interjudge reliability be- 
tween the senior author and another judge was .83. 


C. Results


The results confirmed the hypothesis. The staff at Harwood, the institu- 
tion judged more defensive from preliminary observations, achieved higher 
(i.e., more defensive) scores than the staff at Thrasher on both the Behavior 
Change Checklist (M = 36.8 vs. 28.0; p < .05) and the Defensive-Participative
Management Scale (M = 79.2 vs. 61.8; p < .01).

The residents at Harwood also received higher and hence more defensive
scores than those at Thrasher on the Checklist (M = 41.09 vs. 33.23; p <
[illegible]) and on the Sentence Completion Form (M = 99.33 vs. 67.54;
[p value illegible]).


D. Discussion


Although initial observations presented some clues concerning facade formation,
interviews with staff and residents revealed no detectable differences.
The use of attitude measures, however, supported the hypothesis of this study
and provided firm empirical support for Gibb’s defensive management theory.
The instruments that were used appear to provide a reliable key to institutional
diagnosis, showing a significant difference in management and resident
attitudes at the two institutions.

The results revealed a strong tendency for the residents, as well as the 
staff, in the defensively managed institution to demonstrate more defensive- 
ness, hostility, facade formation, and distrust. These results support Gibb’s 
contention that the attitudes of fear and distrust on the part of defensive 
managers encourage hostility in the residents. This hostility often results in 
such undesirable behavior patterns as acting out. In order to deal with these 
behavior patterns, managers erect further facades and defenses. The result 
is a never ending circle of defensiveness, producing suspicion and hostility 
which in turn produce more defensiveness.

Two additional implications may be drawn from this study. First, if the 
institutional goals of successful behavior change and therapeutic output are 
to be attained, staff members in any institution must be receptive to interven- 
tion and behavior change, just as they expect the residents to be; and both 
groups must work together in the interest of total institutional effectiveness. 
Second, much of the literature in psychology has concentrated upon the 
acting-out behavior of children as attention-getting in nature, and therefore 
assumes that it is subject to modification when the proper techniques are 
used. Rather than conceptualizing acting-out behavior as primarily attention- 
getting, and therefore not to be taken seriously, it may be more fruitful to 
regard it as a logical and justifiable way for residents to handle a defensive 
management style in order to maintain their dignity. 


E. Summary 


In summary, standard attitude measurement techniques were employed in 
an effort to extend Gibb’s theory of defensive management to child care in- 
stitutions. Results supported the hypothesis and indicated that both staff and 
residents were more hostile and defensive in one institution than in the other. 
Implications for conceptualizing residents’ behavior were discussed. 
A COMPARISON OF THE SELF-DISCLOSURE BEHAVIOR OF
FEMALE UNDERGRADUATES AND MARRIED WOMEN* 


Randolph-Macon Woman's College 


Thomas L. Pasternack and Martha Van Landingham


A. Introduction


Self-disclosure involves revealing some aspect of one’s life to another person. 
Although the degree to which persons possess this personality characteristic 
varies considerably, numerous studies have found that the degree to which it 
is exhibited may be influenced by the behavior of the other person or persons 
present. Jourard and Landsman (2) found that persons disclose more towards 
persons who have been self-disclosing towards them. Similarly, Chittick and 
Himelstein (1), Levin and Gergen (5), and Powell (6) all found the degree 
of self-disclosure exhibited by subjects to be related to the degree of openness 
of the other individual present in the situation. Jourard and Resnick (4) 
reported consistency between low or high scoring on a self-disclosure question- 
naire and low or high scoring in conversational situations when dyads were 
formed with like-disclosing pairs. Subjects who scored low on the questionnaire 
disclosed less to their partners than did the subjects who scored high. They
also found that when Lows were subsequently paired with Highs, the latter 
remained high disclosing and the Lows showed a significant increase in self- 
disclosure to their partners. 

The present study was conducted primarily to determine whether there is 
a difference in the self-disclosure behavior of female college undergraduates and 
older married women, and to determine if the expected increased self-disclosure 
of the Lows when paired with Highs carries over to subsequent sessions with 
fellow Lows.

Using a questionnaire, Jourard and Lasakow (3) found that although there 
was no difference in the amount of self-disclosure between married and un- 
married college students, the married students tended to concentrate their 
disclosing upon their spouses and to become more reticent toward other persons. 
On the basis of this finding and with the emphasis that today’s American 
youth supposedly puts upon openness, it was hypothesized that (a) the female 
undergraduates would disclose more than the married women. On the basis of 
Jourard and Resnick’s study (4) it was further hypothesized that (b) low-disclosure
subjects would disclose more when paired with high-disclosure subjects
than when paired with low disclosers, and that (c) there would be no
difference in the disclosure scores of the Highs over the three sessions. It was 
also hypothesized that (d) Lows would disclose less the second time they were 
paired with other Lows than during the session in which they were paired with 
Highs, but that (e) Lows would disclose more when paired with Lows for 
the second time than when originally paired with Lows. 


B. Method
1. Subjects 


One hundred female undergraduate students ranging in age from 18 to 24 
(mean age 18.4), who were enrolled in Introduction to Psychology at 
Randolph-Macon Woman’s College, were asked to respond to a 40-item self- 
disclosure questionnaire. Thirty-nine of the items were items used by Jourard 
and Resnick (4), although seven were slightly modified to be equally applicable 
to both age groups. The remaining item was original: the students were asked 
to indicate how many, but not which, of the topics they would be willing to 
discuss fully with a female partner whom they did not know. The 12 students 
with the highest scores on the questionnaire comprised the student high-dis- 
closure group (mean score 32.17), while the 12 students with the lowest 
scores comprised the student low-disclosure group (mean score 9.08). 

The same questionnaire was distributed to 80 married women who were 
members or friends of members of the Lynchburg (Virginia) Junior Woman’s 
Club. Their ages ranged from 26 to 56 (mean age 32.7). They, too, were 
asked to complete the questionnaire and return only their score and an indica- 
tion of whether or not they were willing to participate further with the study. 
Only those who consented to continue were considered as possible subjects. 
From these, the 12 with the highest scores on the questionnaire comprised the 
married woman high-disclosure group (mean score 37.58), while the 12 
lowest scorers comprised the married woman low-disclosure group (mean score
[value illegible]).

The student and married woman groups were socioeconomically similar.
The primary differences between the two groups were the age and marital
status of their members.
2. Materials 


Twenty-four topics, 17 of which were selected from items used by Jourard 
and Resnick (4), and seven of which were modifications or originals, were 
rated by 15 other college students for intimacy value. The topics were placed 
into three equivalent groups, each group consisting of four high-intimacy, two
medium-intimacy, and two low-intimacy items. Each topic was typed on an 
index card and was numbered from 1 to 8 in increasing order of intimacy for 
each of the three groups, A, B, and C. The order of the items within each 
group of eight was then randomized. 

Rating sheets were designed to permit each subject to rate her own response 
and the response of her partner to each item on a scale of zero to 2. A rating of 
zero indicated a failure to respond, a “1” indicated the withholding of some 
relevant information, and a “2” indicated that the topic was disclosed fully. 


3. Procedure 


In the first experimental phase, each member of the married woman high- 
disclosure group was randomly paired with a member of the student high- 
disclosure group; low disclosers were paired similarly. This resulted in 12 
dyads of Highs and 12 dyads of Lows in which the partners were strangers to 
one another. 

The second experimental phase, which took place about two weeks later, 
followed the same general procedure with one major exception. In this phase, 
Highs were paired with Lows, producing dyads composed of members from 
opposite disclosure categories. 

The third experimental phase took place about two weeks after the second 
with the same general procedure being followed with the exception that Lows 
were again paired with Lows, and Highs were again paired with Highs. 
During the course of the study each of the subjects was paired with three 
different persons. 

The instructions to the subjects were identical in all phases. The two 
members of each dyad were introduced to each other by the experimenter who 
then took them to a room which afforded them privacy. They were then 
asked to discuss with each other the eight topics on the cards that were handed 
to them. Each subject had the option of speaking on a topic or declining to 
discuss it. She then handed the topic card to her partner who had the same
option. After both partners considered the first topic, they indicated their 
perceptions of the responses on the rating sheets. This process was repeated, 
with the partners alternating speaking first, until all eight topics had been 
considered. 

It was emphasized to the subjects that they should respond to an item only 
if they wanted to, and that they could disclose as much or as little about the 
topic as each of them felt comfortable doing. After answering any questions, 
the experimenter left the room. No time limit was placed on the interviews, 
and they ranged from 30 minutes to two hours. After the subjects had completed
each session, the student member of each group was asked to return the
rating sheets and topic cards to the experimenter. 

Even though the three item groups were considered to be equivalent in
intimacy value, groups A, B, and C were equally assigned to the dyads on
each of the three sessions to prevent the results from possibly being influenced
by the specific items. During the course of the three sessions, therefore, each
subject was exposed to item groups A, B, and C in varying orders.


C. Results


Since two married woman subjects, one High and one Low, dropped out of 
the study after the first session, it was necessary to drop two additional married 
women and four students from the final analysis. Subjects had been randomly 
paired with one another prior to the beginning of the experiment and only 
those subjects who completed all three phases of the experiment were included 
in the data. The results, therefore, are based on 40 subjects: 10 High students, 
10 Low students, 10 High married women, and 10 Low married women. 

Contrary to expectations, the female undergraduates did not disclose signifi- 
cantly more of themselves than did the married women during the study. A 
person’s total disclosure was determined by adding the partner's rating of the 
person to the individual's rating of herself. With the use of the data presented 
in Table 1, the mean total disclosure score of the students was found to be 
27.32 ([25.57 + 29.07]/2), while the mean total disclosure score of the
married women was 26.63 ([25.33 + 27.93]/2). This difference was not
statistically significant. 
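
The arithmetic behind these figures can be checked with a short sketch. It assumes, as the description of the rating sheets implies, that each of the eight topics in a session receives a self rating and a partner rating of 0, 1, or 2; the function names are hypothetical. Averaging the Low and High group means is appropriate here because the two groups are of equal size.

```python
def total_disclosure(self_ratings, partner_ratings):
    """Total disclosure for one session: the subject's own ratings of herself
    plus her partner's ratings of her (each rating is 0, 1, or 2)."""
    return sum(self_ratings) + sum(partner_ratings)


def combined_mean(low_mean, high_mean):
    """Mean of two equal-sized groups, given the mean of each group."""
    return (low_mean + high_mean) / 2


students = combined_mean(25.57, 29.07)        # Low and High students
married_women = combined_mean(25.33, 27.93)   # Low and High married women
print(round(students, 2), round(married_women, 2))
```

With eight topics per session, the largest possible total under this scoring is 32, so the reported means of roughly 25 to 29 sit near the top of the scale.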

The difference in the amount of self-disclosure by the Highs (X̄ = 28.50)
and the Lows (X̄ = 25.45) during the three sessions was, however, statistically
significant, F (1, 30) = 12.10, p < .01.


TABLE 1
Mean Total Disclosure Scores of High- and Low-Disclosing Subjects
During the Three Dyadic Sessions

                      Low-disclosing                  High-disclosing
                         Married                         Married
Sessions    Students     women      Total    Students    women      Total
1            23.9        24.0       23.95     29.3       28.9       29.1
2            26.8        26.5       26.65     28.4       26.4       27.4
3            26.0        25.5       25.75     29.5       28.5       29.0
Total        25.57       25.33      25.45     29.07      27.93      28.50
The analysis of variance also showed that Lows disclosed significantly less
to their partners (X̄ = 23.95) than did Highs (X̄ = 29.10) when dyads
were formed with like-disclosing pairs the first time (Session 1), F (2, 60) =
7.19, p < .01. Similar results were obtained the second time the dyads were
composed of like-disclosing pairs (Session 3), F (2, 60) = 4.53, p < .05,
thus indicating that the questionnaire is a valid instrument for classifying
people into High and Low groups. When Lows were paired with Highs
(Session 2), the Lows showed a significant increase in self-disclosure, F (2,
60) = 3.77, p < .05, thus supporting hypothesis (b) that Lows would
disclose more when paired with Highs than when paired with Lows. Following
experiences with Highs, the self-disclosure of the Lows in like-disclosing
dyads (Session 3) was higher, though not significantly, than prior to such
experiences, thus providing some support for hypotheses (d) that Lows would
disclose less the second time they were paired with other Lows than during
the session in which they were paired with Highs, and (e) that Lows would
disclose more when paired with Lows for the second time than when originally
paired with Lows.

Although the self-disclosure scores of the Highs during Session 2 (X̄ =
27.4) were lower than their scores during either Session 1 (X̄ = 29.1) or
Session 3 (X̄ = 29.0), the difference was not significant. Therefore,
hypothesis (c), which stated that there would be no difference in disclosure scores
of the Highs over the three sessions, was supported (see Figure 1).


D. Discussion 


During the first session when low-disclosing subjects were paired in the
interview situation with similarly low-disclosing subjects, they revealed, as
might be expected, relatively little about themselves. When high-disclosing
subjects were paired with like partners, they disclosed much more about
themselves than did the Lows. When dyads of opposites were formed, the
High students maintained high-disclosing scores, while the Low married
women with whom they were paired changed in the direction of greater openness.
The High students apparently served as models for the Low married
women. The High married women, however, when paired with Low students,
tended to disclose less than they had during the previous session when talking
with High students. It is possible, therefore, that the higher scores of the
married women Highs during Session 1 may have resulted partly from interaction
with the High students. Surprisingly, during Session 2, the Low
students disclosed slightly more than the High married women with whom
they were paired, and even the Low married women who were paired with
[Figure: line graph of mean disclosure score (vertical axis) for Sessions 1, 2,
and 3 (horizontal axis), with separate lines for High students, High married
women, Low students, and Low married women.]

FIGURE 1
Mean Total Disclosure Scores of High- and Low-Disclosing Subjects
During Each of the Three Sessions

High students disclosed more than the High married women. During the
last session when Highs were again discussing topics with fellow Highs, and
Lows were with Lows, both married women and student Highs returned to the
previous higher scores similar to those during Session 1. That is, when
high-disclosing subjects were again brought together, they returned to their
“natural” level of disclosure. The scores for the Lows during the last session,
when they were once again paired with low-disclosure partners, were not as
high as they had been when Lows discussed with Highs, but they were higher
than they had been during Session 1 when Lows talked with Lows initially.
After experience with a high-disclosure person, Low subjects had a tendency
to reveal more of themselves than they had initially. These findings lend
further support to the results of Jourard and Resnick (4), and indicate that
there may have been a carry-over effect operating.

Although the scores of the married women Highs and the student Highs
on Session 1 were similar, their scores on Session 2 were quite different. The
scores of the High married women came down markedly. In addition to the
possibility that the scores of the High married women during Session 1
reflected the result of their interaction with the High students, another possible
explanation is that the undergraduates who were willing to reveal much about
themselves did not alter their self-disclosure behavior when faced with someone
who was unwilling to reveal as much. It may be, however, that the married
women were not as willing to disclose themselves as fully when talking with
someone less “open.” Being “open” may have been more rewarding and less
of a risk for today’s students than for the married adults.

Future studies might profitably concern themselves with the permanency of 
possible carry-over effects generated by a single experience, and the number of 
such experiences necessary to produce a lasting change toward greater self- 
disclosure. 


E. Summary 


Twenty dyads consisting of a married woman and an undergraduate discussed
eight topics of varying degrees of intimacy during each of three different
sessions. These dyads were formed originally with persons who had similar
scores on a self-disclosure questionnaire. Subsequently, the dyads were formed
with partners who differed in degree of self-disclosure and, finally, with
like-disclosing pairs once again.

Contrary to expectations, there was no significant difference between the 
self-disclosure behavior of married women and undergraduates. Lows dis- 
closed significantly less to their partners than did Highs when dyads were 
formed with like-disclosing pairs. When Lows were paired with Highs, how- 
ever, the Lows showed a significant increase in self-disclosure. The self-dis- 
closure of the Lows in like-disclosing dyads was higher, though not significantly, 
following experiences with Highs than prior to such experiences. 
THE WILLIAMS SCALE FOR THE SPECIFICATION OF
POSITION: AN INITIAL REPORT*


Utah State University and Brigham Young University 


Ray M. Williams and Leslie M. Cooper


A. Introduction


In her book on psychological testing, Anastasi observed “Some psychologists 
have maintained that, in the domain of personality, the individual can be effec- 
tively described only in terms of his own peculiar behavior interrelationships, 
rather than in terms of common traits” (1, p. 465). 

It is often useful in the clinical setting to elicit from a client a verbal 
expression of his perception of some problem, behavior, or event and of its 
meaning and importance to him. The purpose of this report is to introduce 
an assessment procedure that elicits such an expression in a structured manner 
and to publish initial standardization data. The device is called the Williams 
Scale for the Specification of Position (WSSP). 

The conceptualization and preliminary work on this instrument have spanned
more than five years. It was felt that the results of more recently
completed formal studies have yielded indices of reliability and validity, and
evidence of potential clinical value, sufficient to warrant publishing the results.

The scale may be roughly classified as an inventory. There are 16 questions
about one problem, or problem area, called a subject of enquiry. This problem
area is usually determined by the administrator, although it can be selected
with the assistance of the client. It should be carefully selected on the basis of
several criteria. First, the subject of enquiry should be, potentially,
psychologically significant: it should represent an area of interest or concern to
the tester or clinician. Second, to the testee, it should be intimate: that is, it
should represent some matter of contemporary interest and importance. Third, in
the mind of the testee, it should be particular: it should seem limited in its scope
and meaning.

Examples of appropriate subjects of enquiry that have been used in pilot
studies are authority people, success in life, sex and dating, peace and happiness,
success in school, goodness and worth.

The administrator structures the interaction by saying: 


* Received in the Editorial Office on August 2, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
“I want to ask you about (subject of enquiry). Please understand that
this is not a test. The things you say cannot be called right or wrong. You 
cannot be judged as passing or failing because there is nothing to pass and 
nothing to fail. 

“Is this clear to you? Will you believe that what we shall do is not a test?
(Pause.) Very good. 

“Think of this: There are about four billion people on this earth, and each 
one of them is different from every other one. In the same way, you yourself 
are different from all other people on earth. And what I should like you to 
do is tell me exactly how you feel and think and thus how you are different. 
All right? (Pause.) 

“Remember, I want to ask you about (subject of enquiry). We are going 
to call (subject of enquiry) a problem. I want you to think of it as a problem 
to you. Whether (subject of enquiry) is good or bad, big or little, I want you 
to think of it as a problem. Do you understand this? (Pause.)

“Let’s see now how you feel about your problem of (subject of enquiry).”

The subject is then asked to respond to 16 items comprising the WSSP.
The following four items constitute the first of four parallel sets or groups of
items:¹

I should like you to describe your problem of __________ by saying
whether

A) it has bothered you often;

B) it has bothered you sometimes;

C) it has bothered you very seldom;

D) or—perhaps you feel you can’t choose. (Affective)

I should like you to report about your problem of __________ by saying
whether

C) it is quite simple to understand;

B) it is somewhat hard to understand;

A) it is very difficult to understand;

D) or—perhaps you feel you can’t choose. (Cognitive)

I should like you to discuss your problem of __________ by saying
whether

B) you give some effort to solve it;

A) you give much effort to solve it;

C) you give very little effort to solve it;

D) or—perhaps you feel you can’t choose. (Conative)

I should like you to explain your problem of __________ by saying
whether

A) you sort of think it is very important;

C) you sort of think it is not very important;

B) you sort of think it is quite important;

D) or—perhaps you feel you can’t choose. (Generic)


1 A copy of the Williams Scale for the Specification of Position may be obtained
from Leslie M. Cooper, at the address shown at the end of this article.


The testee elects one of the underscored alternatives of each item as his 
response, and this is checked as response A, B, C, or D by the administrator 
of the test. 

The tester may, if he wishes, present the first item and then, before the 
testee elects a response, say: “If you answer that your problem has bothered 
you often, I may think you have a really big problem—or I may think you 
have only a little problem, but you let it bother you a lot. So you see there are 
no good answers and no bad answers. Just tell me honestly how you feel. 
Now, I shall read the question again and you tell me what is your best answer.” 
It is proposed that such manipulation by the administrator of the test is 
effective against faking and malingering on the part of the testee. 


B. RATIONALE OF THE SCALE 


The scale assumes that the client can effectively represent his perceptions, 
and therefore his personality, with language. The test items of the scale to 
which the client responds are viewed as substances and structures of language 
useful to the client to represent his percepts. The verbalization of cognitive 
content and cognitive processes is held to be a central factor in personality, and 
it is the focus of this scale. 

The rationale of the scale proposes that extreme inferential interpretation 
in evaluating or otherwise understanding personality is undesirable. The 
complexity and variability of human beings are not easily understood. The 
value of subtle inferential interpretation from personality measurements is 
questionable.

The scale does not extract from a client verbal responses from which
extensive and subtle inferences must be drawn. It elicits a verbal reflection or
representation of a percept, which needs only to be classified and treated
actuarially. It is maintained that a subject can know himself concretely, and
only concretely, by regarding his own percepts in the substance and form of
language; and it is maintained that a tester can confidently know the testee
only by regarding concretely percepts as they are exposed in the protocols of
the scale, not by inferential interpretation of the protocols.

The client’s protocol of 16 responses is his reaction to a total of 64 variables 
of language form, content, and emphasis contained in the body of test items.
The protocol is proposed to be a good sample of the verbal substance of the 
percept chosen as a focus for testing. But if reliance is to be placed on this 
sample, then an attempt must be made to control for ambiguity. The client 
must be aided and controlled, as he utters the language sample, to regard his 
percept unambiguously. This is considered crucial in the WSSP scheme, and 
it is responsive to a rather common and vigorous concern about the signifi- 
cance of ambiguity. 

The rationale of the scale assumes that ambiguity in language can be
managed and that the method of management is not that which is commonly
proposed: namely, the use of greater precision and specification and
elaboration. Writing of this problem, Waismann says “. . . ordinary language has
the advantage of being in possession of declensions . . . and thus renews its
spell over the philosopher luring him on into the shadow chase” (6, p. 431).
To attempt to certify the meaning of words with a greater number of words
or with more precise words is to engage in a “shadow chase,” according to
Waismann, who suggests that the only way to prevent or cure ambiguity is,
in essence, not to attempt to prevent or cure it.

WSSP rationale proposes that “general specification” must be distin- 
guished from “specific generalization” in language form. This distinction 
will help to understand Waismann’s paradox. A “general specification” causes 
and sustains ambiguity, while a “specific generalization” fosters nonambiguity. 
The item “I admire my parents in all important matters,” from Cattell’s 
Self-analysis Form (2) is an example of a “general specification.” It is general 
because it permits implications and uncertain meaning to emerge endlessly as 
the testee considers it. It is prima facie a specification, but is really a boundless 
generalization because of its potentially endless chain of implications and
[...]. As [...] ambiguous, and it makes it difficult for the respondee
[...] about his parents. The item “my problem [...],” [...] as foci of
enquiry in WSSP, is an example of a “specific generalization” about a similar
psychological event. It [...] his parents. This is so because this [...]
testee with various distinctions and shades of meaning, but rather leaves him
free to form a “set” about the event. It invites him and helps him to avoid
[...]

All words and combinations of words are always in some degree connotative
and hence obscure, and word-creature man is forever trying to “catch the 
shadow cast by the opacities of speech.” But the situation is not hopeless: the 
ambiguity of words can be cured by forming a “set” about the words, a 
cognitive fixity about the percept the words represent. In responding to the 
directions and items of WSSP, the testee forms a “set” about the subject of 
enquiry and his responses are thereby not representations of an ambiguous 
percept of the subject of enquiry. 

Still further control for ambiguity is necessary if reliance is to be placed 
on the words-sample that is the testee’s protocol. The testee must be aided and 
controlled not to spoil his unambiguous perception as he utters it in responding 
to the test; that is, he must be controlled to publish his perception unambigu- 
ously. It is proposed that such control is inherent in WSSP inasmuch as the 
scale, essentially, does not require the testee to choose or use words in respond- 
ing, but requires him only to indicate a level in a simple ordinal scale. These 
indications make up his protocol of responses, and they are not susceptible to 
the stigma of ambiguity. 

In summary, WSSP is built on these premises: that man is, above all, a 
verbal creature; that percepts about man—that is, about his personality—are 
available for use in the form and substance of language; that these percepts 
are available originally only from the man whose percepts they are; that such 
percepts can be free from the devastations of ambiguity and thus free from a 
need to be extensively interpreted inferentially. 

It is proposed that, built upon such premises, WSSP holds several
distinctions. First, its protocol is not an assembly of many verbal representations of
one subject. Second, this protocol has unmistakable, not dubious, relevance
because it has not emerged from one ambiguity or a chain of ambiguities.
Third, this protocol, coming from a testee’s “set” about a perception, is an
idiomatic, not a sophisticated, index of the personality of the testee.


C. Scoring


Adler’s idea of life-style, Allport’s proprium, and Jung’s archetype of self 
(4) are only a few of many concepts about the tendency of the human organism 
to strive for unity. It is proposed that this tendency applies to the language of 
the human organism and that every person strives to be consistent and unified 
in his words and groupings of words relating to one matter. The scoring 
system is based on this proposition. 

A response grid is used to assist in the quantifying, classifying, and
interpreting of responses. As illustrated in Figure 1, the columns correspond to the
possible alternatives of each test item and represent an intensity dimension, with
one column (D) corresponding to the alternative “can’t choose.” The rows
correspond to different orientations or dimensions of the testee’s “set” in
regard to the subject of enquiry, and have been labeled affective, cognitive,
conative, and generic components.


                A        B        C        D
Affect        _____    _____    _____    _____
Cognition     _____    _____    _____    _____
Conation      _____    _____    _____    _____
Generic       _____    _____    _____    _____

FIGURE 1
Response Grid for WSSP


To date, the scoring of the protocol has made use of the response grid and
has consisted, in the main, of identifying what have been called pathological
patterns. Nine such patterns have been utilized to the present time and are
listed in Table 1. The nine pathology patterns are proposed to represent nine
common forms of deviation from normal consistency and unity in language
about any percept.
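
The column-concentration and scatter patterns defined in Table 1 (Patterns 1, 2, 3, and 6) can be illustrated with a short Python sketch. This is only one possible reading of those definitions, not the authors' scoring procedure: it assumes a protocol is recorded as 16 (component, column) pairs, and it assumes that "scatter" means the number of grid boxes holding at least one response. All names in the code are hypothetical.

```python
from collections import Counter

COMPONENTS = ("affective", "cognitive", "conative", "generic")
COLUMNS = ("A", "B", "C", "D")


def tally_grid(protocol):
    """Count responses in each (component, column) box of the 4 x 4 grid."""
    if len(protocol) != 16:
        raise ValueError("a complete protocol has 16 responses")
    for component, column in protocol:
        if component not in COMPONENTS or column not in COLUMNS:
            raise ValueError(f"unknown response: {component!r}, {column!r}")
    return Counter(protocol)


def flagged_patterns(protocol):
    """Return the numbers of Patterns 1, 2, 3, and 6 that the protocol meets."""
    grid = tally_grid(protocol)
    column_totals = Counter()
    for (_, column), count in grid.items():
        column_totals[column] += count
    flags = []
    if column_totals["A"] >= 11:  # Pattern 1: 11 or more responses in column A
        flags.append(1)
    if column_totals["C"] >= 11:  # Pattern 2: 11 or more responses in column C
        flags.append(2)
    if column_totals["D"] >= 3:   # Pattern 3: 3 or more "can't choose" responses
        flags.append(3)
    if len(grid) >= 11:           # Pattern 6: responses in 11 or more boxes
        flags.append(6)
    return flags


# Hypothetical protocol: three "A" responses and one "B" response per component.
example = [(c, "A") for c in COMPONENTS for _ in range(3)]
example += [(c, "B") for c in COMPONENTS]
print(flagged_patterns(example))
```

The hypothetical protocol places 12 of its 16 responses in column A and so meets Pattern 1 only. Patterns 4 and 5 are omitted because the source does not say how "can't choose" responses enter the weighted ratio.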


D. Reliability of the Scale


If the dimension corresponding to the columns of the response grid is thought
of as being continuous, with the D (cannot choose) category being on this
continuum as minimum intensity, then a score can be assigned to each response
and a correlation coefficient computed between the results of two different
administrations of the scale. When such item test-retest reliabilities were
determined for 93 college students at Brigham Young University, they ranged
from .28 to .92 with both the mean and median reliability being .54. These
coefficients compare very favorably with the reliabilities of other standardized
personality tests when so determined (3, pp. 556-560). While such indices are
indicative, they are not very meaningful or persuasive. Similarly, other
common methods of determining reliability, such as the comparable form or
split-half technique, are not applicable either.
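
The item test-retest index described above can be sketched in Python as follows. The numeric coding is an assumption: the text says only that D is treated as the minimum intensity on a continuum, so the values 3, 2, 1, 0 for A, B, C, D are illustrative, as are the function names and the sample data.

```python
from math import sqrt

# Assumed scoring: A is the highest intensity, D ("can't choose") the lowest.
SCORES = {"A": 3, "B": 2, "C": 1, "D": 0}


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with 2+ values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return sxy / sqrt(sxx * syy)


def item_retest_r(first, second):
    """Test-retest r for one item: one response letter per subject, per session."""
    return pearson_r([SCORES[c] for c in first], [SCORES[c] for c in second])


# Hypothetical responses of six subjects to one item on two administrations.
first_session = ["A", "B", "C", "A", "D", "B"]
second_session = ["A", "B", "B", "A", "D", "C"]
print(round(item_retest_r(first_session, second_session), 2))
```

Repeating this calculation for each of the 16 items would give the kind of item-level coefficients summarized above.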


However, since the scoring of the protocols makes use of the response grid,
a more appropriate index of the reliability of the scale is the percentage of
agreement within the 16 cells of the response grid of the two different
administrations. For the 93 college students the percentages ranged from 40
TABLE 1
Response-Grid Patterns of the WSSP Used as Indices
of “Pathological” Functioning


Pattern
number    Definition
1         A concentration of 11 or more responses in column “A” of the
          protocol grid.
2         A concentration of 11 or more responses in column “C” of the
          protocol grid.
3         A concentration of 3 or more responses in column “D” of the
          protocol grid.
4         An affect/conation ratio of 2/1 or higher with at least one affect
          response in the “A” column and one conation response in the
          “C” column. Responses are weighted 3, 2, 1 in columns “A,” “B,”
          and “C,” respectively.
5         A conation/affect ratio of 2/1 or higher with at least one affect
          response in the “C” column and one conation response in the “A”
          column. Responses are weighted as in Pattern 4.
6         A scatter of responses in 11 or more of the 16 boxes of the grid.
7         In a readministration of the test within 24 hours, a failure to
          repeat more than 8/16 of the responses given in the first test.
8         A combination of Patterns 6 and 7. Specifically, a combination of a
          scatter of responses in 11 or 10 or more of the 16 boxes of the grid
          with a failure, on retest within 24 hours, to repeat more than 8/16
          or 9/16 of the responses given in the first test.
9         An incomplete protocol, due to the unwillingness and/or incapacity
          of the testee to respond completely to the test according to the
          standard form and procedure.


percent to 80 percent, with both the mean and the median being 67 percent.
For a group of 20 hospitalized youths diagnosed as psychotic the percentages
ranged from 50 percent to 100 percent agreement, with a mean of 72 percent
agreement. For a group of 22 youths of equivalent age in a high school in Utah
the range was from 63 to 94 percent agreement, with a mean of 78 percent
agreement. Again, these findings compare favorably with those found for other
categorical systems (5, pp. 40-41).
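
A percentage-of-agreement index of the kind described above can be sketched in Python. The sketch assumes that agreement means the share of the 16 items answered with the same alternative on both administrations; the source does not spell out the computation, so this reading, the function names, and the sample data are all assumptions. The last function applies the retest criterion given for Pattern 7 in Table 1.

```python
def repeated_count(first, second):
    """Number of items answered identically on the two administrations."""
    if len(first) != len(second) or not first:
        raise ValueError("protocols must be non-empty and of equal length")
    return sum(a == b for a, b in zip(first, second))


def percent_agreement(first, second):
    """Percentage of responses repeated on the second administration."""
    return 100.0 * repeated_count(first, second) / len(first)


def meets_pattern_7(first, second):
    """Pattern 7: failure to repeat more than 8 of the 16 responses."""
    return repeated_count(first, second) <= 8


# Hypothetical protocols: the last four responses change on retest.
first_protocol = list("ABCD" * 4)
second_protocol = first_protocol[:12] + list("BCDA")
print(percent_agreement(first_protocol, second_protocol))
print(meets_pattern_7(first_protocol, second_protocol))
```

In this hypothetical case 12 of the 16 responses are repeated, giving 75 percent agreement, which would not meet the Pattern 7 criterion.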

The average interjudge reliability for categorizing the protocols into the 
various “pathological” patterns was found to be 98 percent agreement between 
three pairs of judges. Again, these are sufficiently high to indicate little 
difficulty in reliably assigning the protocol into one of the patterns. 


E. Results of Individual Studies


In a series of small informal pilot studies the WSSP appeared to differentiate
high school students contrasted by IQ, grade point average, teacher’s ratings
of school achievement, referrals for disciplinary action, as well as
socioeconomic status of family. Subsequently three formal studies were undertaken to
evaluate the usefulness of the scale.

The first study consisted of administering the scale, using “your success in 
life” as the subject of enquiry, to 30 students in the honors program at 
Brigham Young University and 28 students who had been placed on academic 
probation at Brigham Young University. 

Students in the honors program have had to show “unmistakable evidence 
of academic promise on the basis of tests, grades, and recommendation.” Most 
students who apply are prospective freshmen. Once admitted to the program, 
they are given special consideration in terms of flexibility in programming and 
small sections in many courses.

The second author approached two such classes and asked for volunteers
to participate in a research project concerned with the development of a new 
personality scale. Thirty students volunteered to participate. 

Students at Brigham Young University are expected to maintain a certain 
minimum grade-point average. A student who does not meet the minimum 
requirements and whose record indicates that the student has not been making 
satisfactory progress in the accumulation of grade points is placed on academic 
probation. A student on academic probation is expected to improve his academic
standing during the semester he is placed on academic probation, or he is 
subject to academic suspension from the University. 

A confidential list of students on academic probation was obtained from the 
office of academic standards. A letter was sent to each student asking him to
volunteer as a subject for a research project concerned with “the development
of a new personality scale.” Twenty-eight students agreed to participate as 
subjects. 

The second author individually administered the WSSP to each student 
using the standardized instructions and procedures. 

The results are presented in Table 2. It will be noted that 12 of the 28
protocols, or 43 percent, of the probation group fell into a “pathological”
pattern, while only four of the 30 protocols, or 13 percent, of the honors
group did so. The chi square of 4.93 for these frequencies is significant at
the .05 level of significance. The phi coefficient, an index of correlation, for
these data was .29. This suggests that the categorizing of students by the
WSSP is not independent of their actual academic standing at an accredited
university. It appears that the scale is more sensitive to the responses of honor
students than it is to those of the probation students. This may partly be
accounted for by the choice of the subject of enquiry utilized in this study. In
terms of an overall philosophy of life, students at Brigham Young University
TABLE 2
Frequency of Protocols Falling into Pathological Patterns of the Scale in
Three Studies Involving (a) Honor Students and Students on Probation,
(b) Delinquents and Normal Adolescents, and (c)
Psychotic and Normal Adolescents


                                     Pattern
Actuarial                   Pathological    Normal    Total

Students on probation            12           16        28
Honor students                    4           26        30
  Total                          16           42        58

Delinquents                      11            7        18
Normals                           5           13        18
  Total                          16           20        36

Psychotics                       14            6        20
Normals                           1           21        22
  Total                          15           27        42


may not construe being on academic probation as necessarily threatening their 
being a success in life. 

A second study² utilized a sample of 18 adolescents placed in the detention
home in Provo, Utah, for delinquent behavior and a sample of 18 adolescents
from Manti High School, a school in central Utah, who were doing at least
average work in their academic studies and who had no record of any delinquent
behavior or citations. The WSSP was administered to subjects by a research
assistant with the use of “your problem of authority” as subject of enquiry.

These results are also presented in Table 2. Eleven of the 18 protocols, or
61 percent, of the delinquent group fell within a “pathological” pattern,
compared with five of the 18 protocols, or 28 percent, of the normal group. The
chi square of 4.05 is significant at the .05 level of significance. The phi
coefficient was .34. This indicated that the WSSP could differentiate between these
two groups in terms of the “pathological” patterns.

In a third study the WSSP was administered to 20 adolescents who had 
been diagnosed as psychotic and were inmates of the Youth Center at the Utah 
State Hospital, and 22 adolescents of equivalent age at Spanish Fork High 
School, a rural high school in Utah. The latter were selected on the basis of 
their having an average GPA of B or better, having been evaluated as physically 
healthy, and having no known record of any mental or emotional problems. 
The scale, with the use of “your peace and happiness” as the subject of
enquiry, was individually administered to each of the youths. These results also
are presented in Table 2.


2 We are grateful to Mr. Vance Grange, a graduate assistant at Brigham Young
University, for administering the scale to subjects.

3 We are indebted to Mr. Douglas Wood, a psychologist at the Utah State Hospital,
for administering the scale to subjects.

Fourteen of the 20 protocols, or 70 percent, of the psychotic group were
classified in a “pathological” pattern, while only one of the 22 protocols, or
five percent, of the normals was so classified. The chi square of 16.80 is
significant beyond the .001 level of significance. The phi coefficient for these data
was equal to .63. Again, the scale patterns were found to be significantly and
impressively related to the actual diagnosis of the group members.
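
The chi squares and phi coefficients reported for the three studies can be checked against the frequencies in Table 2 with a short Python sketch. It assumes the standard formula for a 2 x 2 table with one degree of freedom, with and without the Yates continuity correction, and phi computed as the square root of chi square divided by N. The source does not state which formula was used in each study, and the function names are illustrative.

```python
from math import sqrt


def chi_square_2x2(a, b, c, d, yates=False):
    """Chi square (1 df) for the 2 x 2 table [[a, b], [c, d]].

    With yates=True, N/2 is subtracted from |ad - bc| before squaring
    (floored at zero), which is the usual continuity correction.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2.0, 0.0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denominator


def phi_from_chi_square(chi2, n):
    """Phi coefficient recovered from a 1-df chi square and the sample size."""
    return sqrt(chi2 / n)


# (pathological, normal) frequencies for each pair of groups in Table 2.
studies = {
    "probation vs. honors": ((12, 16), (4, 26)),
    "delinquents vs. normals": ((11, 7), (5, 13)),
    "psychotics vs. normals": ((14, 6), (1, 21)),
}
for name, ((a, b), (c, d)) in studies.items():
    n = a + b + c + d
    plain = chi_square_2x2(a, b, c, d)
    corrected = chi_square_2x2(a, b, c, d, yates=True)
    print(f"{name}: uncorrected {plain:.2f}, corrected {corrected:.2f}, N = {n}")
```

Under these assumptions the corrected formula gives 4.93 and 16.80 for the first and third studies, and the uncorrected formula gives 4.05 for the second, matching the reported values. The phi coefficients of .29, .34, and .63 follow when the same chi squares are divided by N and the square root is taken.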

Each of the three studies did significantly differentiate between the different 
groups, thus demonstrating the validity and usefulness of the scale for specific 
purposes. Nonetheless, additional studies must yet be undertaken. While the 
scale is currently seen as being applicable in many different testing situations, 
the domain of the scale and its limitations have yet to be defined. The appro- 
priateness of different subjects of enquiry needs further investigation also. 
Consequently, it is recognized that the scale is still being developed, and should
be considered as still being in a research stage. Nonetheless, it is hoped that
others might find it convenient and useful to use the scale in their particular
settings.


F. Summary 


The WSSP is a structured procedure to obtain an expression of an
individual’s perception of a problem, behavior, or event, and its meaning to him.
It consists of 16 multiple-choice-type items about some specific problem area
chosen by the administrator. It is scored by means of a response grid which
yields “pathological patterns.” The average percent agreement for two
administrations ranged from 67 to 72 percent in three studies. Results from these
studies indicated that (a) more university students on probation than honor
students (p = .05), (b) more delinquents than nondelinquents (p = .05),
and (c) more youths diagnosed as psychotic than youths not so diagnosed
(p = .001) gave responses yielding pathological patterns.
RAVEN’S PROGRESSIVE MATRICES: VALIDITY,
RELIABILITY, AND NORMS*


East Orange Veterans Administration Hospital 


Henry R. Burke¹


A. Validity


With relatively small numbers of Veterans Administration Hospital 
patients, previous studies (1, 3) have indicated that in terms of concurrent 
validity, scores on the Raven Progressive Matrices (9) correlated about as 
well with scores on the WAIS Verbal (r = .80, r= .65) and WAIS Per- 
formance (r = .79, r = .76) as these did with one another (r = .77, r= .70). 
In terms of construct validity, the Raven, in both a principal components and 
a varimax factor analysis (3), loaded about as significantly on a general factor 
of intellectual functioning, as did the factor scores of Cohen (4, 5) for the 
WAIS (.80, .74, .72 vs. .76; and .79, .47, .34 vs. .53), even when this 
general factor was heavily determined by verbal content. 


B. Reliability: Old Data


In the study by Burke and Bingham (3), the untimed Raven showed a 
surprising split-half corrected reliability coefficient, .96 (N= 91, Age M= 
35.1, SD = 8.7). 


C. Reliability: New Data


To investigate further the reliability of the Raven, 567 answer sheets were 
found in our files of veterans (black and white) who had taken this test when 
referred for vocational counseling. The Raven was always administered 
without a time limit. These answer sheets were rescored for odds and evens. 
The Pearson product-moment correlation between the two sets of scores was 
found to be .93 (Age M = 35.5, SD = 9.1, Age range = 18-64). 

Application of the Spearman-Brown formula for predicting the reliability 
of a test of double length (8, p. 194) yielded a reliability coefficient, .96. 
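
The odd-even procedure and the Spearman-Brown correction described above can be sketched as follows. This is a minimal illustration, not the author's actual computation: the function names are hypothetical, and item scores are assumed to be coded 0/1 per item. Only the correlation of .93 is taken from the text.

```python
from math import sqrt


def odd_even_scores(item_scores):
    """Split one answer sheet (assumed 0/1 item scores) into odd- and even-item totals."""
    odd = sum(item_scores[0::2])   # items 1, 3, 5, ...
    even = sum(item_scores[1::2])  # items 2, 4, 6, ...
    return odd, even


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability of a test lengthened by length_factor (2 = double length)."""
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


# The odd-even correlation reported in the text, stepped up to full length.
corrected = spearman_brown(0.93)
print(round(corrected, 2))
```

With r = .93 the correction gives 2(.93)/(1 + .93), which rounds to the coefficient of .96 reported in the text.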

Inasmuch as it is well known that “other things being equal, the size of a 
coefficient of correlation is very much affected by the heterogeneity of the 
population on which it is computed” (8, p. 208), the 567 cases were accord- 
ingly divided into five age groups, and the correlations of odd-even scores were 
computed for these age groups. The findings are presented in Table 1. 


* Received in the Editorial Office on August 3, 1972, and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 
1 The author wishes to express his appreciation for the statistical and computer 
assistance of Emmanuel Lerner of the Eastern Research Support Center, Veterans 
Administration Hospital, West Haven, Connecticut. 


TABLE 1 
RAVEN PROGRESSIVE MATRICES (9): RELIABILITIES AND MEASURES 
OF CENTRAL TENDENCY FOR AGE RANGES 

Ages      N      M     Mdn*           r (odd-even)   r (corrected) 
16-25    103     46    [illegible]    .83            .89 
26-35    107     41    42             .93            .96 
36-45    217     37    38             .94            .96 
46-55     69     37    33             .94            .96 
56-65      8     37    27             .95            .97 
Total    567     40                   .93            .96 


* Median scores for ages 20, 30, 40, 50, 60, given by Raven (9, p. 15) in his table 
of norms. 


Even though the data show a negative correlation with age (r = -.27, 
N = 567), homogeneity or heterogeneity with respect to age does not appear 
significantly to affect test score reliability from age 26 to 64. 

The data in this study on reliability for ages above 25 compare favorably 
with similar data for the reliability of WAIS Verbal and Performance scores. 
The corrected split-half reliability for Raven scores equals that for WAIS 
Verbal scores (r = .96), and it is better than that for WAIS Performance 
scores (r = .96 vs. r = .93 and .94) (11, pp. 12, 13). 

These data on reliability resolve favorably some of the doubt about the 
reliability of the Raven expressed previously (2, pp. 215, 216). Reliability 
seems quite adequate for individual decision making, at least for age ranges 
above 25 in an American veteran or similar adult male population. 

In Table 1, comparing the mean scores for the age groups in this study with 
the medians given by Raven (9, p. 15) as norms, we find that the mean scores 
in this study do not drop off as remarkably for the upper age groups as do the 
medians in Raven’s norms. This result may be due to the fact that all the 
tests in this study were given without a time limit; a time limit tends to 
reduce the scores of older testees. (It may also be due to the relatively small 
and possibly selected character of our upper age groups.) In any case, these 
data lead us to the conclusion that we can set up a practically useful table of 
percentile rank norms, at least for the population of this study, or a similar 
population, disregarding the effect of age. 


TABLE 2 
NORMS: PERCENTILE RANKS FOR AN AMERICAN (BLACK AND WHITE) VETERAN 
OR ADULT MALE POPULATION, WITH WAIS IQ EQUIVALENCIES 
(N = 567, Mean age = 35.5, SD = 9.1, Mean score = 40, SD = 12) 

Raw       Cumulative     Percentile    Estimated WAIS IQ 
scores    frequencies    ranks         equivalencies* 
60        567            100 
59        566             99           140 
58        564             99           135 
57        557             98           130 
56        552             97           128 
55        538             95           125 
54        522             92           121 
53        507             89           118 
52        491             86           116 
51        467             82           113 
50        443             78           111 
49        423             75           110 
48        394             69           107 
47        366             65           106 
46        355             63           105 
45        336             59           103 
44        305             54           101 
43        286             50           100 
42        270             48            99 
41        255             45            98 
40        243             43            97 
39        229             40           [illegible] 
38        217             38            95 
37        197             35            94 
36        182             32            93 
35        172             30            92 
34        167             29            91 
33        157             28            91 
32        145             26            91 
31        128             23            89 
30        123             22            89 
29        118             21            88 
28        110             19            87 
27        100             18            87 
26         91             16            86 
25         87             15            85 
24         79             14            84 
23         74             13            83 
22         67             12            82 
21         57             10            81 
20         50              9            80 
19         46              8            78 
18         38            [illegible]    76 
17         31              5            75 
16         25              4            74 
15         23                          [illegible] 
14         20 
13         17              3            72 
12         10              2 
11          8              1            65 
10          6              1 
 9          4              1 
 4          3              1 


* Wechsler (11, p. 19). It should be remembered that for a sample of this popula- 
tion (N = 91), the correlation between the Raven and WAIS total scores was only 
.75. 


D. Norms 


Table 2 gives the percentile ranks for the raw scores of the population in 
this study of 567 (black and white) veterans on the Raven Progressive 
Matrices (9), along with estimated WAIS IQ “equivalencies” [See Wechsler 
(11, p. 19)]. 
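
How a percentile rank follows from a cumulative frequency can be sketched as below. This is an illustration under stated assumptions, not the procedure used to build Table 2: the function names are hypothetical, the percentile rank is taken as cumulative frequency over N, and the IQ conversion assumes a normal curve with mean 100 and SD 15. The tabled equivalencies come from Wechsler's table (11, p. 19), which this formula only approximates.

```python
from statistics import NormalDist

N = 567  # total number of answer sheets in the study


def percentile_rank(cumulative_frequency, n=N):
    """Percent of the sample scoring at or below a raw score (assumed definition)."""
    return 100.0 * cumulative_frequency / n


def normal_curve_iq(percentile, mean=100.0, sd=15.0):
    """Approximate IQ at a given percentile, assuming a normal curve (mean 100, SD 15)."""
    z = NormalDist().inv_cdf(percentile / 100.0)
    return mean + sd * z


# Raw score 50 has a cumulative frequency of 443 in Table 2.
print(round(percentile_rank(443)))
print(round(normal_curve_iq(50)))
```

The sketch closely tracks many of the tabled percentile ranks but not every entry, so it should be read as a check on the logic of the table rather than as a replacement for it.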

In view of the small negative correlation between age and Raven scores 
(r = -.27), and in view of the probability that with an untimed administra- 
tion, the effect of age on score up to age 64 is not very significant, this table, 
for practical purposes, ignores the effects of age on scores. However, due to 
the size of the mean (M = 40), the scatter of scores (SD = 12), and the 
actual (60-4) and possible (60-0) range of scores, the closer scores approach 
the mean plus or minus 2 SD, the less valid are the WAIS IQ “equivalen- 
cies.” 

Moreover, with respect to validity, concurrent or construct, one should, in 
any discussion of “equivalencies,” be aware of the limited correlation between 
Raven and WAIS total scores: on a sample of this population (N = 91), 
.75 (3, p. 248); with Shaw’s (10)² population (N = 83), .83. 

Table 2 is offered as a practically adequate norm table of percentile ranks 
and WAIS IQ “equivalencies,” derived from and for use with an American 
(black and white) veteran or similar adult male population. 




E. Summary 


Previously published studies with East Orange VA Hospital patients have 
indicated the concurrent and construct validity of Raven’s Progressive Matrices 
(9) as a nonverbal, untimed measure of intellectual functioning. An odd- 
even split-half corrected reliability coefficient, .96, was obtained for the total 
(N = 567) group of scores, and for the scores of 10-year age-span groups 
from 26-64. A norm table of percentile ranks derived from the 567 cases, with 
estimated WAIS IQ “equivalencies,” is presented as applicable to any similar 
(black and white) American adult male population, age range 18-64. 


2 For similar norm tables derived from smaller populations, see Shaw (10) and Peck 
(7). Part of Peck’s Table 3 is identical with Table 2. See also the “Hypothet- 
ically True Norms” of Orme (6). 


STATUS, INTEREST, AND PROXIMITY AS FACTORS IN 
INTERACTION AND COMMUNICATION CHANNELS* 


University of California, Berkeley 


James E. Frisch and Sheldon Zedeck¹ 


A. Introduction 


Tiffin and McCormick (10) define communication as the process of trans- 
mitting ideas or thoughts from one person to another for the purpose of creating 
understanding in the thinking of the person receiving the communication. The 
important aspect of the definition is the need for creating understanding. To 
fulfill this requirement, effective interaction patterns and communication 
channels must exist. Rapidity, facility, and accuracy are interdependent ne- 
cessities of successful communication chains. Varying these factors will di- 
rectly affect the flow of information and efficiency in any group. When the 
flow of communication is rapid and accurate, the result is a greater degree of 
orderliness and structure, whereas diffuse and random communication cre- 
ates disorganization with respect to specific traits required and the involved 
responsibilities. An informed worker who knows about his organization and 
about the particular group he is in, to such a degree that he feels he belongs, 
is a more satisfied worker (6). Being part of an involved, pleased work group 
provides for increased morale (4). 

Analyses of communication systems in industry have been an area of increas- 
ing research interest during the past 15 years. The range and quantity of 
such research is indicative of the importance it holds. The direction of flow 
of information, the characteristics of senders and receivers of information, the 
amount of information transferred, hierarchical influence in interaction, and 
structure of the organization in influencing communication have been the 
main areas of research. Such variables as status or position of receiver and 
sender in hierarchy (1), type of information being transferred (3), and input 
and output requirements of position (4) have been investigated as potential 
barriers to information transfer or communication flow. The areas of psycho- 
logical distance (e.g., prestige and status differences) have been examined 
extensively, whereas physical distance or proximity has been virtually ignored. 


* Received in the Editorial Office on August 7, 1972, and published immediately at 
Provincetown, Massachusetts. Copyright by The Journal Press. 
1 Reprints and complete analysis of variance results are obtainable from the second 
author (Sheldon Zedeck) at the Department of Psychology, University of California, 
Berkeley, 94720. 

Proximity is the physical separation of two individuals. Unlike physical 
proximity, psychological distance involves perception of one another. Differ- 
ences in status, prestige, age, interests, etc. influence the perceived distance 
between people. Psychological distance is relatively subjective and can be 
measured by dichotomizing on a scaled variable. Physical distance, on the 
other hand, is determined by looking at seating arrangements or measuring 
the actual distance between people in a work situation. 

Hare and Bales (5) found that centrality of seating position and distance 
between members can be used to predict the interaction patterns of people 
(with whom, where, and how they communicate) in a laboratory setting. This 
is complicated by the fact that personality variables are related to seating 
choice and therefore can affect interaction rate (frequency of interaction). 
Consequently, it may not be seating and distance that determine interaction, 
but personality that influences the distance and position chosen by the person. 

Barlund and Harland (1), too, have studied communication patterns in 
relation to proximity. Sections of sorority houses located in clusters were 
compared to those geographically isolated. A sampling was made of the com- 
munication channels within the sorority system by means of a questionnaire 
that was administered each evening. Results indicated that the physical setting 
in which people interacted probably set limits upon and determined the 
frequency of communication in the early stages of interpersonal relations. Once 
a social system evolved from that communication, however, and began to 
elaborate itself, physical forces intervened that tended to polarize these channels 
of communication around high status figures. 

Similarity of interests was found to be a determinant of friendship and, 
therefore, of psychological distance (2). Pairs were deliberately set up in 
a college residence hall, placing together (increasing proximity) those of 
[illegible] and background (small psychological distance) in an attempt 
to [illegible] the effect of similar and dissimilar students on one another. 
Accord[illegible], propinquity was also a significant determinant of these 
friendship patterns. 

[Illegible] other things being equal, people are most [illegible]. [Attrac]tion 
will vary with the extent [illegible] people perceive relevant and important 
objects [illegible] in a similar fashion (i.e., possess similar interests). The 
natural outcome of such attraction and contact would be increased commu- 
nication. 

From the literature it appears that there are three factors that influence 
communication channels. The first of these is ecological and stems from the 
physical setting in which interaction occurs. The second determinant is atti- 
tudinal or psychological and is a result of differences in status privilege or 
authority. The third is perceived similarity of interests. Thus, the purpose of 
this study is to examine the effects of, and interaction between (a) status, (b) 
physical proximity, and (c) interest area upon the number of contacts in a 
communication system. 


B. Method 

1. Subjects 


Measures of communication for graduate students in the Department of 
Psychology (University of California, Berkeley) were obtained. Fifty-seven 
graduate students were initially provided with recording booklets for data 
collection during the first week of a 10-week academic quarter. Thirty-three 
records were usable. A second data collection period during the last week of 
the quarter yielded 22 usable records. The purpose of the second collection 
was to examine changes in communication patterns over time. 


2. Independent Variables 


a. Proximity. Five offices (1, N = 10; 2, N = 7; 3, N = 5; 4, N = 5; 
and 5, N = 6) are provided for graduate students. The office spaces for 
Teaching Assistants (TAs) were arranged according to the specific class being 
taught. TAs for each particular course were located together in the same 
office, regardless of any other factor. Two offices were occupied by graduate 
students who were not TAs and who did not have regular meetings or office 
hours. Their structured or forced interaction was, therefore, at a minimum. 
Proximity is the distance separating each subject from another in their offices. 
Proximity was viewed as intra- or interoffice in nature. 

b. Status. Status was dichotomized according to how many years the 
participant was a graduate student: below two years of study (Status I, N = 
17) and above two years (Status II, N = 16). 

c. Graduate Study Interest. Interest was determined by the person’s field 
of specialization within the Department of Psychology. Three areas of 
specialization were as follows: clinical (A, N = 6); physiological, experi- 
mental (B, N = 5); and industrial, cognitive, developmental, mathematical 
(C, N = 22). 


3. Dependent Variable 


The dependent variable was the number of contacts. Rather than utilizing 
an experimenter’s judgment, S himself determined and recorded significant 
contacts. Since S is more aware of important exchanges and the existence of 
superficial contacts, he is in a better position to determine significant com- 
munication. 


4. Procedure 


Each participant recorded and submitted a record of the number of con- 
tacts with other participants. A limitation that the contact occur within the 
Department of Psychology building was imposed. Since status of participants 
versus proximity of office location was one concern of the study, contacts due 
to some other cause, such as a chance meeting outside, were not counted. In 
addition, certain procedures were followed to insure uniform recording of the 
data by Ss: (a) if several people met, then all those who spoke recorded con- 
tact with each member of the group (if the contact was seen as significant) ; 
a nonspeaker did not record the meeting as a contact, and (b) if social and 
business items were discussed in one conversation, then two contacts were 
tallied for each person. 


5. Analyses 


An analysis of variance design with repeated measures was used to ascertain 
the effects of and interaction between status, interest, and proximity. The 
main variables were dichotomized into intra- and intergroupings, resulting 
in a 2 × 2 × 2 design. 

A recently developed analysis for investigating the structure of large, com- 
plex networks (8) was also employed. This analysis permits an identification 
of groups, isolates, and various other agents of communication transfer. Iden- 
tifying these individuals enables one to study factors influencing the formation 
of cliques and the movement of messages. For the purposes of this paper, only 
the initial analyses are pertinent; i.e., to examine the overall structure of the 
communication channels and determine the relative importance of interest, 
status, and proximity as links between subjects. The steps in the structuring 
or ordering process are (a) calculating a weighted mean of reciprocal contacts 
([illegible] number to each participant, [illegible] contactees’ subject numbers 
[illegible] dividing this sum by the sum of the weighting factors), (b) rank- 
ordering the weighted means, (c) replacing old contact numbers with new rank 
numbers, and (d) repeating the process until the rank order remains stable 
(convergence). The result of this process will be an ordering of individuals, 
with those who speak to one another being next to each other. While in 
Richards’ study each individual belonged in one and only one category, the 
present study involves three categories to which each subject belonged. 
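
The iterative reordering steps listed above can be sketched roughly as follows. This is a loose illustration and not Richards' (8) procedure: the function name and contact matrix are hypothetical, the weighting factors are assumed to be the reciprocal contact counts, and each participant's own current position is averaged in as an added damping assumption (not described in the source) so that pairs do not simply swap places on every pass. An iteration cap is included because convergence is not guaranteed in general.

```python
def reorder_by_contacts(contacts, max_iter=100):
    """Order participants so that frequent contacts end up adjacent.

    contacts[i][j] is the (assumed symmetric) number of reciprocal contacts
    between participants i and j. Returns position[i], the 0-based rank of i.
    """
    n = len(contacts)
    position = list(range(n))  # arbitrary initial numbering of participants
    for _ in range(max_iter):
        means = []
        for i in range(n):
            total_weight = sum(contacts[i][j] for j in range(n) if j != i)
            if total_weight == 0:
                means.append(float(position[i]))  # an isolate keeps its place
                continue
            # (a) weighted mean of the contactees' current numbers
            weighted = sum(contacts[i][j] * position[j]
                           for j in range(n) if j != i) / total_weight
            # damping assumption: average with the participant's own position
            means.append(0.5 * position[i] + 0.5 * weighted)
        # (b) rank-order the weighted means (ties broken by current position)
        order = sorted(range(n), key=lambda i: (means[i], position[i]))
        # (c) replace the old numbers with the new ranks
        new_position = [0] * n
        for rank, i in enumerate(order):
            new_position[i] = rank
        # (d) stop once the rank order is stable
        if new_position == position:
            break
        position = new_position
    return position


# Hypothetical data: participants 0 and 2 talk often, as do 1 and 3.
contacts = [
    [0, 0, 5, 0],
    [0, 0, 0, 5],
    [5, 0, 0, 0],
    [0, 5, 0, 0],
]
print(reorder_by_contacts(contacts))  # -> [0, 2, 1, 3]
```

In this toy example the two talking pairs end up in neighboring positions (0 next to 2, and 1 next to 3), which is the kind of adjacency the ordering is meant to reveal.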


C. Results 


Intraoffice contact exceeded communication to any other particular office, 
but surpassed the total number of contacts to these outside offices in only sev- 
eral cases. The average number of intraoffice contacts was 30.0, 10.8, 26.2, 
25.6, and 15.3 for offices 1, 2, 3, 4, and 5, respectively. The average number 
of interoffice contacts ranged from 0.0 (between offices 3 and 4) to 8.0 (be- 
tween offices 1 and 2). If distance determined communication, then commu- 
nication should have been greater between offices of greater adjacency (i.e., 
offices 1 and 2 or 2 and 3 should interact more than 1 and 5 or 2 and 5). 
While offices closer to one another tended to have a higher number of com- 
munications, there was considerable fluctuation. 

The within group trend of greater contact also held for the interest and 
status variables. The average number of intrainterest contacts was 16.5, 22.8, 
and 21.9 for interest areas A, B, and C, respectively. Average interinterest 
contacts ranged from 1.0 (between A and B) to 14.0 (between A and C). 
Average intrastatus contacts were 27.8 and 20.0 for status levels I and II, 
respectively. Status II reported an average of 17.1 contacts with status I, 
whereas status I reported an average of 11.6 contacts with status II. (The 
discrepancy is a result of different perceptions of contacts, a problem significant 
in this type of research.) 

Results for the analysis of variance of data collected during the first week 
of the quarter indicated that the main effects of status (F = 16.8, df = 1, 
32) and interest (F = 21.6, df = 1, 32) were significant (both at p < .01). 
There were no significant interaction effects. 

In the replication of the study, interest and status both were significant 
(F = 9.1, df = 1, 21, p < .01 and F = 5.0, df = 1, 21, p < .05, respec- 
tively). This time, however, interaction effects existed for interest and status 
(F = 6.7, df = 1, 21, p < .05) and for status and proximity (F = 5.3, 
df = 1, 21, p < .05). The difference in contacts was considerable between 
Status I and Status II for those of similar interest, but there was no difference 
between I and II with respect to interinterest contacts. Likewise, the more 
experienced graduate students communicated considerably less with people 
outside their offices than did newer students, but with no difference in com- 
munication within offices. 

The network analysis proposed by Richards (8) looks at the overall struc- 
ture of the communication interactions and attempts to determine the various 
ways the components are linked together. A rank ordering and reranking many 
times is performed until people most similar to one another (on the basis of 
analogous communication patterns) are placed together. Results of this 
analysis indicated that interest and status were more important influences of 
communication than was proximity. Richards’ analysis was not attempted for 
the replication because of two concerns. First, as already mentioned, it is 
difficult to apply the analysis when more than one variable is used. Second, 
the significant interaction effects revealed by the ANOVA compound the problem 
of interpretation. 


D. Discussion 


The main purpose of this study was to determine the importance of status, 
interest, and proximity in communication channels. Both analysis of variance 
and Richards’ analysis indicated that status and interest were each influencing 
the number of contacts. The result of the replication indicated that there was 
interaction between interests and status and between status and proximity 
signifying that the effects of status upon communication differ for different 
interest groups and different proximity groups. 

It is likely that the interaction was a function of the nine-week interval 
between recordings. During this period, newer students (Status I) sought out 
and communicated more with students of similar interest, which required 
more contact with other offices. 

With respect to Richards’ (8) analysis, it is not clearly discernible which 
category (characteristic) is the more important in influencing the adjacency 
of individuals. Results indicate possible causes for this adjacency; the char- 
acteristics that adjacent individuals have in common are indicative of the 
factors that influence the formation of communication channels. In other 
words, because neighboring individuals communicate with one another, and 
a cause must exist, it is likely that common membership in a category is a 
[illegible] individuals. In fact, it is a similar [illegible] interest, status, or 
both. In contrast, [illegible] as the only similar factor, between [illegible]. 

It is important to note that a contact, as measured, is a combination of 
three factors: interest, status, and proximity. There is a need to distinguish 
between the three when making assumptions about communication patterns. 
A situation could exist in which person A is bounded by two people, one of 
whom has a different status from his own. Any differences in communication 
between A and each of these people (holding interest constant) may be at- 
tributed to the status differentiation that exists between them, since proximity 
is identical. Similarly, Person B could be in contact with two people of the 
same status but of different adjacency. Differences in communication patterns 
between B and each of them, therefore, could be attributed to propinquity. 
Further analyses of pairs of dyads within other offices would provide addi- 
tional data. In a like manner, such dyads may be found between offices or 
separated by offices rather than just by desks, and holding status constant, 
proximity may be investigated. By locating many identical dyads, mean in- 
teraction rates can be determined for a group. In pinpointing these identical 
dyads, any differences which may be significant can be attributed to the vari- 
able in question: i.e., interest, status, or propinquity. 

At this point something should be said regarding the subjectively viewed 
significance of contacts. As mentioned earlier, the primary purpose of subject 
self-recording is the superior vantage point for trivial versus meaningful com- 
munication exchanges. In this respect a more accurate, representative measure 
of contact between individuals is recorded. Its use has gained in popularity 
recently (1, 9), possibly because its shortcomings have been neglected. One 
of the initial problems encountered is the optimal technique of individuals’ 
recordings. If the contacts are recorded at the conclusion of each day, then 
memory failures are likely to reduce the number of subjects’ contacts. While 
such selective forgetting may eliminate trivial contacts, some significant com- 
munication may also be lost. On the other hand, constant recording of data 
as they occur is not only laborious but confounding as well. The tediousness 
of the task provides an incentive to actually avoid some contacts or delay 
them for future meetings when thoughts accumulated during the course of 
the day can be discussed. Clearly, the deliberate avoidance of contact is 
something to be averted. In the present case the recordings, on occasion, be- 
came an enhancer of contact. The novelty of a questionnaire in communica- 
tion initially spurred on subjects to create contacts in order to record them. 
It became, at one point, a topic of conversation in itself, and therefore, pro- 
vided some sort of basis upon which people could communicate. On one 
occasion it was observed that two individuals greeted one another saying 
“Contact.” This subsided rapidly, though, and was eliminated before the 
completion of the first day. Problems similar to this would appear to be 
inherent in studies utilizing this method, unless the situation can be highly 
structured, and even this may not be an adequate solution. Essentially, 
unobtrusive measures are needed (11). 

A question arises as to the applicability of such analyses and data to or- 
ganizations, and to the degree of usefulness it possesses. In the business 
environment, as in the Psychology Department, no choice of seating or ar- 
rangement of offices is given. Similarly, there is contact between members of 
varying rank in both situations. However, the important question pertains 
to the power differences that exist within each group. As there are differences 
in rank among workers and graduate students, so there are differences in 
power within each group. This may be greater authority or predominance 
as far as privileges are concerned. In neither case (the graduate student and 
the worker) is it direct power over another person. The graduate student, 
by virtue of higher status (i.e., years in program, knowledge of more people) 
may function more efficiently or with greater facility than those of lower 
status. He may be accorded special privileges or may not be required to do 
items that others must. If he were suddenly to take greater advantage of his 
position, he may be said to “pull rank,” an indication of his power difference. 
The worker, in a like manner, may have a superior position over others (i.e., 
be a supervisor). Should the need of correction or replacement arise, though, 
he would often have to take it to a higher authority than himself to bring 
about the change. It is not as direct a power hold as first suspected. Con- 
sequently, they are alike in this respect. Undoubtedly, it is a semantic battle 
being waged here to insure placing the correct limitations upon the results. 
Admittedly, there do exist some differences between the groups, but it is 
likely that even within a specific level, a status delineation does exist, whether 
it be formal or not. 

As far as spatial arrangements affect frequency of interaction, they will 
also tend to affect relationships of affinity. Thus they affect the attraction 
structuring of a close contact (physical contact) group. The effectiveness and 
success of any organization depend upon trenchant communication and 
communication channels. The altering of the physical structure may provide 
the necessary push towards such successful channels. 


E. Summary 


Three criteria in the formation of communication channels (status of re- 
ceiver, physical nearness, and interest) were examined. Preliminary analysis 
of the self-recording of communications for 33 graduate students in psy- 
chology indicated that they tended to have greater contact with those most 
similar to them (of equal status and interest) rather than those located nearer 
to them (of greater proximity). A replication (N = 22) revealed a status- 
interest and status-proximity interaction. In addition, a new method of 
communication analysis (8) which focuses on particular carriers of com- 
munication was studied. 


WOMEN, TODAY AND YESTERDAY: AN EXAMINATION 
OF THE FEMINIST PERSONALITY* ¹ 


University of Florida 


Marguerite Gilbert Fowler and Hani K. Van De Riet 


A. Introduction 


The current feminist movement has drawn much attention to women—in 
particular, today’s women. Amidst the well publicized clamor for equality, a 
new breed of women is emerging in the public’s eye. This conception of 
liberated womanhood frequently and implicitly characterizes the women who 
subscribe to the philosophy of the movement as being cold, calculating, emascu- 
lating, and altogether basically different from their “sisters” who appear to 
be contented at home, and certainly different from their mother’s generation 
as well. How different, however, is anyone’s guess, for despite the obvious 
din of the movement to date, few, if any, attempts of an empirical nature have 
been made to examine and document various dimensions of the movement's 
advocates. 

Such an opportunity was recently provided, however, at the University of 
Florida. The University hosted a women’s conference entitled “Off the 
Pedestal Into the World.” The conference was well attended by several 
hundred women and a number of men, and featured noted feminist speakers, 
as well as other prominent women in politics and academics. The conference 
provided invited addresses along with smaller group sessions featuring a 
myriad of problem area topics from which to choose. The aim was to provide 
interested women the opportunity to come together for a common purpose— 
that of hearing about and being heard on those issues deemed crucial to the 
movement, with hopes that new levels of consciousness would evolve for the 
participants. The enthusiasm generated at the conference and the extensive 
involvement of the participants with the issues indicated that it was a worth- 
while and perhaps soon to be repeated experience. 

The conference provided the opportunity to have a firsthand glance at 
concerned and active feminists, and to compare and contrast them with other 
segments of the female population. No a priori hypotheses were made; but, 
instead, normal and relevant personality dimensions were examined in hopes 
of confirming, dispelling, or modifying the currently held feminist image. 


* Received in the Editorial Office on August 7, 1972, and published immediately at 


B. Method 

1. Subjects 

The Ss were 63 women who represented diverse groups of interest and age 
levels. These groups were feminist women (n = 18; mean age = 22.4); 
university undergraduate women (n = 16; mean age = 24.5); elderly 
noninstitutionalized women of a Veterans Administration Research Unit on 
Aging (n = 12; mean age = 67.6); and elderly institutionalized women in 
a V.A. domiciliary (n = 17; mean age = 67.5). All Ss participated on a 
voluntary basis, except for the university women, who participated on a semi- 
voluntary basis. Students in the psychology program in which they were 
enrolled were required to participate in experiments, but had a choice of 
research in which to participate. The four groups were chosen to represent a 
spectrum of women from the “liberated” to the institutionalized, including 
generational diversity. 


2. Materials 


The research instrument was the Adjective Checklist (3). It consists of 
300 adjectives describing normal traits which comprise various scales. It is 
self-administered, complete with standard directions, and can be completed 
in 15-30 minutes. The scales employed in this study were chosen for their 
relevance in differentiating among the particular groups of women involved. 
The scales chosen were (a) Self-Confidence, (b) Self-Control, (c) Personal 
Adjustment, (d) Achievement, (e) Dominance, (f) Endurance, (g) Nurtur- 
ance, (h) Affiliation, (i) Autonomy, (j) Aggression, (k) Succorance, and 
(l) Deference. 


3. Procedure 


The elderly institutionalized (DOM) sample and the university (UNIV) 
sample were given the inventory in groups at their respective facilities. 
This was not feasible for either the elderly noninstitutionalized (RUA) 
sample or the feminist (FEM) sample. These latter two groups were given 
their materials in an envelope, and asked to return them by mail upon 
completion. They were also cautioned not to take materials unless they intended 
to complete them, which resulted in few response failures. 

In addition to the one checklist on which each S was instructed to describe 
her “Real” Self (as she was), the feminist women were also given a second 
checklist on which they were instructed to describe their “Ideal” Self (as 
they would like to be). This was intended to provide some insight into 
self-perceived image discrepancies, as well as potentially sought changes 
for this group vis-a-vis subscribing to the feminist movement. 


C. RESULTS 


All Ss’ scores on each of the scales were converted to standard scores by 
means of tables provided in the inventory manual which corrected for total 
number of adjectives checked. This technique classifies each protocol into 
one of four female categories, and uses standard scores calculated for that 
category in order to reduce response set artifacts (3). Since no a priori 
predictions were made, an analysis of variance comparing the mean scores 
of all four groups was performed for each of the 12 scales. Highly significant 
results were obtained on the Autonomy scale, F(3, 59) = 8.21, p < .01, the 
Aggression scale, F(3, 59) = 5.59, p < .01, and the Deference scale, 
F(3, 59) = 8.045, p < .01. Subsequently, Tukey’s HSD (honestly signifi- 
cant difference) tests for multiple pairwise comparison among means were 
computed for those scales yielding significant F effects. 
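
The degrees of freedom reported for these F tests follow directly from the group sizes given under Subjects. The short sketch below is illustrative only; the function and variable names are hypothetical and are not part of the original analysis.

```python
# Group sizes as reported in the Subjects section.
group_sizes = {"FEM": 18, "UNIV": 16, "RUA": 12, "DOM": 17}


def one_way_anova_df(sizes):
    """Return (between-groups df, within-groups df) for a one-way ANOVA."""
    k = len(sizes)                  # number of groups
    n_total = sum(sizes.values())   # total number of Ss
    return k - 1, n_total - k


df_between, df_within = one_way_anova_df(group_sizes)
print(df_between, df_within)  # 3 59
```

With four groups and 63 Ss in all, this gives the (3, 59) degrees of freedom attached to each F above.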

Post hoc pairwise comparisons showed that both the FEM and UNIV 
women significantly exceeded the DOM women on the Autonomy scale 
(Table 1), while the FEMs also surpassed the RUA women on this scale. 
Comparisons on the Aggression scale also showed significant differences in 
the same directions between the FEM and the DOM women, as well as 
between the UNIV and the DOM women (Table 1). Significantly higher 
Deference scale scores were found in both the DOM and RUA women com- 
pared to the FEM women and in the DOM women with respect to the 
UNIV women. 

Equally noteworthy were the relative trends of sample means on each of 
the above scales. As can be seen in Table 1, the FEM women manifested 
the most Autonomy and Aggression, with the UNIV women, RUA women, 
and the DOM women declining in that order. The Deference scale, also 
shown in Table 1, reflects the reverse trend, with the FEM women mani- 
festing the least Deference, and the UNIV women, RUA women, and finally 
the DOM women succeeding in that order. 
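
The pairwise mean differences entered in Table 1 are simple differences between group means. As a hedged illustration, the sketch below recomputes them for the Autonomy scale from the reported means. It does not reproduce the HSD significance decisions, which also require the error mean square from the analysis of variance (not reported here). The names used are hypothetical.

```python
from itertools import combinations

# Autonomy scale means as reported in Table 1.
autonomy_means = {"DOM": 44.941, "RUA": 51.083, "UNIV": 55.000, "FEM": 60.667}


def pairwise_differences(means):
    """Return {(lower group, higher group): difference} for every pair."""
    ordered = sorted(means.items(), key=lambda item: item[1])
    return {
        (low, high): round(m_high - m_low, 3)
        for (low, m_low), (high, m_high) in combinations(ordered, 2)
    }


diffs = pairwise_differences(autonomy_means)
for pair, diff in diffs.items():
    print(pair, diff)
```

The six values produced correspond to the six entries of the Autonomy block of Table 1, for example 15.726 for the DOM and FEM pair.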

TABLE 1 
TUKEY’S HSD TEST: AUTONOMY, AGGRESSION, AND DEFERENCE SCALES 

                         Pairwise mean differences 
Group     Means       DOM        RUA        UNIV       FEM 

Autonomy Scale 
DOM       44.941       --       6.142     10.059*    15.726* 
RUA       51.083                  --       3.917      9.584* 
UNIV      55.000                             --       5.667 
FEM       60.667                                        -- 

Aggression Scale 
DOM       48.529       --       4.721      9.784*    13.304* 
RUA       53.250                  --       5.063      8.583 
UNIV      58.313                             --       3.520 
FEM       61.833                                        -- 

Deference Scale 
DOM       53.706       -- 
RUA       50.500      3.206       -- 
UNIV      43.625     10.081*     6.875       -- 
FEM       39.444     14.262*    11.056*     4.181       -- 

* p < .05. 


The mean standard score for the FEM women on each scale was compared 
with the mean score of the standardizing group norms for females (3). The 
results of these t tests with df = 16 revealed several differences significant 
beyond the .05 level. The FEM women were significantly more Autonomous 
(t = 4.399), Aggressive (t = 4.880), Dominant (t = 2.956), and 
Self-Confident (t = 2.291) than the normative female group. The FEM women 
were, however, also significantly less Deferent (t = 4.353) and Affiliative 
(t = 2.543) than the normative group. 

Correlated (two-tailed) t tests between the feminists’ perceived Real and 
Ideal Selves also yielded significant differences, as noted in Table 2. The 
Ideal Self significantly exceeded the Real Self in Nurturance, Endurance, 
Self-Confidence, Self-Control, and Personal Adjustment. The Real Self, how- 
ever, significantly exceeded the Ideal Self in both Aggression and Succorance. 
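
For readers unfamiliar with the correlated (paired) t test, the following minimal sketch shows the computation. The scores are invented for illustration and are not data from this study, and the function name is hypothetical. With 17 complete pairs, as in Table 2, the same formula gives df = 16.

```python
import math


def paired_t(first, second):
    """Correlated (paired) t statistic and df for two matched score lists.

    Raises ZeroDivisionError if all paired differences are identical.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equal-length lists with at least two pairs")
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)  # sample variance
    t = mean_d / math.sqrt(var_d / n)
    return t, n - 1


# Hypothetical Ideal and Real standard scores for four Ss.
ideal = [52, 55, 49, 60]
real = [50, 51, 48, 55]
t_value, df = paired_t(ideal, real)
print(round(t_value, 3), df)
```

A positive t here means the first list (Ideal) exceeds the second (Real) on average, which matches the sign convention of Table 2.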


D. DISCUSSION 


The rank order of sample means on the three scales with significant F 
effects is particularly interesting. As might be expected, the “liberated” 
feminist women exemplify the most Autonomy and Aggression, with the 
university women following, then the elderly noninstitutionalized women 
and finally the elderly institutionalized women. This seems like a logical 
progression, when one considers the definitional significance of the four 
groups. The reverse trend for Deference, with the feminists manifesting the 
least, followed by the university women, then the elderly noninstitutionalized 
women, and finally the elderly institutionalized women, also seems consistent 
with relative expectations held for younger and/or “liberated” women in 
comparison with older and/or institutionalized women. 


TABLE 2 
COMPARISONS OF REAL VS. IDEAL SELVES OF FEMINIST WOMEN ON THE ACL 

Scale                   X̄ Ideal    X̄ Real    t (df = 16) 

Nurturance               52.00      46.82       2.251* 
Affiliation              50.82      44.41       2.083 
Autonomy                 59.18      59.76      -0.295 
Aggression               52.65      61.41      -3.166** 
Succorance               38.18      46.53      -7.091** 
Deference                37.76      40.06      -1.011 
Endurance                58.24      51.53       3.015** 
Self-confidence          62.18      55.71       2.457* 
Self-control             53.00      46.88       2.923** 
Personal Adjustment      55.88      47.53       3.484** 
Achievement              59.94      55.29       2.038 
Dominance                61.35      57.53       1.521 

Note: These scores were calculated on the basis of n = 17, since one set of scores 
was lost by an S inappropriately following directions on her Ideal Self checklist. 
* p < .05. 
** p < .01. 

With reference to the post hoc comparisons, the feminist women can be 
described as significantly more Autonomous than both groups of elderly 
women, but not significantly different from the university women. Both the 
feminist and university women are significantly more Aggressive than the 
elderly institutionalized women, but not significantly distinct from one 
another. These differences may naturally be the result of generational dif- 
ferences: i.e., the age factor or cohort group influences. The age factor would 
include changes and consequently increased differences evolving with age, 
perhaps similarly for all women. Examples of this, broadly speaking, would 
be changes in roles and motives during different stages of life (1). Cohort 
group influences imply generational differences resulting from inherently dif- 
ferent and unique life experiences of each of the cohort groups concerned. In 
essence, each of the two generations compared would be the product of their 
respective lifespans placed in historical and experiential perspective. All of 
which is to say that generational differences can be attributed to age as a 
developmental feature and/or a cultural artifact (2). 

Perhaps the most surprising findings are the expectations set forth by the 
feminist women for themselves in terms of Real and Ideal Selves. The fact 
that the Ideal Self exceeded the Real Self in Nurturance, Self-Control, and 
Personal Adjustment in particular, among others, could be taken as cor- 
roborating evidence by the movement's critics for the disparaging feminist 
stereotype previously described. On the other hand, because these indices in- 
cluding those of Self-Confidence and Endurance are perceived by the feminist 
women themselves as significantly discrepant from their Ideal Self, it may 
well be that attaining the Ideal Self image is one of the very goals among 
those potentiating and ennobling the movement. Additionally, the Real Self 
exceeding the Ideal Self in Aggression and Succorance could be interpreted 
as trait-descriptive of the movement's advocates, or rather as a part of a 
necessary psychological mantle to assure the movement’s success. Both are 
equally probable explanations, and perhaps whichever one is preferably in- 
voked is a function of one’s sympathetic perspective. 

The fact that the feminist women are significantly more Autonomous, 
Aggressive, Dominant, and Self-Confident than women at random, as well as 
less Deferent, seems consistent with the frequently espoused “self-actualiza- 
tion” philosophy of the movement. It was noted, however, that along with 
being less Deferent than the normative group, the feminist women are also 
less Affiliative. The implications of this are intriguing. On the one hand, 
the self-actualizing feminist woman may have to rely less on seeking and 
sustaining numerous friendships in order to derive personal fulfillment, a 
role frequently delineated as naturally feminine (1). Rather, her self-esteem 
is based on her unique abilities and interests, and furthering her individuation 
perhaps at the cost of affiliation. On the other hand, it could be that the 
charges leveled at the feminist women by their critics, to the effect that the 
feminists are not for other women, but against men, may have some merit. 

One other observation is worthy of comment, that concerning the lack of 
any significant differences between the feminist and the university women on 
any of the personality dimensions. Insofar as this instrument would obtain, 
the feminist women do not appear to be significantly different from their 
contemporary peers. This is to say that university women and others of like 
aspiration, who also have gotten “off the pedestal and into the world,” may 
be more like than unlike the feminist women when core personality dimensions 
are considered. Could it be that with each current generation of university 
females educated (an increasingly common social phenomenon), another 
generation of potentially active feminists is spawned? In essence, are 
most of today’s bright, increasingly well-educated, and socially conscious 
women basically feminists in sheep’s clothing? 

All things considered, it may be that feminist women are a new “breed” 
when contrasted with previous generations, a different “breed” than their 
more contented “sisters” at large, and essentially the same breed as the up 
and coming generation of women. It could be said that today’s “liberated” 
woman can be characterized as necessarily more Autonomous, Aggressive, 
and less Deferent than yesterday's woman. Additionally, she is also more 
Dominant, Self-Confident, and less Affiliative than women in general. In 
light of the complexity and demands of life today, such important differences 
on the part of women may even be considered assets and perfectly appropriate 
behavior, which could further women’s contributions to a better life for 
everyone. 


E. SUMMARY 


Although the current feminist movement is well publicized, the more 
popular conceptions of the feminist personality and image lack adequate 
empirical validation. This study examined and more accurately documented 
some salient, normal dimensions of the feminist personality. Sixty-three 
women, including the “liberated” and institutionalized, young and aged, were 
administered an objective personality instrument, the Adjective Checklist. 
The feminist women did not differ significantly from their contemporary 
peers, the university women, on any dimensions, but both groups significantly 
exceeded the elderly institutionalized women on the Autonomy scale; while 
the feminist women also exceeded the elderly noninstitutionalized women on 
this scale. The feminist and university women also significantly exceeded the 
elderly institutionalized women on the Aggression scale. Both the elderly 
institutionalized and noninstitutionalized women, however, significantly ex- 
ceeded the feminist and university women on the Deference scale. 

Relative trends indicated that the feminist women manifested the most 
Autonomy and Aggression, with the university women, the elderly noninstitu- 
tionalized and the elderly institutionalized women declining in that order, 
with the reverse trend indicated on the Deference scale. The feminist women 
were found to be significantly more Autonomous, Aggressive, Dominant, 
Self-Confident and less Deferent and Affiliative than a normative group of 
women. Feminist women’s Ideal Self significantly exceeded the Real Self 
in Nurturance, Endurance, Self-Confidence, Self-Control, and Personal Ad- 
justment. The Real Self significantly exceeded the Ideal Self in Aggression 
and Succorance. These findings are interpreted in terms of generational dif- 
ferences and “self-actualization” influences in the Women’s movement. 
THE EFFECTS OF SEX OF SUBJECT, SEX OF APPROACHING 
PERSON, AND ANGLE OF APPROACH UPON 
PERSONAL SPACE*! 


Brigham Young University 


DARHL M. PEDERSEN AND ANNE B. HEASTON 


A. INTRODUCTION 


Personal space has been defined as “the space immediately surrounding an 
individual which he feels to be personal, to belong to himself” (1, p. 93) and 
as “a body-buffer zone in interpersonal transactions” (4, p. 651). The size of 
the personal space is determined by a number of factors including the sex of 
the people involved and the direction of approach by other people. 

A number of experimenters have examined the effects of the sexes of the 
people involved upon personal space requirements. Long, Ziller, and Hen- 
derson (6) found that females chose positions that maintained less in- 
terpersonal space from others than did males. Horowitz, Duff, and Stratton 
(4) and Horowitz (3) found that when male Ss used personal comfort as a 
criterion for interpersonal proximity they placed a greater distance between 
themselves and a male than between themselves and a female. Dosey and 
Meisels (1) found that males approached members of either sex at about the 
same distance while females approached females closer than they did males. 

Findings regarding personal space as a function of angle of interpersonal 
orientation are somewhat inconsistent. Sommer (8) and Hare and Bales (2) 
found that women preferred sitting with others in a corner to corner position 
at a table or beside another person, whereas males preferred other seating 
arrangements, especially opposite positions. Lewit and Joy (5) found that 
people facing one another or in typical bodily orientations maintained closer 
interpersonal distances than those facing away or in unusual body positions. 
Rawls, Trego, and McGaffey (7) discovered no significant differences 
between distances when another person walked up to the back, front, left, and 
right sides of a stationary person. 
There have been some attempts to develop simulated measures of personal 
space. However, they have not been good predictors of similar behavioral 
measures. Dosey and Meisels administered a simulated personal space measure 
where Ss drew silhouettes representing themselves and silhouettes of people 
standing near them. They found that there was little relationship between the 
simulated measure and the actual behavioral measure and cautioned against 
discussion of personal space without consideration of the method used. Rawls, 
Trego, and McGaffey (7) found a correlation of only .34 between a be- 
havioral, personal space measure and a simulated one where S drew a circle 
around a stylized figure to represent his personal space. 

The purpose of the present study was to determine the effects of sex of S, 
sex of approaching person, and angle of approach on personal space for both 


a simulated personal space measure and an actual behavioral personal space 
measure. 


B. METHOD 

1. Subjects 


Ss were 40 students (20 males and 20 females) between the ages of 18 and 
30 enrolled in a psychology class at Brigham Young University. 


2. Measuring Instruments 


Pedersen Personal Space Measure (PPSM). The PPSM was developed 
to measure the personal distance of male and female Ss being approached at 
nine different angles by both males and females. The measure consisted of 18 
8½″ × 11″ pages, each with a top-view profile representing the S. A male 
profile was used for males and a female profile for females. In the upper 
left-hand corner of each page was attached another top profile that 
represented an approaching person. For nine of the pages the top profile was of a 
person of the same sex as S, and for the other nine it was of a person of the 
opposite sex. Radiating from the profile representing S was a four-inch line 
at one of nine different angles. The angles were measured starting at 0° on 
the right side of the profile going to 180° on the left side. The nine angles 
used were 0°, 22½°, 45°, 67½°, 90°, 112½°, 135°, 157½°, and 180°. 

The Ss were instructed to remove the gum-backed profile representing the 
approaching person and place it on the four-inch line facing the S-profile so 
that the resulting distance was as close as possible without producing a 
feeling of discomfort for S. Hence, the PPSM called for the placement of 
both male and female profiles at each of nine different angles. The 18 different 
approaching person and angle combinations were presented in random order. 
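
As an illustration of the layout just described, the sketch below enumerates the 18 approaching person and angle combinations and places them in a random order. It is a hypothetical reconstruction for clarity; the names are invented, and the original report does not state how the random order was generated.

```python
import random
from itertools import product

# Nine angles from 0 degrees (right side) to 180 degrees (left side).
ANGLES = [step * 22.5 for step in range(9)]
# Sex of the approaching profile relative to the S.
APPROACHER = ["same sex", "opposite sex"]


def ppsm_page_order(seed=None):
    """Return the 18 (approacher, angle) pages in a shuffled order."""
    pages = list(product(APPROACHER, ANGLES))
    rng = random.Random(seed)   # a seed makes the order repeatable
    rng.shuffle(pages)
    return pages


pages = ppsm_page_order(seed=1)
print(len(pages))  # 18
```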

Pedersen Behavioral Personal Space Measure (PBPSM). The PBPSM 
consisted of nine six-foot pieces of tape radiating from a central point. The 
pieces of tape were separated by angles of 22½°, and the angles were measured 
from 0° at the right side to 180° at the left side. Each piece of tape was 
marked off at ¼-inch intervals. The S stood at the point from which the 
pieces of tape emanated. The S faced forward at 90° and was approached 
along one of the pieces of tape by an unfamiliar person. This person approached 
until S indicated that he would feel uncomfortable if the person moved closer. 
E recorded the distance from the tip of S’s toe to the tip of the toe of the 
closest foot of the approaching person to the nearest quarter of an inch. The 
S was approached from each of the nine directions in a random order. S was 
approached by persons of the same sex and of the opposite sex, yielding nine 
scores for each sex. 


3. Procedure 


The PPSM was administered to Ss during a one-hour class period. The 
distance between each S-profile and each approaching profile was subsequently 
recorded to the nearest millimeter. The PBPSM was administered individually 
to each S during a ten-minute experimental session. 


4. Analysis 


A 2 × 2 × 9 factorial analysis of variance was completed for both the 
PPSM and the PBPSM. The main factors were sex of subject, sex of 
approaching person, and angle of approach. Sex of approaching person and 
angle of approach were repeated measures. This analysis permitted a 
determination of (a) differences in personal space between males and females, (b) 
differences in personal space when approached by males versus females, (c) 
differences in personal space when approached from different angles, and (d) 
differences in personal space produced by interactions of two or three of the 
following: sex of subject, sex of approaching person, and angle of approach. 
A Duncan New Multiple Range Test (9) was applied to significant main 
and interaction effects to determine those means that differed significantly 
from the remaining means. 
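
The partition of degrees of freedom in a design of this kind can be sketched as follows. This is an illustration rather than the authors' computation: the function name and effect labels are hypothetical, and the pairing of each within-subject effect with its subject interaction as error term is an assumption, although it agrees with the F ratios that can be recomputed from the mean squares in Table 1.

```python
def mixed_design_df(n_per_sex, sexes=2, approacher_levels=2, angle_levels=9):
    """Degrees of freedom for a design with one between-subjects factor (A)
    and two repeated-measures factors (C, D); subjects (B) nested within A."""
    a = sexes - 1                   # A: sex of subject
    b = sexes * (n_per_sex - 1)     # B: subjects within sex
    c = approacher_levels - 1       # C: sex of approaching person
    d = angle_levels - 1            # D: angle of approach
    return {
        "A": a, "B": b,
        "C": c, "A x C": a * c, "B x C": b * c,
        "D": d, "A x D": a * d, "B x D": b * d,
        "C x D": c * d, "A x C x D": a * c * d, "B x C x D": b * c * d,
    }


df = mixed_design_df(n_per_sex=20)
total_observations = 2 * 20 * 2 * 9          # 40 Ss x 18 scores each
assert sum(df.values()) == total_observations - 1

# F for C from the Table 1 mean squares, assuming B x C is its error term.
f_c = 4732.94 / 217.32
print(df["B"], df["B x D"], round(f_c, 2))   # 38 304 21.78
```

The resulting values (38 for subjects, 304 for the subject by angle terms, 719 in total) match the df column of the summary tables.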


C. RESULTS AND DISCUSSION 


The analysis of variance summary tables for the PPSM and the PBPSM 
are presented in Tables 1 and 2, respectively. The sex of S main effect was not 
significant for either the PPSM or the PBPSM. Males did not have a 
TABLE 1 
ANALYSIS OF VARIANCE FOR THE PEDERSEN PERSONAL SPACE MEASURE 

Source                              df         MS          F 

Sex of Subject (A)                   1       28.01        .01 
Subject (B)                         38     2597.54 
Sex of Approaching Person (C)        1     4732.94      21.78** 
A × C                                1     1290.69       5.94* 
B × C                               38      217.32 
Angle (D)                            8      191.21       2.42* 
A × D                                8      225.40       2.85** 
B × D                              304       79.13 
C × D                                8       40.69       1.17 
A × C × D                            8       13.79        .40 
B × C × D                          304       34.77 
Error                                0 
Total                              719 

* p < .05. 
** p < .01. 

TABLE 2 
ANALYSIS OF VARIANCE FOR THE PEDERSEN BEHAVIORAL PERSONAL SPACE MEASURE 

Source                              df         MS          F 

Sex of Subject (A)                   1       99.76        .00 
Subject (B)                         38      207602 
Sex of Approaching Person (C)        1     5533.36       3.00 
A × C                                1     1881.80       1.02 
B × C                               38     1844.29 
Angle (D)                            8      266.94        .72 
A × D                                8     1222.79       3.29** 
B × D                              304      371.91 
C × D                                8      452.60       2.61** 
A × C × D                            8      113.14        .65 
B × C × D                          304      173.47 
Total 
FIGURE 1 
MEAN DISTANCE TOWARD MALES AND FEMALES FOR EACH OF NINE ANGLES OF 
ORIENTATION FOR MALE AND FEMALE Ss ON 
BOTH THE PPSM AND THE PBPSM 
[Four panels: male Ss approached on the PPSM; female Ss approached on the 
PPSM; male Ss approached on the PBPSM; female Ss approached on the 
PBPSM. Legend: males, females.] 


test revealed that males maintained a significantly greater distance from 
males than they did from females and females did from females, but not 
significantly greater than females approached by males. Females and males 
tended to prefer maintaining the same relative distance from males but not 
from females. Also males approached by females were significantly closer 
than females approached by males. 

The sex of the approaching person was significant at the .01 level on the 
PPSM. This can be seen in Figure 1. Both males and females placed female 
figures significantly closer than male figures. 

Significant differences in personal space for the various angles of the PPSM 
were found. The means for the angles were zero degrees, 15.89; 22½°, 18.06; 
45°, 19.01; 67½°, 18.74; 90°, 18.67; 112½°, 18.89; 135°, 19.67; 157½°, 
19.80; 180°, 15.47. The Duncan test showed that the mean for 180° was 
significantly lower than for all angles except zero degrees and 22½°. And 
the mean for zero degrees was significantly lower than for 157½° and 135°. 
Subjects allowed others to come closer at the sides (zero degrees and 180°) 
than at the other angles. For some unknown reason Ss kept people farther away 
at the far left (135° and 157½°) than otherwise. 

A significant sex of subject × angle interaction was found for the PPSM. 
This interaction is diagramed in Figure 2. The Duncan’s test demonstrated 


FIGURE 2 
MEAN DISTANCE ON THE PPSM AS A FUNCTION OF SEX OF SUBJECT × ANGLE 
[Legend: male Ss, female Ss. Horizontal axis: angle, 0° to 180° in 22½° steps.] 
that the mean distances for females at zero degrees and 180° were significantly 
less than the mean distances at all other angles and were also significantly less 
than distances for males at all angles except 67½° and 90°. Females let 
people approach them closer at the sides than males did for all angles except 
67½° and 90°, the relatively head-on approaches. The distance for males at 
90° was significantly less than the distance for females at 90°. Males let 
people approach frontally significantly closer than women did. 

Neither of the analyses of variance yielded a significant three-way interac- 
tion. This indicated that there was nothing unique about the joint combina- 
tion of sex of subject, sex of approaching person, and angle of approach. 

On the PBPSM significant sex of subject × angle and sex of approaching 
person × angle interaction effects were found. Figure 3 gives the means for 
the interaction between sex of subject × angle on the PBPSM. The patterns 


FIGURE 3 
MEAN DISTANCE ON THE PBPSM AS A FUNCTION OF SEX OF SUBJECT × ANGLE 
[Legend: male Ss, female Ss. Vertical axis: distance in inches. Horizontal 
axis: angle, 0° to 180° in 22½° steps.] 
are similar to those for the PPSM (cf. Figure 2). The means for the females 
on the PBPSM are similar to the corresponding means on the PPSM except 
at zero degrees, 45°, and 180°. For these angles the distances are greater on 
the PBPSM. Females at 45°, 157½°, and 180° had a significantly smaller 
distance than they did at 90° on the PBPSM. And 180° was also significantly 
less than 67½°. This indicates that females let people approach them closer 
at the sides than at the front. Conversely, males at 67½° had a significantly 
smaller distance than at zero degrees or 157½°, demonstrating that males let 
people approach them closer at the front than at the sides. Females at 67½° 
and 90° had a significantly greater distance than males at 45°, 67½°, and 
90°. Females kept a significantly greater frontal distance between themselves 
and others than males did. 

The Duncan test was applied to the means for the significant sex of 
approaching person × angle interaction on the PBPSM. The means are 
plotted in Figure 4. It was found that the mean distances for males 
approaching at 90° and 135° were significantly greater than at 45°, 67½°, and 180°, 
and the mean distance for males approaching at 157½° was significantly 
greater than at 45° and 180°. People tended to let males approach them closer 
slightly off center to the left than they did slightly off center to the right. The 
mean distances for females approaching were not significantly closer at some 
angles than others. Mean distances for males approaching at zero degrees, 
90°, 112½°, 135°, and 157½° were significantly larger than for females 
approaching at 45°, 90°, 112½°, 135°, and 157½°. Means for males 
approaching at 90°, 135°, and 157½° were significantly different from means for 
females approaching at zero degrees and 180°. And mean distances for males 
approaching at 90° and 135° were also significantly different from distances 
for females approaching at 67½°. This indicated that people do not like 
males to approach as close on the left as they do females. But on the right, 
people let males and females approach at about the same distance. 


D. SUMMARY AND CONCLUSIONS 


Two personal space measures were administered to 20 male and 20 female 
subjects. The Pedersen Personal Space Measure (PPSM) was a simulated 
measure in which the subject manipulated profiles that represented himself 
and another person. The Pedersen Behavioral Personal Space Measure 
(PBPSM) was such that subjects were actually approached by another 
person. Each yielded a distance at each of nine angles: zero degrees, 22½°, 
45°, 67½°, 90°, 112½°, 135°, 157½°, and 180°. The distance represented 
S’s personal space: i.e., his closest comfortable distance relative to the other 
person. Four conditions were used for each measure. Male and female subjects 
were approached by both male and female persons. 

A 2 × 2 × 9 factorial analysis of variance on both the PPSM and the 
PBPSM yielded significant main and interaction effects. These effects were 
evaluated for significant differences among the means by the Duncan New 
Multiple Range Test. 

It was found on the PPSM that Ss permitted females to approach more 
closely than males and that both males and females permitted females to 
approach more closely than males permitted males to approach. No difference 
was found in the distance that males were permitted to approach either males 
or females. 

Ss allowed people to approach more closely at the sides than at the front. 
This was primarily due to the fact that females permitted others to approach 
more closely at the sides than at the front. Males differed from females inas- 
much as they permitted others to approach closer frontally. 

For the PBPSM only sex of subject × angle and sex of approaching person 
× angle interactions were significant. As on the PPSM, females tended to 
permit others to approach closer at the sides than at the front. However, the 
distances at the side were not as relatively close as on the PPSM. Males 
allowed others to approach closer at the front than at the sides. Also, as on 
the PPSM, males permitted others to approach more closely at the front than 
did females. 

Finally, people let males approach more closely off center to the left than 
they did females; whereas there were no differences in approach distances for 
approaching males and females off center to the right. 

There was a close correspondence between the conception of people as to 
their personal space behavior as measured by the simulated measure and their 
actual personal space behavior. This was most apparent in the analysis of the 
distances of males and females being approached at the various angles. 


PLAYING PEEKABOO WITH A FOUR-MONTH-OLD: A STUDY 
OF THE ROLE OF SPEECH AND NONSPEECH SOUNDS 
IN THE FORMATION OF A VISUAL SCHEMA* * 


Center for Cognitive Studies, Harvard University 


Patricia M. GREENFIELD 


A. INTRODUCTION 


The role of auditory signals, in general, and speech, in particular, in 
structuring the visual attention of a four-month-old baby was investigated by 
systematic study of my own baby in his natural surroundings. The question 
was whether sound in general and speech in particular would help the baby 
to segment continuous visual information by calling attention to critical 
points in a visual sequence. The visual sequences used in the study involved 
the disappearance and reappearance of objects or mother—the familiar peek- 
aboo game played with inanimate things, as well as people. 

A secondary purpose of the study was to ascertain whether patterns of 
response would differ according to whether the disappearing objects were 
human or inanimate. Trevarthen and Richards (10) have found that, from 
the beginning, infants respond very differently to people than they do to 
inanimate objects. 

The peekaboo game would not be possible if it exceeded the visual capacities 
of the infants. Haynes, White, and Held (6) have shown, however, that by 
four months of age, the point at which this study began, infants adjust their 
visual accommodation quite precisely to the distance of the target. Nor would 
peekaboo be possible if an object ceased to exist for an infant when it dis- 
appeared. But Bower (2) has shown that when things disappear slowly and 
perspectively, they continue to exist for a seven-week-old baby for a duration 
of five seconds. By 12 weeks of age, babies have existence constancy for objects 
disappearing at rates even faster than those used in a peekaboo game. In 
peekaboo, disappearance is much shorter than five seconds. Thus, at 12 weeks 
of age, objects are sufficiently permanent for the normal peekaboo game. 


* Received in the Editorial Office on August [date illegible], and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 

1 Preparation of the manuscript was supported by Grant No. MH-12623 from the 
Institute of Child Health and Human Development to Harvard University, Center for 
Cognitive Studies. 
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Charlesworth (4) has very successfully used this game to study cognitive 
processes in babies from five months up. He showed that even the youngest 
babies develop clear-cut visual expectancies when objects reappear in a constant 
locale. But what does it mean to say that expectancies develop, if in a sense 
the expectancies are there from the outset in the form of existence concepts? 
The spatio-temporal characteristics of a particular game must be what the 
infant is learning from experience. In this present study, a hiding-reappear- 
ance game was used to determine what interactional conditions facilitate or 
impede the development of this type of learning. 

It is known that newborns are selectively attentive to the frequency range of 
human vocalization [Eisenberg (5)]. Given its privileged position as a 
stimulus, can the human voice serve to make spatially coordinated information 
in another modality—vision—more distinctive for the infant? 

Newborns possess a capacity for sound localization as well [Bronshtein and 
Petrova (3), Wertheimer (11), Semb and Lipsitt (9)]. Although the 
actualization of this capacity is initially constrained by information-processing 
limitation [Aronson and Tronick (1)], by the age of three months infants 
will orient to all visible sound sources [Piaget (8)]. When a four-month- 
old baby then orients to a sound emanating from a visually informative locus, 
can this experience help shape a pattern of visual attention? The question of 
this study is whether a sequence of such experience facilitates the growth of a 
visual expectancy. 

These problems were investigated by comparing the baby’s response to the 
reappearance phase of the hiding game under varying auditory conditions. In 
games where the disappearing object was inanimate, disappearance and re- 
appearance occurred either in silence, accompanied by the object's own 
sound, or accompanied by speech. When mother was the disappearing object, a 
nonspeech sound made by the mother replaced the sound of the inanimate object. 
Smiling when the object reappeared was used as an index of a confirmed 
expectancy. According to evidence from Kagan et al. (7), four months is an 
age when the smile functions to indicate assimilation of a visual stimulus to 
an emergent schema. 


B. PROCEDURE 
1. Study 1 

The first study involved the typical sort of peekaboo game, in which I 
disappeared and reappeared. The baby’s response to my reappearance was com- 
pared under two conditions: in one condition, the word “peekaboo,” said with 
the bright intonation normal to the game, accompanied reappearance; in the 
other condition, I reappeared silently. Under both conditions, I made a 
sound (“ooh-ooh”) to attract his attention before hiding. A series of 10 
game sessions, each consisting of seven to 14 trials, was played in a number of 
different locales around the house. In each game session, the two reappearance 
conditions—“peekaboo” and silent—were represented an approximately equal 
number of times; in the average session, each of the two conditions was rep- 
resented by five or six trials. Fatigue and boredom determined at what point 
to terminate a game session. The study began at 17 weeks and lasted until 
19 weeks of age. Some ancillary sessions were held as late as 23 weeks of age. 
More than one game session took place in each locale over a period of days 
or weeks, so that it was possible to trace the development of a peekaboo schema 
in each of four different locales. 

One locale was Matthew’s crib, beneath which I would disappear while he 
was in it. This was the only one in which he had played peekaboo before the 
onset of the study; the three other hiding places were a large bathroom 
hamper, a bed, and a sofa. Locales for hiding around the house were thus 
sampled to a reasonable degree. 

Reappearance consisted of popping my face up (or sideways, in the case of 
the hamper) into Matthew’s visual field. The amount of time in hiding was 
not controlled. In this way I felt that I would obtain a “representative 
sampling,” in the Brunswikian sense, of amount of time hidden while main- 
taining natural conditions. My experience has indicated that artificial timing 
constraints in infant experiments often destroy the phenomenon one wishes to 
study. Nevertheless, when simulated game conditions were timed in the 
laboratory, temporal characteristics turned out to be remarkably consistent: 
time spent hidden averaged 1.3 seconds; the range was from 1.2 to 1.4 seconds. 
Rate of disappearing averaged 41 cm/second; the range was from 38 to 42 
cm/second. Thus, “natural” timing in disappearance-reappearance games turns 
out to be uniform without any experimental constraints. 

As a more general point about methodology, it seems that two qualities of 
infant behavior make it necessary to develop systematic yet natural observation 
conditions; in comparison with adults, infant behavior is (a) much more 
sensitive to situational context and inner state, and (b) much less detachable 
from specific goals. These considerations are most important when one 
purports to study how commerce with the social or physical environment 
affects the growth of some aspect of development. 

Although it is necessary to avoid arbitrariness in designing experimental 
conditions, it is also necessary to avoid “experimenter effects” which confound 
results. In the present case, an obvious source of such an effect could be the 
mother’s smiling. Therefore, a conscious rule not to smile first but always to 
smile back when the baby smiled was initiated after the first game session. I 
thought that this rule would duplicate, in a systematic way, the naturally 
occurring contingency vis-à-vis smiling. Still, this procedure introduces the 
possibility that smiling is being operantly conditioned. This hypothesis will 
be evaluated later in the light of the actual results. 


2. Study 2 


This series of hiding games, begun after the main part of Study 1 when 
Matthew was 19 weeks old and continuing until 22 weeks of age, was designed 
to compare the effectiveness of speech and nonspeech auditory signals in struc- 
turing a response to the visual peekaboo game. A second complementary 
purpose was to see whether the pattern of response differed when an inanimate 
object, rather than a human being, disappeared and reappeared. 

In this study, two parallel series of games were played with mother and 
with objects. In the object games, where three toys constituted the objects, 
experience with a given toy was the variable analogous to experience with a 
locale in the “mother” games. Thus, in both types of game the growth of a 
specific expectancy could be traced over time. During the object games, 
Matthew was always on the couch with me in front of it. The toy started in 
front of my face, disappeared downwards, and reappeared in front of my 
face. Thus, my presence was a constant stimulus in the object games. When 
the object game was simulated later in the laboratory its temporal conditions 
turned out to be extremely consistent and identical with those obtaining when 
the object was a person: disappearance occurred at an average rate of 41 
cm/second; the range was 38 to 44 cm/second. The object stayed hidden an 
average of 1.3 seconds; time hidden ranged from 1.2 to 1.4 seconds. 

Each game session consisted of nine peekaboo trials equally divided among 
three conditions. No condition was presented on two trials in a row. In 
addition to a speech and silent condition as in the first study, there was also a 
nonspeech sound condition. In game sessions where a toy was hidden, the 
speech signals were distinct from those used when mother hid: “Hey, 
Matthew” to signal impending disappearance, and “Here it is” when the toy 
popped back. When mother hid under the nonspeech condition, the sound used 
to herald disappearance and reappearance was a sort of squeak, difficult to 
describe but often used with babies. This sound had in fact been a favorite 
for Matthew, but when tested before the study began, it no longer elicited a 
smile by itself. When a toy was hidden under the nonspeech sound condition, 
the toy’s own sound (squeak or chime) signalled disappearance or reappear- 
ance. Under the silent condition, both mother and toy disappeared and re- 
appeared in silence. This was a departure from the first study in which an 
orienting signal was given even under the silent condition. In both studies, 
however, a trial was not begun until Matthew focused visually on whatever 
was about to disappear. 

In this study, two new hiding places were used when mother hid—a table 
and a chair in their normal locations in the house. Two game sessions took 
place at the table, one at the chair. For inanimate disappearance, locale was 
constant (the couch used in the first study), but the object itself was variable. 
One of two rubber squeakers, a deer, was already familiar; a duck squeaker 
and chime rattle were not. Altogether six sessions of inanimate disappearance 
took place. One was excluded from the data analysis because no smiles were 
elicited under any condition, probably because Matthew was tired. 


C. RESULTS 
1. Study 1 


At first, speech constitutes a powerful cue in the peekaboo game with 
mother; her reappearance rarely elicits a smile without “peekaboo” marking 
the event. With increasing experience with the game in a given locale, how- 
ever, the speech signal loses its privileged position, and the visual sequence 
alone suffices to elicit a consistent smiling response. Table 1 makes this pattern 
clear: in the first game in given locales, silent reappearance elicits a smile only 
six out of 17 times, or 35% of the time. When the game is repeated in the 
same locales at a later date, silent reappearance elicits a smile 50% of the 
time; when it is repeated a third time at a still later date, silent reappearance 
elicits a smile 100% of the time. The results from the “peekaboo” condition 
are in sharp contrast; smiling occurs 17 out of 19 trials or 89% of the time 
the very first time the game is played in a given locale. This response remains 
constant: smiling occurs 11 out of 12 trials (92%) for the second game in 
a given place, four out of four trials (100%) for the third game in the same 
place. Whereas the difference between the “peekaboo” and silent conditions 
is large and statistically significant (p < .01, one-tailed Fisher Test) during 
the first session in a given locale, all differences evaporate by the third series 
of trials: the visual sequence alone elicits smiling on every trial, exactly as it 
does in combination with the “peekaboo” signal. 
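
The one-tailed Fisher test reported above can be checked directly from the first-session counts (17 of 19 “peekaboo” trials versus 6 of 17 silent trials ending in a smile). The sketch below is an illustration rather than the original computation: the function name is hypothetical, and only the Python standard library is assumed.

```python
from math import comb

def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact test for the 2x2 table [[a, b], [c, d]].

    Returns the probability, with all margins held fixed, of a top-left
    cell at least as large as the observed value a.
    """
    row1 = a + b          # trials in the first condition
    col1 = a + c          # total smiles across both conditions
    n = a + b + c + d     # all trials
    max_a = min(row1, col1)
    total = comb(n, row1)
    # Sum the hypergeometric probabilities of the observed table and of
    # every more extreme table in the same direction.
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, max_a + 1)
    )
    return tail / total

# First session in a given locale, Study 1:
# "peekaboo": 17 smiles, 2 no smiles; silence: 6 smiles, 11 no smiles.
p = fisher_one_tailed(17, 2, 6, 11)
print(f"one-tailed p = {p:.5f}")
```

The resulting probability falls below .01, in line with the significance level reported for the first session.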

In the results presented thus far, experience is somewhat confounded with 
maturation, especially by the third game session in a given locale. In order to 
see whether experience can play an independent part, let us turn to the data 
collected on the second day of the study in the one locale where peekaboo 
had been played before the systematic investigation began. We may then com- 
pare these results with those collected at the same time in two new locales. In 
this comparison, we hold maturation constant while varying amount of game 
experience in a specific locale. 

TABLE 1 
RESPONSE TO MOTHER’S REAPPEARANCE DURING GAME SESSIONS VARYING IN 
FAMILIARITY OF LOCALE: STUDIES 1 AND 2 

                                      Game sessions 
                      First in given     Second in same     Third in same 
Reappearance              locale             locale             locale 
marked by             Smile  No smile    Smile  No smile    Smile  No smile 

Study 1ᵃ 
“Peekaboo”              17      2          11      1           4      0 
Silence                  6     11           6      6           3      0 
Study 2ᵇ 
Human speech             6      0           3      0 
  sounds (“peekaboo”) 
Human nonspeech          3      3           3      0 
  sounds 
Silence                  0      6           3      0 

ᵃ Locales in Study 1 were as follows: First game session: hamper, sofa, bed; 
second session: hamper, sofa; third session: hamper. 

ᵇ Locales in Study 2 were as follows: First game session: table, chair; second 
session: table. 

The second day of the study we played peekaboo by Matthew’s crib, the 
place where he had previous peekaboo experience. Reappearance elicited a 
smile on five out of five “peekaboo” trials and five out of five silent trials. The 
day before and the day immediately following, games were played in two 
new locales, hamper and couch. The results obtained on silent trials in new 
hiding places were uniformly different from those obtained in the familiar 
hiding place. When I reappeared silently, smiling occurred one in five times 
at the hamper, one in seven times at the sofa. In sharp contrast, silent re- 
appearance at the crib had elicited a smile on every trial, that is, five out of 
five times. (This result at the crib locale was replicated five days later.) In 
other words, experience can promote the development of the visual disappear- 
ance-reappearance schema at any chronological point within the one-month 
range tested in this study. 

It is interesting that at this point in development, expectancies seem ex- 
tremely specific. This fact emerges when one notes that performance fails to 
improve during a second game session when that session takes place in a 
different locale, whereas there is great improvement the second time when 
the game is played in the same locale. By the end of the month, either because 
of maturation or because of a learning-to-learn phenomenon, the initial game 
in a new locale (bed) elicits a much higher rate of smiling responses on 
silent trials (four out of six, or 67%) than was true at the beginning of the 
month (two out of 10, or 20%). 

Thus, one can conclude, first, that the speech signal enables a consistent 
pattern of response to emerge earlier than is possible on the basis of visual 
cues alone. Second, experience that includes vocal articulation of the critical 
visual event promotes the future development of a consistent pattern of 
response in the absence of this auditory cue. 

The inference is that the smile results from the confirmation of an emergent 
expectancy and that this consistent pattern of response therefore signifies the 
actualization in a specific instance of existence constancy. Some independent 
indication that the development of an expectancy constitutes the underlying pro- 
cess is furnished by more data collected one week after the last games reported 
thus far. In these two games, anticipatory smiles occurred on every trial at the 
alerting signal, so that recognitory smiles could no longer be scored. If predic- 
tion is the hallmark of an expectancy, then this anticipatory response con- 
stitutes extremely strong evidence for the reality of such a development. 

The temporal patterning of the smile response would seem to offer the best 
evidence against reinforcement as an explanation of the results. Since I would 
smile back any time Matthew smiled, reinforcement in no way explains why, 
initially, he smiled only at the end of a trial (reappearance) and never at 
the beginning of a trial (before disappearance). 


2. Study 2 


The results of Study 2 replicate those of Study 1; Table 1 shows that the 
first time mother disappears and reappears in a particular place, reappearance 
elicits a smile when accompanied by “peekaboo.” By the second game session 
in one of the same locales four days later, smiling occurs whether or not there 
is any sound to signal disappearance and reappearance. The nonspeech signal 
starts out exactly intermediate between speech and silence in eliciting a smiling 
response (three out of six trials, or 50% of the time). But experience either 
improves the effectiveness of the nonspeech signal or obviates the need for any 
auditory signal; during the second game in a given locale, smiling occurs on 
every trial in every condition. 

The results of the parallel object-hiding games are displayed in Table 2, 
where three degrees of experience with game and object are distinguished. 
Degree of experience is a composite of two factors: (a) familiarity with an 
object before it was used in any hiding game, and (b) experience with an 
object in a game session. At all three levels of experience, speech trials virtually 
always elicit a smile, silent trials never. Thus, Matthew never seems to learn 
what to expect from a hidden toy unless he is given some auditory cues. The 
visual schema seems harder to establish for things than for mother, and 
auditory cues become correspondingly more crucial in directing the pattern of 
visual attention. 

As for nonspeech sounds, they gradually become more effective in structur- 
ing visual attention. In the first game with the unfamiliar toys (duck squeaker, 
chime rattle), the nonspeech sound condition elicited a smile on only one out of 
six trials (17%). In the games of intermediate familiarity (deer and duck 
squeakers), smiling occurred two out of six times (33%). In the games at 
the highest level of familiarity (second session with the familiar deer squeaker), 
smiling occurred in the object-sound condition two out of three times (67%). 
This trend toward the increasing effectiveness of nonspeech sounds is statis- 
tically significant at the .05 level when one considers the results both for 
“people” and for “thing” games (one-tailed Fisher Test). 

In the object games, Matthew never smiled at my face alone (during the 
disappearance phase), but smiled only when face and object were visible to- 
gether (reappearance). This pattern of response is inconsistent with a rein- 
forcement explanation of the observed behavior, for “smiling back” as a 
reinforcer could occur at any point in an object game, not exclusively upon 
the object’s reappearance. 

Additional data would have been desirable, but Matthew refused to play 
any more. Since Charlesworth (4) has found persistence at a peekaboo game 
to be inversely related to the infant’s ability to predict reappearance, this 
behavior constitutes additional evidence that reappearance had become predict- 
able, that Matthew had indeed formed a visual schema. 

TABLE 2 
RESPONSE TO OBJECT’S REAPPEARANCE DURING GAME SESSIONS VARYING 
IN FAMILIARITY OF OBJECT 

                                   Degrees of Experience 
                                          Level 2 
                     Level 1              (Initial game ses- 
                     (Initial game        sion with familiar     Level 3 
Disappearance        session with         object; second         (Second game 
and reappear-        unfamiliar           game with un-          session with 
ance marked          object)              familiar object)       familiar object) 
by:                  Smile  No smile      Smile  No smile        Smile  No smile 

Human speech           …       …            …       …              …       … 
  sounds 
Object sound           1       5            2       4              2       1 
Silence                0       6            0       6              0       3 

In sum, the pattern of results is very striking and consistent for inanimate 
and animate objects. When Matthew starts to play a new game—that is, one 
involving a new locale or a new object—speech holds a very privileged 
position; it is the only condition that reliably elicits the recognitory smile. 
With experience, the sound of an inanimate object becomes almost as effective 
in producing this smile. Silent reappearance of an inanimate object never 
elicits a recognitory smile. As for games involving mother’s disappearance and 
reappearance, nonspeech sounds are initially somewhat effective in eliciting 
the smile, although much less so than speech. After experience with the game 
in a given locale, Matthew shows consistent appreciation of my reappearance 
under all three conditions. At this point the visual event alone suffices to 
trigger an expectancy and to induce its confirmation. 


D. Discussion 


The greater difficulty of establishing a disappearance-reappearance schema for 
things than for a person who is a mother may reflect strong general expectations 
about what happens to a mother or to any person in the baby’s experience when 
she hides. This would be in line with a general conclusion of the two studies: 
the more unpredictable a visual sequence, the greater the role of auditory cues 
in delineating its critical points. Because I was the only person to play peekaboo 
with Matthew, we cannot distinguish a specific “mother” effect from a 
general “person” effect. The inanimate objects, by contrast, represented a 
sampling of different noise-making toys; generalization about inanimate ob- 
jects therefore seems warranted. 

A potential source of weakness in the studies was the failure to assess 
whether “peekaboo” and “here it is” would elicit a smile by themselves at the 
outset of the investigation. Even if they had, however, this fact could not 
explain why the silent and nonspeech conditions became equally effective in 
eliciting a smile after experience with a given visual sequence. The effective- 
ness of the silent condition in eliciting a recognitory smile the first time peekaboo 
was studied in the familiar crib locale shows that previous experience with a 
silent event is not a necessary requisite for the development of a purely visual 
schema, for there had been no silent hiding before the study began. While 
seeing that “peekaboo” trials are sufficient for the development of the hiding- 
reappearance expectancy, we do not know if the silent trials also made a con- 
tribution to this learning. But if the pattern of attention is the crucial aspect 
of learning a visual expectancy, then it makes sense to think that experiences 
in which attention is directed to the critical loci at the right times would be 
much more valuable in organizing a temporal pattern than would experience 
where attention might well be randomly or irrelevantly distributed, as on the 
silent trials. 

If Bower (2) is correct in claiming that infants of this age have existence 
constancy at the outset, what does it mean to say that a child “learns” a 
simple hiding game in the course of certain kinds of experience? It must be 
the spatiotemporal pattern of this particular game that is being learned. In 
relation to timing, it is probably significant that many of the earliest games 
babies play have a strong rhythmic component, for example, “pat-a-cake.” 
It is often this rhythmic component which seems to contribute most to the 
game-like quality of the interchange; rhythm comes to be a rule-of-the-game 
that the baby learns first. As for spatial learning, our results show that peekaboo 
is initially defined as a game-in-a-particular-place. Later, we have seen, 
Matthew’s expectancy transcends the limits of any specific locale. It could 
well be that when the expectancy also transcends the limits of a specific 
temporal rhythm, the whole game loses its appeal for the baby. 

If these results are valid, they suggest an important way in which caretakers 
can control and develop the deployment of an infant’s visual attention through 
the use of sound. Sound can contribute informational redundancy vis-a-vis 
the visual event by encoding it in a second modality. Moreover, sound, under 
normal conditions, has two design characteristics—spatial localization and tem- 
poral organization—which may be important in guiding visual attention. The 
spatial localization of an ordinary auditory stimulus plus the omnidirectional 
capacity of human sound receptors means that visual attention can be guided 
to the informative locus. Temporal organization, on the other hand, brings 
with it the possibility of imposing points of articulation upon the relatively 
continuous flow of the visual world. It appears that a mother’s speech, and 
probably speech in general, hold a privileged position among auditory stimuli 
[...]. Since the role of speech in the environment is a [...]ural variation in 
cognitive development, and since skillful deployment of attention may well be 
the crucial cognitive accomplishment [...] could constitute the ontogenetic 
[...] relationship between language and attention. Many more infants must be 
studied if such developmental phenomena are to be fully understood. 

E. SUMMARY 


This study investigated the role of auditory signals in general, and speech 
in particular, in structuring the visual attention of a four-month-old baby. 

The first study involved the typical sort of peekaboo game, in which mother 
disappeared and reappeared. The baby’s response to reappearance was com- 
pared under two conditions: in one condition, the word “peekaboo,” said with 
the bright intonation normal to the game, accompanied reappearance; in the 
other condition reappearance occurred in silence. The results showed, first, 
that the speech signal enables a consistent pattern of response to emerge 
earlier than is possible on the basis of visual cues alone. Second, experience 
that includes vocal articulation of the critical visual events promotes the 
future development of a consistent pattern of response in the absence of this 
auditory cue. 

The second study was designed to compare the effectiveness of speech and 
nonspeech auditory signals in structuring a response to the visual peekaboo 
game. A second complementary purpose was to see whether the pattern of re- 
sponse differed when an inanimate object, rather than a human being, dis- 
appeared and reappeared. The pattern of results is very striking and consistent 
for inanimate and animate objects. In a novel hiding game (that is, one in- 
volving a new locale or a new object), speech holds a very privileged position; 
it is the only condition which reliably elicits the recognitory smile. With 
experience, the sound of an inanimate object becomes almost as effective in 
producing this smile. Silent reappearance of an inanimate object never elicits 
a recognitory smile. As for games involving mother’s disappearance and 
reappearance, nonspeech sounds are initially somewhat effective in eliciting 
the smile, although much less so than speech. After experience with the game 
in a given locale, the baby shows consistent appreciation of mother’s reappear- 
ance under all three conditions. At this point the visual event alone suffices 
to trigger an expectancy and to induce its confirmation. 


THE EFFECT OF REDUCING THE REDUNDANCY OF 
WRITTEN MESSAGES BY DELETION OF 
FUNCTION WORDS*? 


Institute for Child Study and Reading Program, Indiana University; and School of 
Education, Clemson University 


J. JAAP TUINMAN AND GORDON GRAY 


A. RATIONALE 


It is well known that printed English text is highly redundant. Shannon 
(9) calculated this redundancy to be about 75%. Theoretically, only one 
fourth of the characters normally used in a written message are needed to 
convey the information contained in it. Sequential and distributional con- 
straints operate, however, which make it impossible to create messages which 
are still intelligible by random deletion of three out of every four characters 
in a text. The reduction of redundancy, or shortening of written messages, 
is of interest for at least two kinds of reasons. First, there are the reasons of 
economy. If under certain conditions shorter messages are possible, with little 
loss of effectiveness in terms of information transmission, there is no economic 
justification for more redundant versions. The question of redundancy reduc- 
tion is also of interest, however, in terms of the isolation of those factors 
which permit less redundant messages with minimal loss of information. Under- 
standing of the nature of these factors and of their operation is of import 
both for the construction of more economical messages and for the explanation 
of comprehending regular prose. 
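
Redundancy in Shannon's sense can be written as 1 - H/H_max, where H is the entropy per character of the text and H_max is the entropy of an alphabet whose symbols are all equally likely. The sketch below illustrates that relation only; it does not reproduce Shannon's estimate. The 27-symbol alphabet (26 letters plus space), the sample sentence, and the function names are assumptions made for illustration, and a single-character frequency count ignores the sequential constraints that account for most of the 75% figure.

```python
import math
from collections import Counter

def redundancy(entropy_bits, alphabet_size):
    """Redundancy = 1 - H / H_max, with H_max = log2(alphabet size)."""
    return 1.0 - entropy_bits / math.log2(alphabet_size)

def first_order_entropy(text):
    """Entropy in bits per character from single-character frequencies."""
    counts = Counter(text)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# Hypothetical sample; any text could be substituted.
sample = "the woman washed her clothes"
h1 = first_order_entropy(sample)
print(f"first-order entropy: {h1:.2f} bits per character")
print(f"redundancy against a 27-symbol alphabet: {redundancy(h1, 27):.0%}")
```

On this definition, a redundancy of 75% corresponds to an entropy of one quarter of H_max, which is the sense in which only one fourth of the characters would be needed.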

Miller and Friedman (8) conclude that the reduction of messages by creat- 
ing a statistically optimal orthography may be possible but hardly realistic. 
There simply would be too many practical problems associated with learning 
and reading such an “efficient” orthography. These researchers proceeded by 
testing various reduction schemes involving deletion of individual characters 
(letters, spaces between words). Others have analyzed the effects of deletions 
of various word classes [Weaver and Bickley (11), Bickley, Weaver and 
Ford (3), Bradley (4), and Bhooma (2)]. The latter approach has intuitive 
appeal. It is used on a large scale in the construction of telegrams, for instance. 
What is happening in these cases is that the structural redundancy of the 
message is minimized in the belief that the receiver can supply the necessary 
structural information. Bradley (4) showed that this is true, particularly if 
the effectiveness of information transmission is measured by asking questions 
about the text read. In his study deletion of function words had no significant 
effect on the performance of Ss on a multiple choice test based on the para- 
graphs read. This finding is in accordance with results in an earlier study by 
Weaver and Bickley (11) and with those of Bhooma (2). 

One role of function words is to disambiguate sentences by acting as cues 
to the underlying grammatical structure. Thus, according to Fodor and 
Garrett’s (6) theory of sentence comprehension, deletion of such cues from 
the surface form of the sentence decreases comprehensibility. The degree to 
which this is true, however, depends on the ability of the receiver of the 
message to supply part or all of the missing cues. This ability will be generally 
high in those situations when a message is followed by questions about its 
content. Built into the questions are many of the cues specifying the relations 
among the content words remaining in the message. For example, the question 
“Who spoiled the boy?” signals that the preceding message “Boy spoiled 
woman” can not be interpreted as “The boy spoiled the woman” but rather 
should be read as “The boy was spoiled by the woman.” 

The present study focuses on the reduction of sentence structure, controlling 
for a number of factors not explicitly considered in the redundancy reduction 
studies mentioned above. First of all, an attempt was made to obtain a measure 
of comprehension that was highly passage dependent. Questions used in 
standardized reading tests are often answerable without reading the accom- 
panying passage at all (10). This confounds the interpretation of the effects 
of modifications of the passages on their comprehensibility. Secondly, the 
Weaver and Bickley (11) study and those of his students Bradley (4) 
and Bhooma (2) utilized a blackout technique, replacing the elements to be 
deleted by a black bar. The presence of the black lines themselves, however, 
signals structure. Compare the relative ambiguity of the sentence The woman 
washed ■■■ clothes with The woman washed clothes, both of which are 
reductions of The woman washed her clothes. Finally, in the present study 
two definitions of function words were employed. As Coleman (5) demon- 
strated, the relationship between the content word ratio in a paragraph and the 
paragraph’s comprehensibility as indexed by cloze scores [...] study two patterns of 
[...] definition of function [...] 

In summary: in this research the effect of reducing structural redundancy 
of written messages on the comprehension of these messages was studied. The 
reduction of structural redundancy was effected by deletion of function words 
as defined in two different ways. 


B. PROCEDURE 
1. Materials 


A 32-item three-foil multiple choice test² was developed on the basis of the 
story “The Last Samurai” originating from the Asian Studies Project (1). 
One of the foils of each item contained a statement directly dealt with in the 
text. Thus the role of inference in this reading test was minimized. This was 
done in an attempt to reduce the number of items which could be answered 
without prior reading of the passage. As Weaver, Bickley, and Ford (12) 
have shown, inference items are generally less passage dependent than are 
factual items. The results of a pilot study showed the item construction strategy 
to be successful. The mean score of 30 Ss on the 32-item test was 11.20, or 
35%, only slightly higher than the expected score of 33%. 
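
The comparison between the pilot score and the chance level can be checked with a few lines of arithmetic. The sketch below is illustrative only; it assumes three answer options per item, which is inferred from the 33% expected score, and the function name is hypothetical.

```python
def chance_score(n_items: int, n_options: int) -> float:
    """Expected number correct when every item is answered by blind guessing."""
    return n_items / n_options

N_ITEMS = 32         # items on the test (from the text)
N_OPTIONS = 3        # options per item (assumed from the 33% chance level)
PILOT_MEAN = 11.20   # pilot mean obtained without the passage (from the text)

expected = chance_score(N_ITEMS, N_OPTIONS)
chance_pct = 100 * expected / N_ITEMS
pilot_pct = 100 * PILOT_MEAN / N_ITEMS

print(f"chance: {expected:.2f} items ({chance_pct:.0f}%)")
print(f"pilot:  {PILOT_MEAN:.2f} items ({pilot_pct:.0f}%)")
```

Under these assumptions the pilot mean exceeds the guessing level by roughly half an item.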

Three versions of “The Last Samurai” were prepared. Version UA (un- 
altered) contained 1700 words. The first modification (M1) was obtained 
by retaining in the text only content words as included in Coleman’s wider 
definition of content words (5). Under this definition content words are defined
as nouns, verbs, adjectives, adverbs, numbers, copula, adverbs of time and
location, pronouns, address words, possessive pronouns and the words not, yes, 
and no. The deletion of the words not in these classes (i.e. the function words 
under a narrow definition) resulted in a text reduced to 62% of the original 
text (1155 words). Figure 1 contains a sample of the text. 

Modification M2 was obtained by retaining only those words classified by 
Coleman as content words under his second, more narrow definition which 
includes only nouns, verbs, adjectives, adverbs (other than the ones listed 
above as separate classes), and numbers. This modification produced a text 
half as long as the original text (855 words). A sample of the text is given in
Figure 1. 

It must be noted that the deleted words were not replaced by blanks or any
other marking device.
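
The two deletion rules can be illustrated with a minimal sketch. It is not the procedure actually used to prepare the passages: the word-class table below is a hypothetical toy lexicon, the class labels are invented for the example, and a real application would require a full part-of-speech coding of the text. Deleted words are simply dropped, with no blank or marker left behind.

```python
# Hypothetical mini-lexicon: word -> word class (illustrative only).
WORD_CLASS = {
    "the": "article", "woman": "noun", "washed": "verb", "her": "possessive",
    "clothes": "noun", "not": "negation", "in": "preposition",
    "river": "noun", "yesterday": "time_adverb",
}

# Classes retained under the narrower content-word definition (M2).
NARROW_CONTENT = {"noun", "verb", "adjective", "adverb", "number"}
# The wider definition (M1) keeps these additional classes as well.
WIDE_CONTENT = NARROW_CONTENT | {
    "copula", "time_adverb", "place_adverb", "pronoun", "address",
    "possessive", "negation", "yes_no",
}

def reduce_text(text, keep_classes):
    """Drop every word whose class is not retained; words missing from the
    toy lexicon are treated as unknown and dropped too."""
    kept = [w for w in text.split()
            if WORD_CLASS.get(w.lower(), "unknown") in keep_classes]
    return " ".join(kept)

sentence = "The woman washed her clothes in the river yesterday"
m1 = reduce_text(sentence, WIDE_CONTENT)
m2 = reduce_text(sentence, NARROW_CONTENT)
print(m1)
print(m2)
```

In this toy case the wider definition keeps the possessive and the adverb of time, while the narrower definition removes them.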


2 The test was constructed by Mr. Wayne Berridge in the course of a dissertation
study under guidance of Dr. B. Elgit Blanton, Indiana University, and the author.
The author wishes to thank both gentlemen for their permission to use the instrument.


During World War II Fujiro Ohashi had been an outstanding pilot
in the Air Division of the Japanese Navy. He was one of the first men
in Japan to learn how to fly a fighter plane and was well known for
his skill. He was the first pilot to land on an aircraft carrier at
night and soon after that he became a captain. He would have become a
rear admiral if the war hadn't ended as quickly as it did. Throughout
his Navy career, Captain Ohashi was considered to be a follower of the
ancient military tradition of the Samurai which had existed in Japan
for over 800 years.

FIGURE 1
SAMPLE OF ORIGINAL TEXT (UA), MODIFICATION 1 (M1),
AND MODIFICATION 2 (M2)
2. Subjects


Subjects were 90 seventh graders selected by stratified random sampling from the total
seventh-grade population of a central Indiana rural junior high school. The 
school’s tracking system made stratification desirable. Both sexes were nearly 
equally represented in the sample (49 girls versus 41 boys). 


3. Procedure 


The tests were administered simultaneously by four research assistants 
under the supervision of E in groups of about 23 Ss each. Standard test 
directions were read. In addition, it was explained that “some students will 
have a story that is a little bit strange. However, by reading the questions and 
the story closely it will be possible to decide upon the right answer in most 
cases.” Within each of the four groups Ss were randomly assigned to one of 
the three treatment conditions (UA, M1, or M2). The students had un- 
limited time to finish their tests. All Ss did complete the 32 questions. 
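
One way of carrying out random assignment within testing groups is sketched below. The report does not state how the assignment was actually done; the group sizes, the student identifiers, and the shuffle-and-deal scheme are all assumptions made for illustration.

```python
import random

CONDITIONS = ["UA", "M1", "M2"]

def assign_within_group(student_ids, rng):
    """Shuffle one testing group, then deal its members out to the conditions in turn."""
    shuffled = list(student_ids)
    rng.shuffle(shuffled)
    return {sid: CONDITIONS[i % len(CONDITIONS)] for i, sid in enumerate(shuffled)}

rng = random.Random(0)  # fixed seed only so that the sketch is repeatable

# Hypothetical group memberships: four groups of about 23 Ss, 90 Ss in all.
groups = {
    "group_1": range(0, 23),
    "group_2": range(23, 46),
    "group_3": range(46, 68),
    "group_4": range(68, 90),
}

assignment = {}
for members in groups.values():
    assignment.update(assign_within_group(members, rng))

counts = {c: sum(1 for v in assignment.values() if v == c) for c in CONDITIONS}
print(counts)
```

Dealing within each group keeps the three conditions nearly balanced inside every testing room. Exactly equal cells across the whole sample would need an additional balancing step that this sketch does not include.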


C. Results


In Table 1 the mean, standard deviation, and Spearman-Brown reliability 
coefficients are presented for each of the treatment groups. The reliabilities, 
though not high, seem adequate for the present usage of the test results. First,
it is worth noting that the test is relatively easy when used with regular
text. The UA mean score of 25.17 amounts to 79% of the highest possible
score (32.00). This percentage drops to 68% for M1 and 60% for M2. The 
relationship among the means for the three groups can be expressed in yet 
another way. The mean score of M1 is 87% of that of UA; the mean score 
of M2 is 77% of that of UA. Finally, M2 scored 88% as high as M1. These 
percentages indicate the extent of the loss of information as a result of modify- 
ing the text. 
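
The percentages quoted in this paragraph follow directly from the group means in Table 1. A short check, with a hypothetical helper function and rounding to whole percents:

```python
MAX_SCORE = 32.0
means = {"UA": 25.17, "M1": 21.90, "M2": 19.33}  # group means from Table 1

def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to a whole percent."""
    return round(100.0 * part / whole)

pct_of_max = {cond: pct(m, MAX_SCORE) for cond, m in means.items()}
m1_vs_ua = pct(means["M1"], means["UA"])
m2_vs_ua = pct(means["M2"], means["UA"])
m2_vs_m1 = pct(means["M2"], means["M1"])

print(pct_of_max)
print(m1_vs_ua, m2_vs_ua, m2_vs_m1)
```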

A one-way ANOVA was performed, indicating that the treatment effect was
significant at the .001 level of significance (l.s.; Table 2). Subsequent Scheffé comparisons resulted
in significant differences between UA and M1 (p < .05) and between UA


TABLE 1
MEANS AND STANDARD DEVIATIONS AND SPEARMAN-BROWN
RELIABILITIES (n = 30)

Text condition    Mean     SD      rxx
UA                25.17    4.66    .86
M1                21.90    4.31    .67
M2                19.33    3.59    .62

TABLE 2
ONE-WAY ANALYSIS OF VARIANCE

Source            SS         df    MS        F
Treatment         512.87      2    256.43    14.45*
Within groups     1543.53    87    17.74
Total             2056.40    89

* p < .001.


and M2 (p < .01). The l.s. associated with the difference between M1 and
M2 is .10. (In view of the conservative character of his test, Scheffé recommends
usage of l.s. = .10 where one normally would use l.s. = .05.)

From the data it can be concluded that though the differences between UA 
and the two text modifications were relatively small (in terms of percentages 
of loss of information) they were systematic. 
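
The analysis of variance can be approximately reconstructed from the summary statistics alone. The sketch below assumes that the reported standard deviations were computed with n - 1 in the denominator, and it uses the rounded means and standard deviations, so the results agree with the reported table only to within rounding error.

```python
# Summary statistics from Table 1: condition -> (mean, standard deviation).
groups = {"UA": (25.17, 4.66), "M1": (21.90, 4.31), "M2": (19.33, 3.59)}
n = 30                      # Ss per condition (from Table 1)
k = len(groups)

# With equal n, the grand mean is the simple average of the group means.
grand_mean = sum(mean for mean, _ in groups.values()) / k

# Between-groups (treatment) and within-groups sums of squares.
ss_between = n * sum((mean - grand_mean) ** 2 for mean, _ in groups.values())
ss_within = (n - 1) * sum(sd ** 2 for _, sd in groups.values())  # assumes n - 1 SDs

df_between = k - 1
df_within = k * (n - 1)
f_ratio = (ss_between / df_between) / (ss_within / df_within)

print(f"SS treatment = {ss_between:.2f}, SS within = {ss_within:.2f}")
print(f"F({df_between}, {df_within}) = {f_ratio:.2f}")
```

The reconstructed sums of squares and F ratio fall close to the tabled values of 512.87, 1543.53, and 14.45.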


D. Discussion 


The fact that, in a pilot, only 35% of the items could be answered correctly
when text was removed, indicates that the information in the passage was
vital to performing adequately on the test. Yet, it was shown (particularly
for M1) that drastic reduction of the original text is possible without equally
severe damage to the information function of the passage. The loss from UA
to M1 was only 13%. Even reducing the text by 50% led only to a drop in
“comprehension” of 23%. It might be argued that, therefore, the set of
questions did not gauge comprehension precisely enough. There is a danger
of circularity in this argument, however. At this time no specific and objective
criteria for the degree to which any set of questions measures the comprehension
of a given passage are available. The facts are that (a) the probability
of correctly answering this set of questions without reading the passage was
.35, (b) this probability was .79 when the original passage was read, and (c)
this probability was .68 when the mutilated version M1 was read. Thus, in...
the passage was a necessary and sufficient source of information for ...
functioning as such was impeded relatively lightly by the ...

Parenthetically, it may be pointed out that Martin and Herndon (7)
report two studies in which the mean comprehension score for the groups
reading their telegraphic stories is 88% of the mean score of the groups that
read the traditional text. The variable representing comprehension which was
used in their experiments was a test that required Ss to recall whether a particular
relationship among concepts in the story was disjunctive or nested. Even
so, robustness of information transmission in face of severe reduction of
redundancy is emphasized by the Martin and Herndon results.

It may be concluded from the data presented here that if a message is read 
for the purpose of answering specific questions, the danger inherent to 
mutilation of the text by deletion of function words is relatively small. 
Apparently the structure provided by the questions allows reconstruction of 
the message to a rather high extent. In general this study to a degree supports 
the concept of a written language which has been deliberately rendered less 
redundant than its counterpart, the spoken language. Of considerable interest
is the question, not answered by the present study, in regard to the type of
information demand situations (e.g., as exemplified by certain categories of
questions) for which particular deletion strategies are of optimal value.


E. Summary 


Regular English text was rendered less redundant by deletion of function
words. Application of two definitions of function words resulted in text
reductions of 38% and 50% respectively. The intelligibility of the shortened
messages was measured by means of an especially constructed reading comprehension
test. Ss were 90 junior high school students. The redundancy
reductions resulted in comparatively small amounts of loss in information
transmission. Ss in the 38% reduction condition scored 13% below Ss using
regular text. Ss with the 50% reduction condition scored 23% below the Ss
with standard English text.
THE USE OF OTIS-LENNON MENTAL ABILITY TESTS,
ADVANCED LEVEL, IN IRAN*¹


Pahlavi University, Shiraz, Iran 


A. A. Hosseini, A. A. Razavieh, and A. H. Mehryar


A. Introduction


The series comprising the new Otis-Lennon Mental Ability Tests (MAT) 
was published in the U.S. in 1967 and became available in two forms, each 
with six levels: Primary I and II, Elementary I and II, Intermediate, and 
Advanced. These tests, which represent a revised edition of the Otis Quick-
Scoring series, were “designed to provide comprehensive, carefully articulated
assessment of the general mental ability, or scholastic aptitude, of pupils in 
American schools” (6, p. 4). 

The Advanced Level of the Otis-Lennon Mental Ability Test is composed of 80
items arranged in spiral omnibus form and is recommended for use with
typical students in grades 10 through 12. The time required for administering
this test is 45 to 50 minutes, which allows ample time for students to go
through all of the items. In this sense, it places emphasis upon obtaining a
power measure of general abstract reasoning ability rather than upon per- 
formance reflecting speed of work (6, 7). The test enjoys a high degree of 
validity and reliability (8) which is indicative of the usefulness of the instru- 
ment for teachers, administrators, and counselors. 

The present report contains preliminary data obtained with a Persian 
adaptation of the Otis-Lennon Mental Ability Tests, Advanced Level. 


B. Method
1. Instrument 


A Persian translation of the Otis-Lennon MAT, Advanced Level, Form J
(7) was prepared by two of the authors. Every effort was made to keep the 
translation as close to the original as possible. The resulting version was 
retranslated into English by two psychology professors and was compared 
with the original test. It was then tried on a group of 30 Pahlavi University 
freshmen with a view to identifying and modifying questions that were vague
and inapplicable. No major change or omission of the original text was necessary,
except in the case of item 43 where “Horse is to whinny as clock is to
_____.” was substituted for “Snake is to hiss as saw is to _____.” The
reason for this change was that in the Persian language there is no common 
word which could easily express what is implied by “hiss” in English. 


2. Subjects 


Subjects were 1012 students (768 boys and 244 girls) taking the Pahlavi 
University entrance examination in June, 1970. They were a randomly selected 
sample of a much larger population (over 23,000 applicants). The majority
of the subjects were between 18 and 20 years, had completed 12 years of
primary and secondary education, and were highly motivated to do well in
the examination and win the “race.” The sample included subjects from all
three fields of specialization, 7% being from the literature branch, 42% from
the mathematics branch, and the remaining 51% from the natural sciences
branch.² The above percentages are proportional to those of the original
population, a point which confirms the randomness of the procedure adopted 
for selecting the sample. 


3. Procedure 


Subjects took the test within a 45-minute working period and marked their 
responses on a separate answer sheet. After the completion of the test, they
took two semistandardized English and Persian tests. Each subject also took
an achievement battery according to his field of specialization: that is, those
who followed Mathematics branch took a test in mathematics, a test in
chemistry, and a test in physics and mechanics. Those who followed Natural
Sciences branch took a test in biology, a test in chemistry, and a test in
physics and mathematics. And finally, those who followed Literature branch
took a test in history and geography, a test in Persian literature, and a test in
philosophy and logic. After the completion of these batteries, all of the subjects
responded to a questionnaire which was designed to obtain information
on the socioeconomic and family background of the subjects. 


C. Results and Discussion


Table 1 presents the means and the standard deviations of the test for
boys and girls in the three fields of specialization and for the two sexes combined.
Table 2 shows the coefficients of correlation between Otis-Lennon
MAT: Advanced and two tests of English and Persian language for boys and
girls in Literature, Natural Sciences, and Mathematics branches, and the two
sexes combined. It also includes the correlation coefficients between the test
and certain batteries of achievement tests. Table 3 includes the means and the
standard deviations of the test according to father’s education. These data
were collected as a further attempt to provide evidence of the validity of the
test.

2 For a brief account of the Iranian educational system, the reader is referred to
Mehryar and Shapurian’s report (4).

Table 1 shows that subjects who followed the Mathematics branch scored 


TABLE 1
MEANS AND STANDARD DEVIATIONS OF THE OTIS-LENNON MAT FOR 1012 BOYS AND GIRLS
IN LITERATURE, NATURAL SCIENCES, AND MATHEMATICS BRANCHES

Branch              Sex      N       Mean     SD
Literature          Boys     44      31.47    9.98
                    Girls    31      30.16    6.71
                    Total    75      30.93    8.80
Natural Sciences    Boys     333     31.15    9.54
                    Girls    174     32.85    10.08
                    Total    507     31.76    9.74
Mathematics         Boys     391     40.24    10.32
                    Girls    39      39.36    9.64
                    Total    430     40.19    10.28
Grand total                  1012    35.33    10.73

Note: The values of t are as follows: Mathematics vs. Natural Sciences, t = 13.17
(p < .001); Mathematics vs. Literature, t = 8.26 (p < .001); and Natural Sciences
vs. Literature, t = .75 (NS).


the highest, followed by Natural Sciences and Literature branches, respec- 
tively. The mean scores of the subjects in Literature and Natural Sciences 
branches were close, but presented a greater divergence from that of the 
students of the Mathematics branch. These findings are not consistent with 
Mehryar and others’ study (5) which suggests a greater similarity between 
science branches. 

From Table 1 it is also apparent that boys who followed Literature and 
Mathematics branches achieved higher means than girls. Girls, on the other
hand, outscored boys when they followed the Natural Sciences branch. The differences
did not reach significance, although they were in line with the
results of earlier studies (5).

When the results of this study are compared with the American norms 
provided by Otis and Lennon (8), it would appear that Iranian subjects have 
scored strikingly low. The mean raw scores are 47 for Americans and 35
TABLE 3
MEANS AND STANDARD DEVIATIONS OF THE OTIS-LENNON MAT, ADVANCED LEVEL,
FOR 1012 BOYS AND GIRLS ACCORDING TO YEARS OF FATHER’S EDUCATION

Years of                  Boys                       Girls
father’s
education        N      M        SD         N      M        SD
0-4              228    34.62    11.66      67     31.45    8.30
5-6              220    35.13    10.05      50     33.68    9.50
8-9              95     36.10    12.02      38     32.63    8.94
10-14            127    36.81    10.52      33     33.96    10.92
15-              98     38.42    10.06      36     36.32    11.62
Total            768                        244

for Iranians. These figures clearly show that the performance of Iranian 
students is remarkably lower than that of their American counterparts and 
that the difference between the mean of the two groups is greater than one 
standard deviation. The inferior performance of Iranian students in com- 
parison with their Western counterparts is well in line with similar results 
reported for Persian forms of AH5 and AH4 scales (4, 5); Otis Self-Administering
MAT, Intermediate Form; and Otis Quick-Scoring MAT,
Gamma Test (2, 9); Raven Progressive Matrices (10); and GATB scales
(3).
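
The statement that the two means differ by more than one standard deviation can be verified with simple arithmetic. The text does not give the standard deviation of the American norm group, so the sketch below assumes the grand-total standard deviation of the Iranian sample (Table 1) as the unit of comparison; the function name is hypothetical.

```python
US_MEAN = 47.0       # American norm mean as quoted in the text
IRAN_MEAN = 35.33    # grand-total mean from Table 1
IRAN_SD = 10.73      # grand-total SD from Table 1 (assumed here as the unit)

def standardized_difference(mean_a, mean_b, sd):
    """Difference between two means expressed in standard deviation units."""
    return (mean_a - mean_b) / sd

d = standardized_difference(US_MEAN, IRAN_MEAN, IRAN_SD)
print(f"difference = {US_MEAN - IRAN_MEAN:.2f} raw points, or {d:.2f} SD units")
```

A different reference standard deviation would change the size of the standardized difference.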

The inferior performance of Iranian students could be attributed to lack 
of test sophistication, slowness in reading, and deficiencies in schooling (4). 
Other sources that may have contributed to this inferior performance might 
be lack of cultural stimulation in the home, perceptual deprivation, child-rear- 
ing practices which often discourage rather than encourage intellectual develop- 
ment, and the style and the patterns used in Persian language both in reading 
and writing (2). 

The coefficients of correlation as shown in Table 2 indicate that the Persian
form of Otis-Lennon MAT, Advanced Level enjoys some degree of validity
and can be considered as a valid instrument to assess the general mental ability,
or scholastic aptitude, of pupils in Iranian secondary schools. In addition, the
systematic increase of mean intelligence scores, as shown in Table 3, may be
considered as a mark of the construct validity of the test. Moreover, the fact
that Mathematics students scored significantly higher (p < .001) than those
of Natural Sciences and Literature can be taken as further evidence of the
validity of the test. The more intelligent students of Iranian high schools
tend to major in Mathematics rather than either in Natural Sciences or Literature.

In order to provide evidence on the reliability of the test, a small group of
subjects (N = 42) who had been admitted to the University were retested
about seven months later under normal conditions. The test-retest reliability
coefficient (r = .60) was found to be decidedly smaller than those reported
in the Test Manual (8) and those generally considered as satisfactory (...).
The lower test-retest reliability of the test may be due partly to the long time
interval between the two occasions and partly to the small and restricted
range of the sample.


D. Summary 


A Persian adaptation of the Otis-Lennon Mental Ability Tests, Advanced 
Level was administered to 1012 Iranian secondary-school graduates (768 
boys and 244 girls). The mean scores of the subjects were found to differ
according to the areas of specialization in high school. Iranian subjects as a
group scored strikingly lower than their American counterparts. The test
was found to correlate with batteries of achievement tests. Test-retest reliability
coefficients over a seven-month interval were found to be relatively low.
DIFFERENTIAL EFFECTS OF STATED PREFERENCES AND
QUESTIONNAIRE ROLE PERFORMANCE ON INTER- 
PERSONAL ATTRACTION IN THE 
DATING SITUATION* 


Purdue University 


James P. Curran


A. Introduction


Tharp (30) stated that the study of similarity or complementariness of
attributes is not a fruitful approach in marriage research. He favored a closer
examination of the stimulus situation and the roles involved therein. Rosow
(24) agreed with Tharp (29, 30) in opting for a role theory approach in the 
study of interpersonal attraction. They stated that it would be more efficient 
to examine the needs, abilities, etc. that are important in fulfilling the func- 
tions of the roles in a particular social relationship than to attempt to estab- 
lish a general attraction principle across all groups and attributes studied. 
Many needs may be more readily gratified outside of the relationship that is 
under study and, hence, may be irrelevant. 

A logical but difficult way to test the effects of differential role fulfillment 
is to examine the actual behaviors of members of a dyad or group. If a func- 
tional analysis of their behavior yields consistent differential functioning and 
this is correlated with differential attraction, then perhaps role theory is a 
useful conceptualization.

This is not the usual procedure. Responses to a questionnaire or personality 
inventory are taken as substitutes for a functional behavioral analysis. There is, 
of course, no guarantee of a one-to-one correspondence between questionnaire 
responses and actual behavior. If one is willing to assume at least a partial 
correspondence, then the utility of role theory could be supported by data 
that demonstrated consistent differential responses to role questionnaires that 
were related to interpersonal attraction. These differential questionnaire responses
are then assumed to reflect differential role enactments.

Another alternative to the assessment of differential role performance is to 
request that subjects rate or rank those attributes important to the functioning 
of certain roles (wife, roommate) and to have them describe the ideal level of 


* Received in the Editorial Office on August 30, 1972, and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 
intensity of possession of the particular attribute. For example, a subject
might rank intelligence an important attribute in the marital dyad and might
state that an ideal level of intelligence to possess for a harmonious relationship
may be an IQ of 115 for a male and 100 for a female.

Many outcomes from this approach are possible. Some of these outcomes 
are (a) lack of role differentiation—i.e., subjects may give approximately the 
same rank order of attributes and preferred intensity levels regardless of 
various roles; (b) lack of role agreement—i.e., subjects’ responses may be so 
idiosyncratic or so qualified as to the duration and nature of the relationship 
that no consensus of a role is reached; (c) subjects are able to differentiate 
various roles and there is fair consensus of important attributes and their 
desired intensities, but this is not related to interpersonal attraction in actual 
interactions. That is, there is no guarantee that if these roles are enacted in 
the stated preferred manner that they would foster harmonious interactions.
Subjects might not know what they want.

A study by Rychlak (25) indicated that subjects respond with differential 
preferences (rankings, intensities) when requested to complete an inventory for 
various role types. Subjects then appear to be able to respond to role differentia- 
tions. The next question to be answered is how consistent are their responses. 
McGinnis (22) replicated an earlier study by Hill (16) concerned with 
preferences in mate selection. A good deal of consistency in rated importance
of 18 personality traits was demonstrated. 

However, Hobart (17) contended that marriage role expectations change
according to different courtship changes. Hewitt (15) also found different
patterns of traits, both those desired in partners and those believed to be
important, when subjects have marriage partners or dating partners as references.
Consensus on husband and wife roles increases with length of marriage
(11). It therefore appears that there is some inconsistency in preference when
the reference group changes and with increased interaction or experience with
the reference group.

There have been some demonstrated sex differences in role expectations in
marriage (20). Women were more concerned with receiving affection, sympathy,
and understanding, and stressed achievement for the males. Males were
more concerned with having a spouse who was neat and tidy, adjusted to a
routine, avoided friction, was home loving, dependable, reasonably even-tempered,
and made a good impression. It would appear to be reasonable to
expect sex differences in role expectations in the dating dyad. Skipper and
Nass (27) concluded that in their sample the female’s primary motive in the
dating situation was mate selection, but to males the primary motive was
recreation.

If this approach is to be useful, then the closer a person approximates the 
ideal role enactment the better should be his chances of being liked. 

Evidence that the fulfillment of role expectations may be related to inter- 
personal attraction in the dating situation was provided by Coombs (10). Sub- 
jects who participated in a computer dating study were administered a number 
of questions to which they had to state their preferences for a date. Approxi- 
mation to preferences was significantly related to interpersonal attraction. 

This present study attempted to explore the relationship between stated 
preferences and the actual attributes which are determinants in fostering inter- 
personal attraction. This logically calls for stated role differences in prefer- 
ence, moderate consensus in these differences, and a relationship between actual 
role differentiation and interpersonal attraction. 


B. Method
1. Subjects and Procedures 


Ss were university undergraduate and graduate students. All Ss were 
Caucasian and citizens of the United States. The majority of Ss (females, 
84%; males, 79%) were between the ages of 17 and 25.

A computer dating service called Psych-A-Date was established in order 
to match Ss for a date. Recruitment of Ss was conducted by means of adver- 
tisements in the university’s daily newspaper and flyers posted in public places 
on campus and distributed to students during registration. These notices 
instructed potential Ss concerning the time and place of the testing sessions, 
which were held over a two-week period.

The testing period lasted approximately two hours. During this period 
the operational procedures of Psych-A-Date were explained to the Ss, and 
they were administered a battery of tests. 

Each S was notified of the date’s name, address, and telephone number by 
means of a postcard, approximately two weeks after the testing period. It was 
the responsibility of the dating couples to arrange the date, and there were no 
restrictions placed on the dating situation.

To ensure a good return of the dependent measure Ss were initially charged 
$2.50 for participation in the study. Two dollars of the initial fee was 
refunded to the Ss when they returned the dependent measure. The remainder 
of the fee went for expenses incurred—e.g., stamps, envelopes, etc. 




2. Predate Measures 


The 16 Personality Factor Questionnaire (16PF) (8) and the Motivation
Analysis Test (MAT) (9) were administered in the matching procedure.
The raw scores from these tests were converted to sten scores. A university
normative sample was furnished by the publishers for this conversion.
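
For readers unfamiliar with the sten scale, the general definition is a standard score with a mean of 5.5 and a standard deviation of 2, expressed as whole numbers from 1 to 10. The sketch below applies that definition linearly. It is illustrative only: the study relied on publisher-supplied norms, and the normative mean and standard deviation used here are hypothetical values.

```python
import math

def to_sten(raw, norm_mean, norm_sd):
    """Convert a raw score to a sten (1-10) by the linear definition:
    sten bands are half a standard deviation wide and centered on 5.5."""
    z = (raw - norm_mean) / norm_sd
    sten = math.floor(2.0 * z + 6.0)   # equals rounding 5.5 + 2z half-up
    return max(1, min(10, sten))       # clip to the 1-10 range

NORM_MEAN, NORM_SD = 16.0, 4.0         # hypothetical normative values

for raw in (2, 15, 16, 20, 40):
    print(raw, to_sten(raw, NORM_MEAN, NORM_SD))
```

Scores far from the normative mean are clipped to the end points of the scale.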

Ss were also requested to state their preference for an ideal date by checking
on a 10-point bipolar scale where they wished their date to fall on each of
the 16PF and MAT dimensions. In addition, Ss were asked to rate each of
the 26 variables for their relevancy in the dating situation. Each variable was
rated on a five-point scale ranging from very relevant to completely irrelevant.

Other information hypothesized to be pertinent to the success of the date
was also obtained. This information is not germane to the present study and
will not be discussed.


3. Postdate Measure 


The dependent measure consisted of a series of 30 questions dealing with
such items as enjoyment received from the date, amount of money spent on the
date, whether they would date the person again, amount (relative) of physical
contact, etc. Ss responded to multiple choice options to these questions. It was
felt that some of these questions were good interpersonal attraction items and that
other items would be unrelated to interpersonal attraction and would instead
measure other dating factors.


4. Matching 

All Ss were matched in ...
the male was as old as or older than the female, ...
or taller than the female, ...
in physical attractiveness, ...
computer dating and never went out on the
date. In some cases the couples did go on the date, but only one partner
returned the dependent measure. One possible reason for this type of failure
to complete the study may be that some Ss lost their dependent measure and
failed to remember the procedure to obtain another.


1 A copy of this scale may be obtained from the author upon request. 
C. Results
1. Factorization of the Dependent Measure 


The items on the dependent measure were intercorrelated and placed in a 
principal axis factor program with unities on the diagonal. The eigenvalues 
were plotted, and Cattell’s Scree Test (7) was then used to determine the 
number of factors. Communalities were then estimated by use of an iterative
principal axis factor analysis. A Maxplane program (13) was then utilized 
in order to allow the factors to be rotated obliquely. This was followed by a 
Rotoplot program in order to approximate better Thurstone’s (31) simple 
structure criterion. 

Although eight factors were removed and rotated, only the attraction 
factor will be discussed here.² Items with high correlations with this factor
were as follows: enjoyability of the date (.39), liking the date partner (.46), 
how much one would like to date the partner again (.55), desirability of 
the date as a marriage partner (.58), desirability of the date as a friend 
(.43), the date favorably exceeding expectations (.59), and the date’s per- 
ceived (by the S) physical attractiveness (.54). 

It was felt that these items (with the exception of physical attractiveness)
represented a pool of items that were good indications of the interpersonal 
attraction construct. Many of these same items were intercorrelated in 
Coombs’ (10) study. Essentially the same factor structure was achieved 
in a pilot study. 

Ss’ scores on the attraction factor were computed by adding their raw 
scores on the above-mentioned items (with the exception of physical attrac- 
tiveness). Combined couples’ scores could have ranged from 8 to 48 on the 
attraction factor. The actual range was from 9 to 42. The median score was
26, the mean was 26.72 with a standard deviation of 7.55. 

There was some consensus (r = .28) between partners in rating the 
satisfaction or lack of satisfaction with the date. That is, if one rates the 
date as unsatisfactory, then the partner also tends to feel it was unsatisfactory. 
The mean rating for the female raters was 13.24 with a standard deviation 
of 4.57. For males, the mean was 13.48 with a standard deviation of 4.85. 

2 For an analysis of these other factors see Curran (12).

For a later analysis couples were divided at the median of their summed
attraction score into two groups: those that were highly attracted had a
mean of 32.84 and a standard deviation of 4.38; those that were low in attraction
had a mean of 20.43 with a standard deviation of 4.16. These groups
were significantly different at the .01 level.
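
A median split of this kind can be sketched as follows. The couple scores below are hypothetical values chosen for illustration, not data from the study, and the handling of scores that fall exactly at the median (placed here in the low group) is an assumption, since the report does not say how ties were treated.

```python
from statistics import mean, median

def median_split(scores):
    """Divide scores into a high group (above the median) and a low group
    (at or below the median). Tie handling is an assumption of this sketch."""
    cut = median(scores)
    high = [s for s in scores if s > cut]
    low = [s for s in scores if s <= cut]
    return cut, high, low

# Hypothetical summed attraction scores for 14 couples.
couple_scores = [9, 15, 18, 20, 22, 24, 26, 26, 28, 30, 33, 35, 38, 42]

cut, high, low = median_split(couple_scores)
print(f"median = {cut}, high n = {len(high)}, low n = {len(low)}")
print(f"high mean = {mean(high):.2f}, low mean = {mean(low):.2f}")
```

Because the groups are formed from the attraction scores themselves, a difference between their means on that same score is expected by construction.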


2. Consistency in Preferences Between High
and Low Attracted Groups


As mentioned previously, couples were divided at the median of their at- 
traction scores into high and low attracted groups. It is possible that members 
of these two groups had differential preferences. Table 1 contains the means, 
standard deviations, and the results of ¢ tests between the two groups. The 
only significant difference was that those in the low-satisfied group requested 
a date higher in ego strength (emotionally stable, mature, unruffled, etc.) 
than those in the high-satisfied group. Both groups preferred dates who were
low on pugnacity (destructive, hostile), protension (self-opinionated, jealous),
and guilt proneness (apprehensive, worrying). Dates who ranked high
on the mating erg (heterosexuality), assertiveness (self-assertion, achievement),
sweetheart sentiment (sentimental, romantic), affectothymia (outgoing,
warm-hearted), intelligence (bright), ego strength (mature), and
self-sufficiency (resourceful, self-sufficient) were preferred by both groups.


3. Sex Differences in Preference 


There were many significant differences between female and male 
preferences (Table 1). Female preferences for an ideal male were higher than 
male preferences for an ideal female on the following factors: assertiveness, 
superego, career, parental, ego strength, dominance, and superego strength. 
Male preferences were higher than female preferences on the mating erg, 
narcism, premsia, guilt proneness, and radicalism. These differences appear to 
reflect Skipper and Nass’ conclusion (27) that females’ primary [...] 
parent) but the males’ primary motive [...] narcism). These preferences also 
reflect sex [...] prefer males who are assertive and dominant, [...] are 
dependent, [...] 

4. Choice Preference (Similar or Dissimilar to Self) 


When Ss are asked to state their preferences for an ideal date, will they 
choose someone who is like themselves or different? 

Research has indicated that friends perceive themselves as being similar 
(1, 19, 21) even though this may be the result of judgmental error (23). 
Ss also are more attracted to bogus strangers who are described as similar 
to themselves (3, 4, 6, 14). However, Stuart’s Ss (28), when asked to state 
their preference for mates, requested mates whose personality qualities 
complemented their own. 

Similarity of personality attributes was measured (18) by means of the 
pattern similarity index (Rp). This coefficient expresses the degree of sim- 
ilarity from +1.00 to —1.00 (somewhat analogous to the correlation coeffi- 
cient) that exists between two different profiles of test scores. The pattern 
similarity coefficient distribution is known, and significance levels are estab- 
lished. An Rp with 25 degrees of freedom is significantly different from zero 
at the .05 level if it exceeds +.25 or −.21. The majority of the Rps were 
negative: only 18 were positive (one significant) for the females and 26 (three 
significant) for the males. Twenty-two of the Rps for females and 23 for 
the males were significant in the negative direction. Hence, it does not appear 
that the majority of the Ss desired a date that was similar to them on these 
personalogical and motivational factors. It is felt that Ss were not so much 
choosing a date that was dissimilar or complementary to themselves, but, 
rather, choosing a close to perfect stereotype which in most cases proved to be 
dissimilar to them. For example, most Ss requested a highly intelligent date, 
and in most cases this was discrepant with their own sten scores. 
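
The pattern similarity computation can be illustrated with a minimal sketch. It assumes the commonly cited form of Cattell's coefficient, r_p = (2k − Σd²) / (2k + Σd²), where d is the difference between two profiles on each factor in standard-score units and k is the median chi-square value for the number of factors. The source does not spell out the formula it used, so the function names, the sten standard deviation of 2, the chi-square median approximation, and the profiles below are all assumptions made for illustration.

```python
def chi2_median(df: int) -> float:
    """Wilson-Hilferty approximation to the median of a chi-square variate."""
    return df * (1.0 - 2.0 / (9.0 * df)) ** 3


def pattern_similarity(profile_a, profile_b, sd=2.0):
    """Cattell-style r_p between two profiles scored on the same factors.

    sd rescales raw differences to standard-score units; 2.0 is the nominal
    standard deviation of a sten scale (an assumption, not from the source).
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same factors")
    n_factors = len(profile_a)
    sum_d2 = sum(((a - b) / sd) ** 2 for a, b in zip(profile_a, profile_b))
    expected = 2.0 * chi2_median(n_factors)  # the "2k" term
    return (expected - sum_d2) / (expected + sum_d2)


# Hypothetical 26-factor sten profiles (illustrative values only).
own_profile = [5, 6, 4, 7, 5, 6, 5, 4, 6, 5, 5, 6, 7,
               4, 5, 6, 5, 5, 4, 6, 5, 7, 5, 6, 4, 5]
stereotyped_ideal = [9] * 26  # a "close to perfect" ideal on every factor

rp_same = pattern_similarity(own_profile, own_profile)
rp_ideal = pattern_similarity(own_profile, stereotyped_ideal)
print(f"r_p with self: {rp_same:.2f}; r_p with stereotyped ideal: {rp_ideal:.2f}")
```

Under these assumptions, a uniformly extreme ideal yields a negative coefficient for an average respondent, which is in keeping with the interpretation that Ss chose a stereotype rather than a deliberately complementary partner.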


5. Relationship Between Ideal Date and Attraction 


There was good consensus in what constituted an ideal date. The standard 
deviations for each preference were quite small (Table 1). Since there is fair 
consensus on a stereotypic ideal date, it would appear to be reasonable to 
expect that the closer the prospective date approximates this average ideal, 
the more likely the partner will be attracted to him. Rps were computed be- 
tween the male and female average preferences (Table 1) and their actual 
date’s sten scores. A correlation was then computed between this coefficient 
and the partner’s attraction toward them (Table 2). 

TABLE 2 


INTERCORRELATIONS AMONG SATISFACTION SCORES AND INDICES OF SIMILARITY 
BETWEEN ACTUAL DATE AND VARIOUS IDEAL DATES 


Variables                              1      2      3      4      5      6 
1 Male Satisfaction                   1.00 
2 Female Satisfaction Score            .28   1.00 
3 Rp between female’s profile 
  and male average ideal date          .03   [illegible] 
4 Rp between female’s profile 
  and partner’s ideal date             .17   [illegible] 
5 Rp between male’s profile and 
  female average ideal date            .04   [illegible] 
6 Rp between male’s profile and 
  partner’s ideal date                 .04    .04    .28    .36    .70   1.00 
A similar analysis was conducted between the attraction scores and the Rp 
coefficients which reflected the date’s approximation to his partner’s individual 
preferences for an ideal date (Table 2). 

Table 2 contains a correlation matrix which indicates relationships among 
satisfaction scores, the degree to which one’s date approximates (Rp) one’s 
idea of an ideal date, and the degree to which one approximates (Rp) the 
average ideal date. 

As noted previously, male and female satisfaction was correlated signifi- 
cantly (.05 level, two tail). Male satisfaction was related to his date ap- 
proximating his idea of an ideal date (.17 significant at .10 level, two tail). 
No significant correlations existed between male satisfaction and approxima- 
tion to average ideal date. Female satisfaction was not related to male ap- 
proximation to her ideal nor to the average female ideal date. 


6. Examination of Relevance Data 


Ss were asked to rate the relevance of the 16PF and MAT factors for the 
dating situation on a five-point scale. An (a) response was scored as a four 
with an (e) response representing zero. Table 3 contains the mean relevance 
scores of these ratings for males and females separately. t tests were run 
between the two groups, and there were 10 significant differences. In every 
case the females gave the trait a higher relevancy rating than the 
males. It may be that females at least verbally express more interest in the 
personality of their dates than do males. It appears that most of these traits 
fall into two classifications: (a) those conforming to the male sex role in 
general, such as assertiveness, self-sufficiency, etc. and (b) those instrumental 
to the dating situation, such as sweetheart sentiment, affectothymia, etc. 

Kendall’s Tau coefficient was used to examine the rank order of these 
traits for both male and female relevance responses. There was fair agreement 
(Kendall’s Tau = .55) between the sexes with major discrepancies occur- 
ring with self-sufficiency (female higher), self-concept (female higher), 
radicalism (female higher), narcism (male higher), and superego (male 
higher). 
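
As an illustration of the rank-order comparison, the following sketch computes Kendall's tau (the tau-a form, which assumes no tied ranks) from concordant and discordant pairs. The two rankings are hypothetical and are not the study's data; the function name is likewise an assumption.

```python
from itertools import combinations


def kendall_tau_a(ranks_x, ranks_y):
    """Kendall's tau-a for two rankings of the same items (no ties assumed)."""
    if len(ranks_x) != len(ranks_y):
        raise ValueError("both rankings must cover the same items")
    concordant = discordant = 0
    for i, j in combinations(range(len(ranks_x)), 2):
        direction = (ranks_x[i] - ranks_x[j]) * (ranks_y[i] - ranks_y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    n_pairs = len(ranks_x) * (len(ranks_x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical relevance ranks of six traits (1 = most relevant).
female_ranks = [1, 2, 3, 4, 5, 6]
male_ranks = [2, 1, 3, 6, 4, 5]

tau = kendall_tau_a(female_ranks, male_ranks)
print(f"Kendall's tau = {tau:.2f}")
```

A value near 1 indicates that the two groups order the traits almost identically, and a value near 0 indicates no systematic agreement.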

There were no significant differences between low-satisfied couples and 
high-satisfied couples on their relevance responses. There was fair agreement 
in their rank ordering (Kendall’s Tau = .66). The low-satisfied group 
ranked shrewdness and narcism quite a bit higher, and self-concept, sentiment, 
and radicalism quite a bit lower than did the high-satisfied couples. 
TABLE 3 
MEAN DIFFERENCES BETWEEN MALES AND FEMALES ON THEIR 
16PF AND MAT RELEVANCY RESPONSES 


                              Females             Males 
Factors                     Mean    SD          Mean    SD          t 
1  Mating                   3.21    .96         2.96    .93        1.63 
2  Assertiveness            3.37    .76         3.09    .93        2.01** 
3  Fear                     3.05    .98         2.73    .88        2.10** 
4  Narcism                  2.36    .95         2.41   1.05         .32 
5  Pugnacity                2.58    .99         2.42   1.05         .98 
6  Self-concept             2.66   1.04         2.02   1.20        3.49*** 
7  Superego                 2.21   1.05         2.37    .95         .97 
8  Career                   2.42    .98         2.24   1.16        1.06 
9  Sweetheart               3.08   1.07         2.65   1.14        2.35** 
10 Parental                 2.49    .99         2.33   1.08         .94 
A  Affectothymia            2.52   1.10         2.10   1.25        2.13** 
B  Intelligence             2.69   1.00         2.36   1.12        1.92 
C  Ego Strength             2.53   1.16         2.65   1.06         .65 
E  Dominance                2.45   [illegible]  2.21   1.11        1.29 
F  Surgency                 2.32    .98         2.09   1.06        1.34 
G  Superego Strength        3.10    .95         2.82    .94        1.81 
H  Parmia                   2.77   1.08         2.93   1.01         .93 
I  Premsia                  2.62   1.04         2.32   1.10        1.74* 
L  Protension               1.47   1.14         1.27   1.09        1.09 
M  Autia                    2.11   1.06         1.88   1.01        1.34 
N  Shrewdness               2.53   1.15         2.33   1.11        1.08 
O  Guilt Proneness          2.50    .92         2.05   1.03        2.85*** 
Q1 Radicalism               2.59   [illegible]  1.97   1.38        2.90*** 
Q2 Self-sufficiency         2.70   1.08         2.04   1.23        3.52*** 
Q3 Self-concept control     2.73    .95         2.48   [illegible] 1.50 
Q4 Ergic Tension            2.02   1.09         1.72   1.29        1.57 


* p < .05, one tail. 
** p < .05, two tail. 
*** p < .01, one tail. 


7. Examination of “Actual” Role Differences 
and Attraction Scores 


An examination of the correlation between a partner’s satisfaction with 
his date and his date’s sten levels on the 16PF and MAT factors yields some 
interesting role relationships (Table 4). On E, the dominance factor, there 
is a positive correlation between male dominance and female satisfaction and 
a negative correlation between female dominance and male satisfaction. 
It appears that females preferred males who were assertive, independent, 
dominant, socially bold, uninhibited, and spontaneous, while males preferred 
females to be conforming, accommodating, submissive, restrained, diffident, 
and timid. Similar relationships hold for parmia (shy versus adventuresome) 
and autia (practical versus imaginative). That is, females preferred males 
to be adventuresome and imaginative, while males preferred females who 


TABLE 4 
CORRELATIONS BETWEEN PARTNER’S SATISFACTION WITH HIS DATE AND 
HIS DATE’S STEN LEVEL ON 16PF AND MAT FACTORS 


Correlations Correlations 
between male between female 
satisfaction and satisfaction and Z (of 
Factors female sten level male sten level difference) 
1 Mating 01 —.03 24 
2 Assertiveness 12 03 55 
3 Fear 00 04 24 
4 Narcism -03 10 A3 
5 Pugnacity —12 —.07 31 
6 Self-concept 04 —.04 48 
7 Superego —.04 .03 43 
8 Career 16 .03 54 
9 Sweetheart 08 3 31 
10 Parental 05 03 12 
A Affectothymia .03 04 06 
B Intelligence —.02 +10 2B 
C Ego Strength —.04 ' 
E Dominance —.28%* 276 3.4200 
& Surgency —.05 a ae 
> Superego Strength 07 ‘ 
H Pavia es 2408 age" 3.2900" 
I Premsia —.14 01 A 
L_ Protension —.06 —10 rad 
M Autia —03 270" 1,868 
N_ Shrewdness lt — 230" 1.96% 
O Guilt Proneness —07 — 228 Pd 
Q1 Radicalism —09 07 Ag 
Q2_ Self-sufficiency —AL —03 4 
Q3  Self-concept control 203 —03 +4 
Q4 Ergic Tension 01 03 x 


* p < .05, one tail. 
** p < .05, two tail. 
*** p < .01, one tail. 


were shy and practical. For the shrewdness factor male satisfaction was 
positively correlated with shrewdness in the female, while female satisfaction 
was negatively correlated with shrewdness in the male. Female satisfaction 
also appeared to be related to high M and low N and O in a male. Hence, 
a female preferred a male who was characterized as imaginative rather than 
practical, sentimental and natural rather than calculating and polished, and 
self-assured and confident rather than worrying and depressive. 

There was not much correspondence between what males and females 
actually found attractive (Table 4) and what they stated as their preferences 
(Table 1). For example, females in their preferences (Table 1) indicated 
they wished their ideal male to be below the 5 sten on the autia factor. In 
the actual dating situation, there was, however, a positive correlation 
between attraction and high scores on autia (Table 4). Those traits on which 
sex differences in sten levels were found also did not receive high relevancy 
rankings (Table 3). 
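
The Z column of Table 4 compares each male-satisfaction correlation with the corresponding female-satisfaction correlation. The source does not state how that Z was obtained; one standard approach for two independent correlations, assumed here purely for illustration, is Fisher's r-to-z transformation. The function name and the sample sizes are hypothetical, since the number of couples is not given in this section.

```python
import math


def z_for_correlation_difference(r1, n1, r2, n2):
    """Z statistic for the difference between two independent correlations.

    Each r is transformed with Fisher's z (atanh); the difference is divided
    by the standard error sqrt(1/(n1 - 3) + 1/(n2 - 3)).
    """
    fisher_1 = math.atanh(r1)
    fisher_2 = math.atanh(r2)
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (fisher_2 - fisher_1) / standard_error


# Dominance correlations as read from Table 4; n = 80 per group is a
# hypothetical sample size chosen only to make the example runnable.
z_dominance = z_for_correlation_difference(-0.28, 80, 0.27, 80)
print(f"Z = {z_dominance:.2f}")
```

With correlations of opposite sign and roughly equal size, the statistic is large even for moderate samples, which is the pattern the dominance row shows.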

Another examination of the different role relationships in the two groups, 
low-satisfied and high-satisfied, may be found in Table 5. A discriminant 
function analysis was conducted between the sign differences of female and 
male scores on the personalogical and motivational factors. Those with high 
discriminant weights are important factors in discriminating the different role 
relationships between the two groups. Of course, as in any discriminant 
function, this is the best fit linear regression line which will maximally 
discriminate between the two groups. High negative weights mean that the 
difference between males and females in the high-satisfied group is greater than 
the difference between males and females in the low-satisfied group. 

Males in the high-satisfied group have larger differences with their dating 


TABLE 5 
RESULTS OF DISCRIMINANT ANALYSIS BETWEEN LOW-SATISFIED AND HIGH-SATISFIED COUPLES 
ON THE SIGN DIFFERENCE OF FEMALES’ AND MALES’ SCORES 


Low-satisfied High-satisfied 
couples’ means couples’ means Discriminant 

Factors Male Female Male Female weights 

1 Mating 4.0 3.1 : — 
2 Assertiveness 3.1 3.2 37 33 403 
3. Fear 51 3.9 51 33 6.91 
4 Narcism 74 6.3 7.6 6.8 2.65 
5 Pugnacity 3.5 3.8 34 3.9 2.28 
6 Self-concept 5.8 5.4 52 5.1 10.03 
7 Superego 7.6 7.0 77 63 185 
8 Career 52 47 5.5 38 — 16 
9 Sweetheart 43 44 5.4 5.6 7.47 
10 Parental 6.8 6.7 64 6.9 1.79 
A. Affectothymia 5.0 58 5 52 ‘07 
B_ Intelligence 7.0 6.1 69 63 iF 
C Ego Strength 5.4 5.5 61 54 204 
E Dominance 5.5 61 64 54 —9.04 
F Surgency 5.0 5.7 5.5 5.4 —7.00 
GS Superego strength 51 43 46 48 =2'19 

armia 5A oe i i “Ty 
I Premsia 5.2 sy Se Be el 
L_ Protension 3.4 5.6 5.1 5.0 ase 
M Autia 63 6.1 68 3 70 
N_ Shrewdness 5.6 5.6 46 H ks 
O Guilt Proneness 5.5 5.2 43 a ae 
Q1 Radicalism 64 5.9 68 = Pd 
Q2  Self-sufficiency 6.2 6.1 65 ag nee 
Q3 Self-concept control 5.6 5.2 5.1 42 met 
Q+ Ergic Tension 54 49 5.5 ae are oF 


Note: F ratio = 1.61. 
partner than do males in the low-satisfied group on the fear erg, dominance, 
surgency, and premsia. In the case of surgency, dominance, and premsia the 
male in the low-satisfied group was actually lower than his female partner. 

Males in the low-satisfied group have larger positive differences with their 
dating partner than do males in the high-satisfied group on the self-concept 
sentiment, sweetheart sentiment, and parmia. 

Although it must be remembered that the male and female sten scores are 
assigned by the same sex group norms, the fact that females in the low- 
satisfied group actually scored higher than the males in the low-attracted 
group indicates that a reversal in traditional role relationships (Table 5) may 
lead to an unsatisfactory date. 


D. Discussion 


No conclusions can be drawn concerning the effectiveness of a role theory 
approach to the study of interpersonal attraction in the dating situation. The 
actual role differences that were related to attraction in this study need to 
be replicated. 

This study has questioned, however, the method of requesting Ss to state 
their ideas on the relevancy of various factors and their preferences as to 
optimal levels. Approximation to these ideals was not related to attraction. 
The actual preferences (relationships between actual test scores and attrac- 
tion) were not related in any systematic way to stated preferences. 

There were sex differences in preference for an ideal date which 
approached a fair degree of consensus. There were also significant relationships 
between differential role enactment (as measured by responses to a 
questionnaire) and interpersonal attraction. However, there was little relationship 
between stated preference and interpersonal attraction. 


E. Summary 


The importance of role relationships in a computer dating study was 
examined. Ss were requested to state their ideal date preferences [...] 
personalogical and motivational characteristics. In addition, [...] 
were measured via questionnaires on these same [...] 
to state their preferences for an ideal date with [...] 
within sex. However, a dating partner’s approximation to [...] ideal date was 
not related to the success of the date nor [...] 
were correlationally related to date attraction. Date satisfaction was related, 
however, to role differentiation as measured by questionnaire response. A 
number of these fell along traditional conceptualizations of the male-female 
dating role. 
CONSERVATION OF VOLUME: TESTING AS 
A MEANS OF INSTRUCTION* 


Department of Mathematics, Northern Illinois University 


GEORGE W. BRIGHT 


A. Introduction 


Harrison (7) indicated in his review of Piagetian research that much 
attention has been paid to the determination both of the mean age-levels at 
which various conservation skills are attained and of appropriate ways to 
lower these age-levels through instruction. Recent work seems to emphasize 
the latter area (e.g., 1, 8, 9, 10, 11, 12). Virtually all of these studies are 
concerned with developing instructional strategies for public school children 
[McKinnon (8) is the sole exception]; no one seems to have addressed the 
problem of identifying and correcting deficiencies in the conservation skills of 
elementary teachers. 

Elkind (3, 4) analyzed the conservation of volume skills of high school 
and college students. He reported that 42% of psychology students could 
not conserve volume. Hall and Kingsley (6) reported that 29% of psychology 
graduate students and 26% of psychology upperclassmen failed a conservation 
of volume question similar to those of Elkind (3). 

Elkind (5) in a reanalysis of conservation of volume tasks identified two 
kinds of tasks: conservation of identity and conservation of equivalence. 


The judgment could be viewed as dealing with the conservation of a 
given weight, length, number, etc. across a reversible transformation 
and with respect to itself alone. . . . Suppose that in the weight con- 
servation problem . . . one employed only a single ball of clay which 
was then rolled into a sausage, and the child was asked whether the 
clay was now the same weight as before. . . . This would be a direct 
assessment of what will hereafter be called the “conservation of ident- 
ity.” . . . Conservation judgments . . . can also . . . [assess] the in- 
variance of a quantitative relation . . . across a transformation of one 
of the elements of that relation. . .. The standard conservation prob- 
lem . . . provides a direct test of the form of conservation which will 
hereafter be called the “conservation of equivalence” (5, pp. 16-17). 


Elkind further stated that “conservation of identity would seem to be a 
necessary but not a sufficient condition for the attainment of equivalence 
conservation” (5, p. 17). This suggestion of hierarchical ordering can be 
rephrased in the terminology of Eisenberg and Walbesser (2) by stating that 
the ordering should be consistent and complete but perhaps not adequate. 


* Received in the Editorial Office on August 31, 1972, and published immediately 
at Provincetown, Massachusetts. Copyright by The Journal Press. 
The questions used by Elkind (3, 4) to assess conservation of volume 
measured conservation of equivalence. No measure was taken of conservation 
of identity. Further, no measure was made of the effect of the testing pro- 
cedures on the subjects’ abilities to conserve volume. Presumably, very little 
instruction would be required to teach college students appropriate volume 
conservation skills. Are the testing procedures sufficient instruction to this 
end? 

Two studies were conducted to test the following hypotheses: 

1. Among prospective elementary school teachers, the rate of attainment 
of conservation of identity will be higher than the rate of attainment of 
conservation of equivalence. 

2. Among prospective elementary school teachers, a second 
administration of conservation tests will not result in higher rates of attainment of 
either identity or equivalence conservation. 

3. Among prospective elementary school teachers, Elkind’s hierarchical 
ordering of identity and equivalence conservation is a consistent and complete 
but not adequate ordering. 

The design of the experiment also permitted an analysis of whether testing 
for identity conservation affected the results of a retest for equivalence 
conservation, and conversely. No directional hypotheses were made for these 
analyses. 


B. Method 


1. Questions 

The questions used to [...] Elkind (3), and the questions [...] 
constructed by the experimenter. 


2. Procedures 
a. Study I. The subjects in the first study (Study I) were the students 
(N = 120) in one section of the mathematics methods course for 
prospective elementary school teachers. During the last 20 minutes of a Monday 
class period, the subjects were randomly divided into two groups. One group 
was asked the identity conservation questions by a graduate student, and the 
other group was asked the equivalence conservation questions by the 
experimenter. In each case, the questions dealing with amount were asked, 
the clay models were reformed into balls while the subjects watched, and 
the questions were repeated, with “volume” replacing “amount.” 
Experimenter-prepared scripts were followed in each case, with the students 
responding on prepared answer sheets. 

During the first 20 minutes of the following Wednesday class period, 
the subjects were again randomly divided into two groups. The same 
procedures were followed with one exception. After all other questions had 
been asked, the subjects were asked to respond to the following questions: 
Were you in class Monday? If so, did you see the same demonstration last 
Monday that you saw today? 

The subjects were classified as conservers of amount (or volume) if and 
only if their responses to all four relevant questions were consistent and 
correct. Because of uninterpretable responses, 13 of the subjects in the first 
administration of the questions were discarded. Data of subjects who 
participated in the second administration of the questions but who indicated they 
did not participate in the first were discarded. 

b. Study II. The subjects in the second study (Study II) were students 
(N = 130) in another section of the same course. The same procedures 
were followed with two exceptions. First, the experimenter was replaced 
by the course instructor as an administrator. Second, the subjects were asked 
to put their names on the answer sheets. Only those subjects who participated 
in both questioning sessions were included in the analysis. Because of missing 
data and uninterpretable responses, six subjects were discarded. 


C. Results 


All subjects were conservers of amount, independent of the kind of con- 
servation measured. Hence, all of the following results will deal exclusively 
with conservation of volume. The numbers of conservers and nonconservers 
observed in the first and second questioning sessions are presented in Table 1. 
These totals are broken down by treatment groups in Table 2. 

To test Hypothesis 1, two χ² values were computed. In Study I the 
distribution of conservers and nonconservers for the identity questions was 
compared to the distribution for the equivalence questions. The computed value 
was 13.7 (p < .01). The analogous χ² value for Study II was .21 
(nonsignificant). 
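
The comparison of two distributions of conservers and nonconservers can be sketched as a chi-square test on a 2 × 2 table. The example below uses the first-testing counts for Study II from Table 1 and the uncorrected Pearson statistic; whether the author applied a continuity correction is not stated, so the choice of formula and the function name are assumptions.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row_1, row_2 = a + b, c + d
    col_1, col_2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row_1 * row_2 * col_1 * col_2)


# Study II, first testing (Table 1): conservers and nonconservers.
identity = (41, 9)
equivalence = (63, 11)

chi2_study_2 = chi_square_2x2(*identity, *equivalence)
print(f"chi-square (1 df) = {chi2_study_2:.2f}")
```

With one degree of freedom the .05 critical value is 3.84, so a statistic of this size is far from significant, in line with the nonsignificant result reported for Study II.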
TABLE 1 
NUMBERS (PERCENTAGE) OF CONSERVERS AND NONCONSERVERS OF VOLUME 

                           Study I                    Study II 
Type of                            Noncon-                    Noncon- 
conservation        Conservers     servers     Conservers     servers 
First testing 
  Identity           37(80)a        9(20)       41(82)         9(18) 
  Equivalence        30(49)        31(51)       63(85)        11(15) 
Second testing 
  Identity           36(73)        13(27)       52(87)         8(13) 
  Equivalence        23(55)        19(45)       57(89)         7(11) 


a All numbers within parentheses are percentages. 


To test Hypothesis 2 and to analyze the instructional effects of testing, 
numerous χ² values were computed. These are displayed in Table 3. None of 
the computed values was significant at the .05 level. 

To test Hypothesis 3, the formulae of Eisenberg and Walbesser (2) were 
used. The subjects in Study II were classified with respect to two variables: 
(a) kind of conservation question asked in first session versus kind of 
question asked in second session, and (b) conserver (C) or nonconserver (N) 
in first session versus C or N in second session. Each subject was then 
categorized in one of the 4 × 4 possible combinations. The results are presented 
in Table 4. 

Consistency, completeness, and adequacy ratios were computed for all four 
sequences. These results are presented in Table 5. On the basis of .85 as 
acceptable (2, p. 249), all the ratios were acceptable except the adequacy 
ratio for the identity-equivalence sequence. 
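
The three ratios can be recomputed directly from the cell counts, using the definitions given in the column heads of Table 5. The sketch below uses the Study II counts from Table 4; two cells that are illegible in the scanned table are filled with the values implied by the marginal totals in Table 1, and the variable and function names are illustrative assumptions.

```python
# Study II counts per question sequence, in the order (C,C), (C,N), (N,C), (N,N).
# Two cells illegible in the scan are inferred from the Table 1 totals.
TABLE_4_COUNTS = {
    "Identity-Identity": (25, 1, 2, 2),
    "Equivalence-Identity": (25, 2, 0, 3),
    "Identity-Equivalence": (14, 1, 3, 2),
    "Equivalence-Equivalence": (35, 1, 5, 3),
}
ACCEPTABLE = 0.85  # criterion cited from Eisenberg and Walbesser (2, p. 249)


def hierarchy_ratios(cc, cn, nc, nn):
    """Consistency, completeness, and adequacy as defined in Table 5."""
    return {
        "consistency": cc / (cc + cn),
        "completeness": cc / (cc + nn),
        "adequacy": cc / (cc + nc),
    }


RATIOS = {seq: hierarchy_ratios(*counts) for seq, counts in TABLE_4_COUNTS.items()}

for sequence, ratios in RATIOS.items():
    summary = ", ".join(
        f"{name} {value:.3f}{'' if value >= ACCEPTABLE else ' (below criterion)'}"
        for name, value in ratios.items()
    )
    print(f"{sequence}: {summary}")
```

Only the adequacy ratio for the identity-equivalence sequence falls below the .85 criterion in this recomputation, which matches the result stated in the text.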




D. Discussion and Conclusions 


Although the subjects within each study were randomly divided, the 
samples across the studies are nonequivalent. Prospective elementary school 
TABLE 2 
NUMBERS (PERCENTAGE) OF CONSERVERS AND NONCONSERVERS AT THE SECOND 
TESTING, SEPARATED BY TREATMENT GROUPS 

Type of 
conservation measured               Study I                  Study II 
First          Second                       Noncon-                  Noncon- 
testing        testing       Conservers     servers   Conservers     servers 
Identity       Identity       17(77)a        5(23)     27(90)         3(10) 
Equivalence    Identity       19(70)         8(30)     25(83)         5(17) 
Identity       Equivalence    10(59)         7(41)     17(85)         3(15) 
Equivalence    Equivalence    13(52)        12(48)     40(91)         4(9) 


a All numbers within parentheses are percentages. 
TABLE 4 
TABULATION OF STUDY II SUBJECTS IN THE 16 POSSIBLE CATEGORIES 

                           Conservation ability (first testing, second testing) 
Question sequence            (C, C)    (C, N)    (N, C)    (N, N) 
Identity-Identity              25         1         2         2 
Equivalence-Identity           25         2         0         3 
Identity-Equivalence           14         1         3         2 
Equivalence-Equivalence        35         1         5         3 


Note: C = conserver; N = nonconserver. 


teachers at Northern Illinois University are scheduled in their classes in 
blocks of 35-40 people. Normally, four blocks are scheduled to each section 
of methods of teaching mathematics. The assigning of students to blocks is 
not random, though the exact criteria used are not known. Hence, no analysis 
was made across studies. 

The most striking difference between the results of the two studies is 
that a larger percentage of subjects were equivalence conservers of volume in 
Study II than in Study I. Although two different people administered the 
questions in these studies, the script was written out ahead of time and was 
followed closely. The experimenter believes that the difference in 
administrators was not a critical factor. It seems more reasonable to explain the 
different results in terms of possible sample differences. 

The differences in distributions of identity and equivalence conservers and 
nonconservers in Study I support Hypothesis 1 (p < .01), while Study II 
lends no support to the hypothesis. This is explainable in terms of the large 
percentage of conservers in Study II as measured by both kinds of 
conservation questions. 

The χ² values displayed in Table 3 support the hypothesis that Piagetian 
testing does not induce any instructional effects among prospective 
elementary school teachers. This result suggests that test-treatment-retest procedures 
to determine treatment effects on conservation skills are appropriate for 
college-age subjects. Additive effects of the testing should not be a problem. 


TABLE 5 
CONSISTENCY, COMPLETENESS, AND ADEQUACY RATIOS: STUDY II 

                            Consistency         Completeness        Adequacy 
                            #(C,C)              #(C,C)              #(C,C) 
                            ---------------     ---------------     --------------- 
Question sequence           #(C,C) + #(C,N)     #(C,C) + #(N,N)     #(C,C) + #(N,C) 
Identity-Identity             .962                .926                .926 
Equivalence-Identity          .926                .893               1.000 
Identity-Equivalence a        .933                .875                .824 
Equivalence-Equivalence       .972                .921                .875 


Note: C = conserver; N = nonconserver. Ratios are based on number of subjects 
in each category of Table 4. 

a The entries in this row are appropriate statistics to test Hypothesis 3. 

The results displayed in Table 5 support Hypothesis 3. The identity- 
equivalence sequence is a consistent, complete, but not adequate hierarchy. 
Further, each of the other sequences is consistent, complete, and adequate. 
This is meaningful evidence supporting the claim of Elkind (5) that identity 
precedes equivalence conservation. 

The classification of subjects in Study I with respect to the four sequences 
of questions was made on the basis of the subjects’ reports of which demon- 
stration they experienced first. In Study II a check of these self-reports was 
possible because the subjects signed their answer sheets. Of 124 subjects 
in Study II, only two reported their initial experience incorrectly. 

In conclusion, the data support the contention that Piaget-type questions 
do not in themselves enhance the conservation skills of prospective elementary 
school teachers. Further, the support of Hypothesis 1 in Study I and the 
support of Hypothesis 3 in Study II indicate that Elkind’s analysis of identity 
and equivalence conservation tasks is basically correct. Future studies should 
investigate these findings across age-levels. 


E. Summary 


Two studies were conducted to test Elkind’s classification of conservation 
skills and to determine the effect of repeated measurement of Piagetian con- 
servation skills. The procedures of Elkind (3) were modified, with additional 
questions constructed for the specific setting of these studies. Subjects were 
prospective elementary school teachers at Northern Illinois University. Ques- 
tions were administered on consecutive class days in mathematics methods 
classes. The data supported the following conclusions. Conservation of 
identity was attained by more subjects than conservation of equivalence. The 
procedures used to measure attainment of conservation skills did not affect 
subjects’ responses in a retest with identical questions. Attainment of 
conservation of identity seemed to precede attainment of conservation of 
equivalence. 


APPENDIX A 
QUESTIONS TO ASSESS CONSERVATION OF IDENTITY 


I have a ball of clay. There are no tricks intended in what I shall do; 
I am simply repeating an experiment which was originally performed with 
elementary school children. Would anyone like to examine the clay? [PAUSE] 
Question 1. Does the amount of clay in the ball change if I pick it up 
and move it? 

Question 2. Suppose I make the ball into a sausage shape. Would it then 
contain the same amount of clay? [PAUSE] [DO SO] 

Question 3. Does the piece of clay contain the same amount of clay as 
before? 

Question 4. Explain your answers. 


APPENDIX B 
QUESTIONS TO ASSESS CONSERVATION OF EQUIVALENCE 


I have two balls of clay that are identical in every respect except their 
colors. There are no tricks intended; I am simply repeating an experiment 
which was originally performed with elementary school children. Anyone 
who doubts that the balls are the same should examine them himself. Would 
anyone like to do this? [PAUSE] 

Question 1. Do the balls contain the same amount of clay? 

Question 2. Suppose I make one of the balls into a sausage shape. Would 
the two pieces of clay still contain the same amount of clay? [PAUSE] [DO SO] 

Question 3. Do they both contain the same amount of clay now? 

Question 4. Explain your answers. 
VIGILANCE AND AROUSAL: EFFECTS OF DIFFERENT 
TYPES OF BACKGROUND STIMULATION* * 


The University of British Columbia, Canada 


W. G. Davenport 


A. Introduction 


A number of theoretical models of vigilance behavior have evolved, amongst 
these being the arousal hypothesis. This states, basically, that sensory 
stimulation is critical to the maintenance of observer alertness, and several different 
theoretical positions have been established. McGrath (9) outlined the difficulty 
arousal hypothesis of vigilance such that there existed the traditional V-shaped 
relationship between arousal and vigilance performance. Arousal was 
considered to increase monotonically both with sensory variety and task difficulty. 
Buckner (4) emphasized the effect of signal detectability and its influence on 
arousal in the early stages of the watchkeeping task; signal detectability was 
considered a product of two factors, the physical properties of the detection 
stimulus and the sensory acuity of the observer. In turn it governed two 
things, the initial arousal level of the observer and the baseline performance 
level during the task. Bergum (1) presented a framework of an expanded 
activation level concept in which a taxonomy of task characteristics was 
described with which the contribution of task parameters to CNS stimulation 
could be assessed. 

One basic fault of arousal theory is that the stimulus conditions controlling 
arousal in the vigilance situation have not been adequately defined. Irrelevant 
stimulation, such as music, speech, or miscellaneous auditory stimuli, has 
been used in vigilance studies to augment the level of sensory variety in the 
task, and to the extent that these procedures effectively introduced variations 
into the sensory environment of the vigilance observer during the watch, the 
arousal hypothesis would predict an increase in the efficiency of task perfor- 
mance. Evidence supporting this hypothesis has come from several sources. 
McGrath (9) found performance to be better on a one-hour task when a 
variety audio tape (music, speech, machine noise, and other sounds) was 
played than when constant intensity broadband noise was presented. Addi- 
tional evidence is available indicating the effectiveness of varied auditory 
stimulation in enhancing vigilance performance (7, 8, 10, 15, 16). In all 
cases performance was improved by the addition of the background stimulus. 
In contrast to these results with variable background noise, constant intensity, 
continuous broadband noise has been found to be of either little benefit or 
occasionally detrimental to performance in vigilance tasks, particularly during 
the latter stages of the watch (2, 3, 5, 6, 12, 13). 

Studies employing background music generally have given results in the 
same direction as those with varied background noise. Tarriere and Wisner 
(13) found that visual vigilance performance was significantly better with 
continuous music as a background stimulus than with motor noise or silence. 
Ware, Kowal, and Baker (14) found visual vigilance performance significantly 
better when local radio broadcasts were employed as a background 
than when subjects monitored in silence. Wokoun (18) found a large difference 
in visual vigilance performance favoring background music over noise 
on the first of two monitoring sessions, the result being indeterminable in 
the second session. In contrast, Poock and Wiener (11) found two music 
conditions to be of no benefit on a visual task as compared with a white noise 
background. No completely satisfactory explanation of this discrepancy in 
results has been established. 


the literature, by directly comparing a particular ob: 


visual vigilance task while employing a range of background stimulation 
within the same experiment. It was hypothesized on the basis of arousal theory 
and results of past studies that a subject’s correct detection performance and 
response latency would improve significantly in the sequence from silence, 
broadband noise, continuous music, to random music. 


B. Method 
1. Subjects 


Twenty-four male and 24 female subjects were selected from introductory 
psychology class lists at The University of British Columbia, Vancouver. 


2. Procedure 


Subjects responded to a signal which was a .50 second increase in amplitude 
(.05 volts added to baseline of .20 volts/cm; 5 cm basic amplitude) of a 
sine-wave pattern presented on an oscilloscope, by pressing a response key as 
quickly as possible whenever they thought they detected a signal. The total 
watch-period was four hours, and during this time each 60 minute session was 
accompanied by one of the four irrelevant background conditions. Additionally, 
each 60 minute session was divided, without Ss’ knowledge, into two identical 
30 minute subperiods in each of which 25 signals were presented. Intersignal 
intervals were 62, 40, 51, 142, 45, 24, 71, 56, 99, 35, 111, 19, 72, 118, 27, 
76, 63, 139, 93, 100, 50, 47, 129, 90, 41 seconds, the order being randomized 
for each subject in each 30 minute session. Subjects were randomly assigned to 
the vigilance task in one of the 24 possible counterbalanced orders of the four 
background conditions. 
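
The counterbalancing and signal schedule described above can be checked with a short sketch. The 24 orders are simply the 4! permutations of the four backgrounds, and the 25 listed intersignal intervals sum to 1800 seconds, i.e., one 30 minute subperiod. The function names, the seeded random generator, and the equal allocation of two subjects per order are assumptions made for illustration; the article says only that assignment was random.

```python
from itertools import permutations
import random

CONDITIONS = ("silence", "broadband noise", "continuous music", "random music")

# Intersignal intervals (seconds) as listed in the Procedure section.
INTERVALS_S = [62, 40, 51, 142, 45, 24, 71, 56, 99, 35, 111, 19, 72, 118, 27,
               76, 63, 139, 93, 100, 50, 47, 129, 90, 41]

# All possible orders of the four background conditions: 4! = 24.
ORDERS = list(permutations(CONDITIONS))


def subperiod_schedule(rng):
    """Return the 25 intervals in a fresh random order for one subperiod."""
    shuffled = INTERVALS_S[:]
    rng.shuffle(shuffled)
    return shuffled


def assign_orders(n_subjects, rng):
    """Hypothetical allocation: use each order equally often, then shuffle."""
    pool = [ORDERS[i % len(ORDERS)] for i in range(n_subjects)]
    rng.shuffle(pool)
    return pool


if __name__ == "__main__":
    rng = random.Random(0)  # seed chosen only to make the sketch repeatable
    print(len(ORDERS))                         # number of counterbalanced orders
    print(len(INTERVALS_S), sum(INTERVALS_S))  # signals per subperiod, total seconds
    print(assign_orders(48, rng)[0])           # order given to the first subject
```

With 48 subjects and 24 orders, an equal allocation places two subjects in each order, which is one plausible reading of the design.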

Subjects were told they would receive a variety of stimuli over stereo head- 
phones (Sharpe HA-660 PRO) from a Ferrograph Tape Recorder (Series 
7, Model 724-A). All Ss’ responses were recorded automatically on an 
electronic counter/timer (Monsanto Model 101-B) and printed on a digital 
printer (Monsanto Model 510-A). Each of the background conditions was 
presented on a prerecorded tape, the music being stereo instrumental music, 
recorded from background music tapes supplied by an FM-stereo station. 
The sound intensity at the headphones for the broadband noise, continuous 
music, and random music conditions was measured by means of a Bruel and 
Kjaer Sound Level Meter (Model 2203) fitted with an Artificial Ear (Type 
4152) and was set at 60 dB SPL. This was judged a “comfortable” level by all 
Ss. Each 30 minute subperiod in the random music condition consisted of thirty 
30 second music intervals and thirty 30 second intervals of silence arranged 
in random order. All Ss were given a 15 minute practice session in which 12 
signals were presented with no background stimulation. Ss performed the 
experiment in a quiet, well-lit laboratory and were visible to the experimenter 
through a one-way screen. 
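
The random music condition described above (thirty 30 second music intervals and thirty 30 second silent intervals in random order) can be sketched as follows. The function name, the segment labels, and the seeded generator are illustrative assumptions; the article does not say how the random order was produced.

```python
import random


def random_music_schedule(rng, n_music=30, n_silence=30, segment_s=30):
    """Return (start_time_s, label) pairs for one 30 minute subperiod."""
    segments = ["music"] * n_music + ["silence"] * n_silence
    rng.shuffle(segments)  # random arrangement of music and silent intervals
    return [(i * segment_s, label) for i, label in enumerate(segments)]


if __name__ == "__main__":
    schedule = random_music_schedule(random.Random(0))
    total_s = len(schedule) * 30
    print(len(schedule), total_s)  # 60 segments covering 1800 seconds
    print(schedule[:4])            # first four segments of this hypothetical tape
```

Sixty 30 second segments fill the 1800 second subperiod exactly, so music is present for half of the listening time in this condition.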


C. Results 


A 4 X 2 X 2 repeated measures analysis of variance (AOV) was computed 
(17) for each of the two dependent variables: (a) number of correct 
detections and (b) response latency. Results for each of the dependent variables 
are discussed separately below. False detections were scored in the study, 
but were of such low frequency over all conditions that it was considered that 
if used as a dependent variable, results would be of doubtful validity. 
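
The degrees of freedom reported for the omnibus tests follow from the design if sex is treated as a between-subjects factor and background condition and time interval as within-subject factors, each tested against its own subject-by-effect error term. That partitioning is an assumption inferred from the reported values, and the function below is only a sketch of the arithmetic.

```python
def repeated_measures_df(n_subjects=48, n_sex=2, n_background=4, n_time=2):
    """Return (effect df, error df) pairs for the assumed mixed design."""
    error_between = n_subjects - n_sex  # subjects within sex groups
    bg = n_background - 1
    tm = n_time - 1
    return {
        "sex": (n_sex - 1, error_between),
        "background": (bg, bg * error_between),
        "time": (tm, tm * error_between),
        "background x time": (bg * tm, bg * tm * error_between),
    }


if __name__ == "__main__":
    for effect, df in repeated_measures_df().items():
        print(effect, df)
```

Under these assumptions the sketch gives 3 and 138 for background conditions and for the interaction, and 1 and 46 for time intervals and for sex.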


1. Number of Correct Detections 


The frequency of correct detections was transformed with use of an arc-sine 
technique. The AOV indicated significant effects for the background stimulation 
conditions (F = 37.75; df = 3, 138; p < .01), time intervals (F = 42.54; 
df = 1, 46; p < .01), sex (F = 4.54; df = 1, 46; p < .05), and the 
interaction between time intervals and background stimulation conditions 
(F = 7.15; df = 3, 138; p < .01). Scheffé tests calculated on the four levels 
of the background stimulation factor showed that for males the random music 
background was significantly more arousing than the continuous music (F = 
10.76; df = 3, 188; p < .05), broadband noise (F = 42.71; df = 3, 188; 
p < .01), and silence (F = 24.13; df = 3, 188; p < .01), while the continuous 
music condition was superior to the broadband noise background 
(F = 10.60; df = 3, 188; p < .05). For female Ss the random music background 
was significantly more arousing than the continuous music (F = 
15.35; df = 3, 188; p < .01), broadband noise (F = 40.59; df = 3, 188; 
p < .01), and silence conditions (F = 14.36; df = 3, 188; p < .01). 
Table 1 shows the means and standard deviations of the frequency of 
correct detections over time intervals for both males and females in each 
background condition. 
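
The article does not state which form of the arc-sine technique was used. A minimal sketch of one common variant, the angle whose sine is the square root of the proportion detected, expressed in degrees, is given below. The function name, the degree scaling, and the use of 25 signals per subperiod as the denominator are all assumptions.

```python
import math


def arcsine_transform(hits, n_signals=25, degrees=True):
    """Arc-sine square-root transform of a detection proportion (assumed form)."""
    p = hits / n_signals
    angle = math.asin(math.sqrt(p))  # radians, between 0 and pi/2
    return math.degrees(angle) if degrees else angle


if __name__ == "__main__":
    for hits in (5, 15, 25):
        print(hits, round(arcsine_transform(hits), 2))
```

The transform stretches proportions near 0 and 1, which makes the variance of proportion scores more nearly uniform before an analysis of variance.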

2. Response Latency 

The analysis using response latency indicated significant effects for background 
stimulation conditions (F = 20.35; df = 3, 138; p < .01), time 
intervals (F = 17.60; df = 1, 46; p < .01), sex (F = 4.63; df = 1, 46; 
p < .05), and the interaction between time intervals and noise conditions 
(F = 5.87; df = 3, 138; p < .01). Scheffé tests calculated on the four levels 
of the background stimulation factor showed that for male Ss the random 
music background produced significantly shorter response latencies than did 


TABLE 1 
MEANS AND STANDARD DEVIATIONS OF FREQUENCY OF CORRECT DETECTIONS* 
OVER TIME INTERVALS FOR MALES AND FEMALES 
IN EACH BACKGROUND CONDITION 

                                 Broadband       Continuous 
                  Silence          noise            music       Random music 
Time interval    M      F        M      F        M      F        M      F 
1 
  Mean         56.09  54.74    52.15  49.95    58.52  54.11    62.84  59.36 
  SD           12.47   8.71    10.16  11.13    12.26  11.78    10.43  11.56 
2 
  Mean         48.98  46.35    46.30  40.68    53.17  46.46    62.19  57.03 
  SD            9.10   6.19     7.24   6.66     7.21   8.55     6.81   7.68 

* Frequency of correct detections subject to arc-sine transformation. 


the broadband noise (F = 37.27; df = 3, 188; p < .01) or the silence condition 
(F = 19.46; df = 3, 188; p < .01), while the continuous music was 
superior to the broadband noise (F = 11.53; df = 3, 188; p < .05). For 
female Ss the random music background produced significantly shorter response 
latencies than either the continuous music condition (F = 9.49; df = 
3, 188; p < .05) or the broadband noise condition (F = 26.01; df = 3, 188; 
p < .01), while the silence condition was significantly better than the broadband 
noise (F = 9.03; df = 3, 188; p < .05). 

Table 2 shows the means and standard deviations of the response latencies 
over time intervals for both males and females in each background condition. 


D. Discussion 


These results showed general support for the arousal hypothesis, but did 
not completely confirm the hypothesis proposed. The randomized music 


TABLE 2 
MEANS AND STANDARD DEVIATIONS OF RESPONSE LATENCIES OVER TIME INTERVALS 
FOR MALES AND FEMALES IN EACH BACKGROUND CONDITION 

[Table values illegible in source] 
condition maintained performance on the visual vigilance task at a significantly 
higher level than did any of the other background conditions, there being 
very little difference between the silence and continuous music conditions, 
while the broadband noise condition was generally the least effective back- 
ground. The direction of these findings was not in agreement with the results 
of most studies employing background music in vigilance studies, in that 
continuous music did not always maintain vigilance performance at a higher 
level than did the silence condition; they were more compatible with those of 
Poock and Wiener (11), and on comparison it seemed probable that the tapes 
used in the present study and those employed by Poock and Wiener (11) were 
more homogeneous than in other studies. Thus the heterogeneity of the music 
background presented appeared to be one important factor in extending task 


initial arousal level was high, the uncertainty of reinforcement attached to the 
random music condition would maintain the baseline performance level during 
the task, while more predictable backgrounds (silence, broadband noise, continuous 
music) would not serve this function as efficiently. 

ducted to detail the most efficient way of selecting and scheduling types of 
background stimulation, in order to optimize the efficiency of the observer in 
a vigilance task. The temporal scheduling and specific type of background 
music applied appear to be significant features contributing to improved 
monitoring, and these findings combined with future research may help to 
explain some of the conflicting results in past studies in this area of vigilance. 


E. Summary 


The effect on the efficiency of visual vigilance performance of having one of 
four different types of irrelevant background stimulation (silence, broadband 
noise, continuous music, music randomized with respect to temporal presenta- 
tion) present during the watch was evaluated. Results showed significantly 
different correct detection rates and response latencies for signals presented 
with the different backgrounds, with the random music background always 
being superior, followed by the continuous music condition, with little differ- 
ence between the other two background conditions. Male Ss started the study 
at a higher performance level for both dependent variables and showed less 
performance decrement over time than did female Ss. All background condi- 
tions except randomized music resulted in significant performance decrements 
over time. The results suggest that a patterned stimulus (music) was superior 
to other types (broadband noise, silence) of background stimulation and that 
both the presentation schedule of the music and the type of music presented 
to the individual S were significant features affecting the efficiency of monitor- 
ing performance. Suggestions were made for future research. 

MORE COMMENTS ON 
“THE PROJECTED BRAIN ILLUSION”* 


Augusta, Georgia 


Frank M. du Mas 


Stern (2) has rendered a scientific service by pointing out the probable 
priority of Helmholtz and Purkinje in their partial observation of the pro- 
jected brain illusion (PBI). This phenomenon was described and discussed 
in some detail by du Mas and du Mas (1). 

Many questions remain. The answers to some of the following questions 
may be obvious, but some are not. Why is it that while many sensations and 
perceptions, such as taste, smell, organic, and even sound, may be perceptually 
located deep within the head, PBI is located outside the body? Why is it 
observed to be in the particular position that it is? Why does it have the 
size that it has? Why is the shape approximately oval? Why are the ends 
of the black lines pointed? Since the intervening space between the black 
lines is beige grey in color and has a symmetric grainy appearance quite 
different from so-called “brain grey,” could it be that we are observing our 
own retinas as well as the vascular system? Why is it that even with such 
high contrast, sharply defined figure and ground relationships, adaptation is 
so fast? Is movement of the eye actually necessary to observe PBI? What 
is the basis for the tremendous magnification effects in PBI? And although 
we, too, (1, pp. 116, 117) suggested the possibility that PBI is some kind 
of observation of the retina, is this really so? 

Even after Purkinje, Helmholtz, du Mas and du Mas, and Stern, PBI 
teems with inadequately answered questions regarding location, relative posi- 
tion, size, color, shape, movement, magnification, and what is being observed. 

Since this is a fruitful area for future observation and experimentation, 
I should like to suggest a novel experimental design which might permit the 
utilization of facilitating flicker effects with a quasi-moving but motionless 
stimulus observed by a stationary eye. A group of motionless discrete electrical 
light bulbs could be imbedded in a flat black wall in the form of a circle or 
line. Preliminary research could establish the optimum duration, intensity, and 
color of the light sources, as well as the optimizing interval between the time 
one light goes out and another light goes on. With these parameters built 
into the stimulus apparatus it might be possible that PBI could be subjected 
to sustained observation under laboratory conditions. 


Summary 


PBI teems with unanswered questions regarding location, relative position, 
size, color, shape, movement, magnification, and what, precisely, is being 
observed. A novel experimental design is suggested which might permit sus- 
tained observation of PBI with use of facilitating flicker with a quasi-moving 
but motionless stimulus and a stationary eye. 

VISUAL EXPLORATORY BEHAVIOR AS A FUNCTION 
OF MANIFEST ANXIETY* 


Indiana University of Pennsylvania 


Gary W. R. Patton and Cathy A. Kotrick 


A. INTRODUCTION 


Berlyne and Lewis (1) found that artificially induced increases in anxiety 
affect an S’s visual exploratory behavior in two ways: (a) he shuns novel 
stimuli more than he normally would and (b) the length of time he 
voluntarily spends looking at visual stimuli increases. Two of three other 
experiments where hypothesis (a) was also tested by artificially increasing 
anxiety confirmed this result. Haywood (3) found that anticipation of a test 
based on a difficult tape-recorded lecture decreased Ss’ tendency to select in- 
congruous slides rather than familiar ones. Similarly, Skrzypek (7) found that 
exposure of neurotic delinquents to a difficult discrimination task, where correct 
choices resulted in monetary rewards and incorrect ones produced bursts of 
unpleasant noise and loss of reward money, later decreased their preference 
for both complex and novel stimuli, though changes in the latter measure were 
nonsignificant. In contrast, Haywood and Hunt (4) found that injections of 
epinephrine did not reduce Ss’ preference for novel slides. 

One of two experiments where hypothesis (a) was tested by examining 
manifest, rather than induced, differences in anxiety also supported it. 
Skrzypek (7) showed that psychopathic delinquents preferred greater stimulus 
novelty and complexity than neurotic delinquents, but Haywood (3) was 
unable to find comparable differences among high, medium, and low anxious 
college students. Thus, on the whole, hypothesis (a) has received substantial, 
though not unanimous, support from experimental studies. 

Hypothesis (b) has been tested less frequently and less successfully (3), 
and, in particular, no one, including Berlyne and Lewis (1), has yet demon- 
strated that duration of visual inspection varies as a function of differences in 
manifest anxiety. Accordingly, in the present experiment, the following 
hypothesis was proposed: If undergraduate Ss are categorized as high, medium, 
or low anxious by means of Welsh’s (8) Anxiety Index (AI), then high 
anxious Ss will spend a significantly longer time viewing 20 slides than either 

the organism investigates novel stimuli to extract new information from them 
regarding the benefits and hazards of his environment. Under threatening 
conditions, this response tendency may become maladaptive. When an organism 
is endangered, safety dictates that it avoid strange stimuli and carefully inspect 
stimuli already within view. 

Another, less teleological, explanation of increased viewing time is also 
possible. Berlyne and Lewis (1) note that in the type of task used in the 
present experiment, the subject tries to prolong viewing time until he has 
received enough information to reduce uncertainty and conflict, and thereby 
his curiosity, to a threshold value. In addition, Easterbrook (2) states that 
the total amount of information transferred through the actions of an animal 
falls as drive level is increased. In other words, channel capacity is reduced in 
emotion. Thus, according to this interpretation, because his ability to process 
visual input is reduced, the anxious subject must scan the stimulus for a longer 
time before he has sufficient information to satisfy his curiosity. 

Previously cited studies, where exploration was purely visual, conform 
with data from experiments where exploration was both visual and tactile or 


general feature of human behavior, 
However, Mendel’s finding was a byp: 
data were, by his own admission, 

duration of inspection of novel visual stimuli. Related experiments employing 
artificially induced increases in anxiety suggested the hypothesis that increases 
in manifest anxiety would be associated with longer periods of inspection. 

To test this hypothesis, the MMPI was administered to 68 undergraduate 
Ss. After elimination of eight Ss who exceeded the desired limits on the 
validity scales, the remaining 60 were ranked in terms of manifest anxiety with 
use of a derived scale, Welsh’s (8) Anxiety Index. The 20 Ss with the highest 
scores were assigned to the high anxious group; the middle 20 to the medium 
anxious group; and the lowest 20 to the low anxious group. In a second part 
of the experiment, each S individually viewed a series of 20 slides, while the 
E unobtrusively recorded his viewing time. As predicted, viewing times were 
longer for high anxious Ss than medium anxious Ss, and longer for medium 
anxious Ss than low anxious Ss; however, only the difference between the high 
and low anxious Ss was statistically significant. 
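
The grouping step described above, ranking the 60 retained Ss on Welsh's Anxiety Index and splitting them into three groups of 20, can be sketched as follows. The subject identifiers and scores are hypothetical, and the article does not say how tied scores were handled, so the sketch simply keeps the order produced by a stable sort.

```python
def split_into_anxiety_groups(scores, group_size=20):
    """Rank subjects by AI score (highest first) and cut into three groups."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    if len(ranked) != 3 * group_size:
        raise ValueError("expected exactly three full groups")
    return {
        "high": ranked[:group_size],
        "medium": ranked[group_size:2 * group_size],
        "low": ranked[2 * group_size:],
    }


if __name__ == "__main__":
    # Hypothetical AI scores for 60 subjects labelled S01 to S60.
    demo_scores = {f"S{i:02d}": 30 + (i * 7) % 61 for i in range(1, 61)}
    groups = split_into_anxiety_groups(demo_scores)
    print({name: len(members) for name, members in groups.items()})
```

Because the groups are formed from ranks and not from fixed cutoff scores, the boundaries between high, medium, and low anxiety depend on the particular sample tested.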

On the whole, the data were in line with previous research suggesting that 
an S who is either manifestly anxious or has an anxiety state induced artificially 
will display less preference for novel visual stimuli and will take more time to 
inspect stimuli already within view. Explanations of this response pattern 
and its likely expression via the tactile, as well as the visual, sense were also 
discussed. 